Easy way of converting a string to lowercase in python

Question

I have a text as follows.

mytext = "This is AVGs_ABB and NMN_ABB and most importantly GFD_ABB This is so important that you have to CLEAN  the lab everyday"

I want to convert it to lowercase, except the words that has _ABB in it.

So, my output should look as follows.

    mytext = "this is AVGs_ABB and NMN_ABB and most importantly GFD_ABB this is so important that you have to clean  the lab everyday"

My current code is as follows.

splits = mytext.split()
newtext = []
for item in splits:
   if not '_ABB' in item:
        item = item.lower()
        newtext.append(item)
   else:
        newtext.append(item)

However, I want to know if there is any easy way of doing this, possibly in one line?

Sure, one could invent one-liners for this, but what exactly is wrong with your version? It is significantly more readable. — Kendas
– Kendas, Commented Dec 20, 2017 at 8:56
@Kendas I wouldn't say so; nobody would expect to see so much code for that task. Creates too many variables too — Ma0
– Ma0, Commented Dec 20, 2017 at 9:05

Chris_Rands · Accepted Answer · 2017-12-20 09:00:21Z

11

You can use a one liner splitting the string into words, check the words with str.endswith() and then join the words back together:

' '.join(w if w.endswith('_ABB') else w.lower() for w in mytext.split())
# 'this is AVGs_ABB and NMN_ABB and most importantly GFD_ABB this is so important that you have to clean the lab everyday'

Of course use the in operator rather than str.endswith() if '_ABB' can actually occur anywhere in the word and not just at the end.

edited Dec 20, 2017 at 9:00

answered Dec 20, 2017 at 8:57

Chris_Rands

41.7k15 gold badges92 silver badges126 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

LcdDrm Over a year ago

May use ... if '_ABB' in w, since the question asks for "words that has _ABB in it". It doesn't have to be at the end

Adelin Over a year ago

Still I'd write it as if '_ABB' in w, just so that it applies to what OP asked for words that has _ABB in it

RomanPerekhrest · Accepted Answer · 2017-12-20 09:33:53Z

Extended regex approach:

import  re

mytext = "This is AVGs_ABB and NMN_ABB and most importantly GFD_ABB This is so important that you have to CLEAN  the lab everyday"
result = re.sub(r'\b((?!_ABB)\S)+\b', lambda m: m.group().lower(), mytext)

print(result)

The output:

this is AVGs_ABB and NMN_ABB and most importantly GFD_ABB this is so important that you have to clean  the lab everyday

Details:

\b - word boundary
(?!_ABB) - lookahead negative assertion, ensures that the given pattern will not match
\S - non-whitespace character
\b((?!_ABB)\S)+\b - the whole pattern matches a word NOT containing substring _ABB

RoadRunner · Accepted Answer · 2017-12-20 09:35:12Z

Here is another possible(not elegant) one-liner:

mytext = "This is AVGs_ABB and NMN_ABB and most importantly GFD_ABB This is so important that you have to CLEAN  the lab everyday"

print(' '.join(map(lambda x : x if '_ABB' in x else x.lower(), mytext.split())))

Which Outputs:

this is AVGs_ABB and NMN_ABB and most importantly GFD_ABB this is so important that you have to clean the lab everyday

Note: This assumes that your text will only seperate the words by spaces, so split() suffices here. If your text includes punctuation such as",!.", you will need to use regex instead to split up the words.

Collectives™ on Stack Overflow

Easy way of converting a string to lowercase in python

3 Answers 3

2 Comments

Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

2 Comments

Comments

Comments

Related