How do I remove repeating substring from a list of string?

Question

I would like to remove the repeated substring from a list of strings. *Assuming that the repeated substring is different for each list.

Example:

lst = ['State your favorite fruit: Apple', 'State your favorite fruit: Orange', 'State your favorite fruit: Grapes']

Desired outcome:

final_lst = ['Apple', 'Orange', 'Grapes']

Edit: Sorry if my initial question was not clear. I hope to find the unique words from each list of strings.

lst1 = ['This is a bag', 'This is a cat', 'This is a dog']
lst2 = ['Favorite drink: Cola', 'Favorite drink: Sprite']
lst3 = ['My name is James', 'My name is Mary Jane', 'My name is Lopez']

Desired output:

final_lst1 = ['bag', 'cat', 'dog']
final_lst2 = ['Cola', 'Sprite']
final_lst3 = ['James', 'Mary Jane', 'Lopez']

A better way to phrase this might be to say that you want to extract all fruit names. What is the exact problem statement here? — Tim Biegeleisen
– Tim Biegeleisen, Commented Oct 17, 2021 at 10:35
Do you know the repeated substring beforehand, e. g. is it defined in a variable? Or must it be detected from the list items? — Michael Butscher
– Michael Butscher, Commented Oct 17, 2021 at 10:40
@MichaelButscher Hi, the repeated substring must be detected from the list items. Thanks! — Boon
– Boon, Commented Oct 17, 2021 at 11:14

A S Adithiyaa · Accepted Answer · 2021-10-17 10:56:56Z

There might be certain other ways to do this but this below one works just fine for the purpose

So, your list is as below:

lst = ['State your favorite fruit: Apple', 'State your favorite fruit: Orange', 'State your favorite fruit: Grapes']

Now, separate all the words in to a new list

seperate_words = (" ".join(lst)).split(" ") #First we join all the sentences of the list
# with a space in between using the "join" method of string.
#Then consequently splitting the list by a space

Finally, to get the unique words, use the list comprehension as below

unique_words = [word for word in seperate_words if seperate_words.count(word) == 1]

print(unique_words)

Output:['Apple', 'Orange', 'Grapes']

Regards

Here, you can separate out the unique words for any given list, i.e., For a list where the repeated substring is different for each list.

hi2meuk · Accepted Answer · 2021-10-17 10:51:53Z

0

This is a solution to the question asked:

final_lst = [s.replace('State your favorite fruit: ') for s in lst]

answered Oct 17, 2021 at 10:51

hi2meuk

2,08420 silver badges10 bronze badges

Comments

Maximilian Freitag · Accepted Answer · 2021-10-17 10:55:26Z

0

Just use list comprehension:

lst = ['State your favorite fruit: Apple', 'State your favorite fruit: Orange', 'State your favorite fruit: Grapes']

final_lst = [s.replace('State your favorite fruit: ', '') for s in lst]

print(final_lst)

Output:

['Apple', 'Cherry', 'Grapes']

answered Oct 17, 2021 at 10:55

Maximilian Freitag

1,0493 gold badges14 silver badges30 bronze badges

Comments

Hackaholic · Accepted Answer · 2021-10-17 10:56:24Z

0

you can split on ":" and get last index like below:

[x.split(":")[-1] for x in lst]

answered Oct 17, 2021 at 10:56

Hackaholic

19.8k6 gold badges59 silver badges77 bronze badges

Comments

Richard Römer · Accepted Answer · 2021-10-17 12:41:57Z

0

You can iterate trough the list and then take the last word from each string:

final_lst = [w.split(" ")[-1] for w in lst]

answered Oct 17, 2021 at 12:41

Richard Römer

11 bronze badge

Collectives™ on Stack Overflow

How do I remove repeating substring from a list of string?

5 Answers 5

1 Comment

Comments

Comments

Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

1 Comment

Comments

Comments

Comments

Comments

Related