Keep duplicates in a list in Python

Question

I know this probably has an easy answer, but I can't figure it out. What is the best way in Python to keep only the duplicates in a list?

x = [1,2,2,2,3,4,5,6,6,7]

The output should be:

[2,6]

I found this link: Find (and keep) duplicates of sublist in python, but I'm still relatively new to Python and I can't get it to work for a simple list.

Post the code you are having trouble with, otherwise this is a complete retread of that other question.
– Steven Rumbalski, Commented Apr 4, 2013 at 13:28
@StevenRumbalski: Not precisely, the other question is also flattening nested lists at the same time.
– MattH, Commented Apr 4, 2013 at 13:31

mgilson · Accepted Answer · 2013-04-04 13:55:51Z

16

I'd use a collections.Counter:

from collections import Counter
x = [1, 2, 2, 2, 3, 4, 5, 6, 6, 7]
counts = Counter(x)
output = [value for value, count in counts.items() if count > 1]

Here's another version that keeps the items in the order in which they were first duplicated. It only assumes that the sequence passed in contains hashable items, and it will work all the way back to whenever set and yield were introduced to the language.

def keep_dupes(iterable):
    seen = set()
    dupes = set()
    for x in iterable:
        if x in seen and x not in dupes:
            yield x
            dupes.add(x)
        else:
            seen.add(x)

print(list(keep_dupes([1, 2, 2, 2, 3, 4, 5, 6, 6, 7])))
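
The two versions above can disagree on ordering when the input is not sorted. Here is a small sketch that runs both on the same list. The sample input and the names `by_first_seen` and `by_first_dupe` are made up for illustration, and the Counter result assumes Python 3.7 or later, where dicts (and therefore Counter) keep insertion order.

```python
from collections import Counter

def keep_dupes(iterable):
    # Same generator as above: yield each value the first time it repeats.
    seen = set()
    dupes = set()
    for x in iterable:
        if x in seen and x not in dupes:
            yield x
            dupes.add(x)
        else:
            seen.add(x)

sample = [1, 2, 2, 1]  # hypothetical unsorted input

# Counter version: on Python 3.7+ items() follows first-occurrence order.
by_first_seen = [value for value, count in Counter(sample).items() if count > 1]

# Generator version: order in which each value was first duplicated.
by_first_dupe = list(keep_dupes(sample))

print(by_first_seen)  # [1, 2]
print(by_first_dupe)  # [2, 1]
```

Which ordering is "right" depends on what you need; for the sorted list in the question both give [2, 6].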

edited Apr 4, 2013 at 13:55

answered Apr 4, 2013 at 13:27

mgilson


11 Comments

Jochen Ritzel Over a year ago

However you lose the order of the elements in the output.

mgilson Over a year ago

Yep. There are a lot of situations where this isn't the best way to go. It also requires the input to be hashable... But it's O(n) even for unsorted lists, which is nice.

DSM Over a year ago

The shortest ordered variant I can think of offhand is [k for k in OrderedDict.fromkeys(x) if counts[k] > 1].

mgilson Over a year ago

@DSM -- Why do you need an OrderedDict there? Why not just [k for k in x if counts[k] > 1]? Actually, that's better than what I have. I'll update...

DSM Over a year ago

@mgilson: try it and see.. :^)
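
For anyone who would rather not try it, here is a quick sketch of what DSM appears to be hinting at. Iterating over x itself visits every occurrence, so each duplicate is emitted once per occurrence, whereas OrderedDict.fromkeys(x) visits each distinct value once, in first-seen order. The names `repeated` and `ordered` are only for illustration.

```python
from collections import Counter, OrderedDict

x = [1, 2, 2, 2, 3, 4, 5, 6, 6, 7]
counts = Counter(x)

# Iterating over x repeats each duplicate once per occurrence.
repeated = [k for k in x if counts[k] > 1]

# Iterating over the distinct keys (first-seen order) gives each duplicate once.
ordered = [k for k in OrderedDict.fromkeys(x) if counts[k] > 1]

print(repeated)  # [2, 2, 2, 6, 6]
print(ordered)   # [2, 6]
```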


Jochen Ritzel · Accepted Answer · 2013-04-04 13:27:45Z

10

This is a short way to do it if the list is sorted already:

from itertools import groupby

x = [1, 2, 2, 2, 3, 4, 5, 6, 6, 7]
# groupby yields one group per run of equal adjacent items
print([key for key, group in groupby(x) if len(list(group)) > 1])

answered Apr 4, 2013 at 13:27

Jochen Ritzel


2 Comments

mgilson Over a year ago

This will also work with Python 2.6, which mine won't (collections.Counter was only added in Python 2.7).

Jochen Ritzel Over a year ago

@luchosrock: No, groupby groups consecutive elements
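
Because groupby only groups consecutive elements, the one-liner in this answer misses duplicates that are not adjacent. A minimal sketch of the difference, using a made-up unsorted list and sorting it first as a workaround (this assumes the items can be ordered, and the sort costs O(n log n)):

```python
from itertools import groupby

unsorted_x = [2, 1, 2, 6, 3, 6, 6]  # hypothetical input with non-adjacent duplicates

# Without sorting, only runs of adjacent equal items are detected.
adjacent_only = [key for key, group in groupby(unsorted_x) if len(list(group)) > 1]

# Sorting first puts equal items next to each other.
all_dupes = [key for key, group in groupby(sorted(unsorted_x)) if len(list(group)) > 1]

print(adjacent_only)  # [6]
print(all_dupes)      # [2, 6]
```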

Ivan · Accepted Answer · 2023-08-24 07:31:29Z

3

A list comprehension combined with set() will do what you want:

>>> list(set([i for i in x if x.count(i) > 1]))

[2, 6]
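
Two caveats are worth a sketch: x.count(i) rescans the whole list for every element, so this approach is quadratic in the list length, and a set does not promise any particular order. If order matters, one possible variant (not part of the original answer, and relying on dicts keeping insertion order in Python 3.7+) is dict.fromkeys. The sample list below is hypothetical.

```python
x = [7, 6, 2, 2, 6, 1]  # hypothetical unsorted input

# dict.fromkeys drops repeats while keeping first-seen order (Python 3.7+).
dupes_in_order = list(dict.fromkeys(i for i in x if x.count(i) > 1))

print(dupes_in_order)  # [6, 2]
```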

edited Aug 24, 2023 at 7:31

answered Oct 20, 2022 at 10:14

Ivan


luchosrock · Accepted Answer · 2013-04-04 13:42:35Z

0

keepin' it simple:

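x = [1, 2, 2, 2, 3, 4, 5, 6, 6, 7]  # equal items must be adjacent, e.g. a sorted list
array2 = []
aux = object()  # sentinel for "previous item"; never equal to a list element
for i in x:
    if i == aux:
        array2.append(i)
    aux = i
# set() collapses the repeated hits (2 lands in array2 twice)
print(list(set(array2)))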

That should work

edited Apr 4, 2013 at 13:42

answered Apr 4, 2013 at 13:34

luchosrock


2 Comments

DSM Over a year ago

Won't that give [2,2,6]?

luchosrock Over a year ago

@DSM ahaha you're totally right, I edited my answer, Thanks :)

L Ken · Accepted Answer · 2020-03-06 13:07:37Z

0

This is not efficient, but if you just need the output you could try:

import numpy as np

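def check_for_repeat(check_list):
    # Work on a copy so the caller's list is not modified.
    temp_list = list(check_list)
    repeated_list = []

    for idx in range(len(temp_list)):
        elem = temp_list[idx]
        # Blank out this position, then look for another occurrence.
        temp_list[idx] = None

        if elem in temp_list:
            repeated_list.append(elem)

    # np.unique removes repeats and returns the values sorted.
    return list(np.unique(repeated_list))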

edited Mar 6, 2020 at 13:07

answered Mar 6, 2020 at 12:45

L Ken
