45

For example:

>>> x = [1, 1, 2, 'a', 'a', 3]
>>> unique(x)
[1, 2, 'a', 3]

Assume list elements are hashable.

Clarification: The result should keep the first duplicate in the list. For example, [1, 2, 3, 2, 3, 1] becomes [1, 2, 3].

3 Comments
  • 1
    Are we keeping the first of duplicates, or last, or somewhere in the middle? e.g., [1,2,3,2,3,1], does that become [1,2,3], or [2,3,1], or something else? Commented Sep 18, 2008 at 1:29
  • 1
    Benchmark and a clear answer here. Commented Sep 18, 2008 at 11:54
  • 2
How do I apply the Homework tag to something? When it says assume elements are hashable, your prof is asking you to put the entries in a hashtable; then it's easy to see if you've come across them before as you walk down the list. Commented Nov 12, 2008 at 0:06

27 Answers

33
def unique(items):
    found = set()
    keep = []

    for item in items:
        if item not in found:
            found.add(item)
            keep.append(item)
            
    return keep

print(unique([1, 1, 2, 'a', 'a', 3]))

8 Comments

set() is better than set([]).
In-place algorithms are faster. See james' and mine answers.
This is an old thread, but if you make the add() and append() methods local functions (put add = found.add and app = keep.append before the loop, and then use add(item) and app(item)), this becomes the fastest by far. The reason the dictionary version was faster is that it didn't require an attribute lookup for each add and append. Just my two cents.
If you put it into a list comprehension afterwards, you get another speed improvement. Taking all the changes together, speed nearly doubles. See my comparison further down this page.
@so.very.tired Because keep is a list, and checking for membership in a list takes on average linear time in the length of the list. Meanwhile, checking for membership in a set takes on average constant time (see this). Choosing the appropriate data structure makes all the difference for performance. In any case, this answer is outdated. Check out this question instead.
21

Using:

lst = [8, 8, 9, 9, 7, 15, 15, 2, 20, 13, 2, 24, 6, 11, 7, 12, 4, 10, 18, 13, 23, 11, 3, 11, 12, 10, 4, 5, 4, 22, 6, 3, 19, 14, 21, 11, 1, 5, 14, 8, 0, 1, 16, 5, 10, 13, 17, 1, 16, 17, 12, 6, 10, 0, 3, 9, 9, 3, 7, 7, 6, 6, 7, 5, 14, 18, 12, 19, 2, 8, 9, 0, 8, 4, 5]

And using the timeit module:

$ python -m timeit -s 'import uniquetest' 'uniquetest.etchasketch(uniquetest.lst)'

And so on for the various other functions (which I named after their posters), I have the following results (on my first generation Intel MacBook Pro):

Allen:                  14.6 µs per loop [1]
Terhorst:               26.6 µs per loop
Tarle:                  44.7 µs per loop
ctcherry:               44.8 µs per loop
Etchasketch 1 (short):  64.6 µs per loop
Schinckel:              65.0 µs per loop
Etchasketch 2:          71.6 µs per loop
Little:                 89.4 µs per loop
Tyler:                 179.0 µs per loop

[1] Note that Allen modifies the list in place – I believe this has skewed the time, in that the timeit module runs the code 100000 times and 99999 of them are with the dupe-less list.
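One way to keep in-place functions comparable would be to pass a fresh copy on every call, e.g. (a minimal sketch; dedupe_inplace is a placeholder for whichever function is being timed):

import timeit

# Placeholder in-place deduplicator; substitute the function under test.
def dedupe_inplace(lst):
    seen = set()
    lst[:] = [x for x in lst if x not in seen and not seen.add(x)]

lst = [8, 8, 9, 9, 7, 15, 15, 2, 20, 13, 2, 24]

# lst[:] hands each call a fresh copy, so run N cannot benefit from
# the deduplication already performed by runs 1..N-1.
print(timeit.timeit(lambda: dedupe_inplace(lst[:]), number=100000))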


Summary: Straight-forward implementation with sets wins over confusing one-liners :-)

4 Comments

james suggested a faster version. See stackoverflow.com/questions/89178/#91430
Bump for using OSX. ;-)
@jfs: Both james and Allen's versions mutate in-place, so unless the microbenchmark used accounts for that (e.g. by calling the functions using a fresh list each time and/or including no duplicates at all), the timings aren't comparable. The fastest solutions nowadays (from 3.6 onwards) are either the unique_everseen recipe from itertools (if you need to process each element as it is found) which is about 10% faster than Terhorst's solution, or, at 40-80% of the runtime of unique_everseen, list(dict.fromkeys(iterable)) (or lst[:] = dict.fromkeys(lst) to operate "in place").
@ShadowRanger: my microbenchmarks include f(lst[:]) (i.e., the copying happens on each call). Though Python has changed in 12+ years. I would not rely on the microbenchmarks results from so many years ago.
18

Update: on Python 3.7+:

>>> list(dict.fromkeys('abracadabra'))
['a', 'b', 'r', 'c', 'd']

old answer:

Here is the fastest solution so far (for the following input):

def del_dups(seq):
    seen = {}
    pos = 0
    for item in seq:
        if item not in seen:
            seen[item] = True
            seq[pos] = item
            pos += 1
    del seq[pos:]

lst = [8, 8, 9, 9, 7, 15, 15, 2, 20, 13, 2, 24, 6, 11, 7, 12, 4, 10, 18, 
       13, 23, 11, 3, 11, 12, 10, 4, 5, 4, 22, 6, 3, 19, 14, 21, 11, 1, 
       5, 14, 8, 0, 1, 16, 5, 10, 13, 17, 1, 16, 17, 12, 6, 10, 0, 3, 9, 
       9, 3, 7, 7, 6, 6, 7, 5, 14, 18, 12, 19, 2, 8, 9, 0, 8, 4, 5]
del_dups(lst)
print(lst)
# -> [8, 9, 7, 15, 2, 20, 13, 24, 6, 11, 12, 4, 10, 18, 23, 3, 5, 22, 19, 14, 
#     21, 1, 0, 16, 17]

Dictionary lookup is slightly faster than set lookup in Python 3.
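A quick way to check this claim on your own interpreter (a sketch; results vary across Python versions):

import timeit

setup = "s = set(range(1000)); d = dict.fromkeys(range(1000))"
print(timeit.timeit("500 in s", setup=setup))  # set membership
print(timeit.timeit("500 in d", setup=setup))  # dict key membership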

9 Comments

Could you explain why the Dictionary lookup is faster than a test for set membership in this case?
@Stephen Emslie: I don't know. It might be a benchmark artifact. Try it yourself. A pure speculation: dictionary is a fundamental data structure for CPython (namespaces, classes are/were implemented via dictionaries) therefore dicts are more tuned/optimized than sets.
Good timing, but wrong conclusion. The timing shows only that operator access such as d[k] = v is faster than method call access such as d.__setitem__(k, v) even if the latter has been pre-bound using d_setitem = d.__setitem__ and then timing d_setitem(k, v).
Using Python 3.4, I tried your test script; and the function that uses a set is consistently slightly faster.
@jfs: In 3.6+, this can be made much simpler and even faster by simplifying to just def del_dups(seq): seq[:] = dict.fromkeys(seq) (to modify in-place) or def del_dups(seq): return list(dict.fromkeys(seq)) to make a new copy without duplicates.
15

What's going to be fastest depends on what percentage of your list is duplicates. If it's nearly all duplicates, with few unique items, creating a new list will probably be faster. If it's mostly unique items, removing them from the original list (or a copy) will be faster.

Here's one for modifying the list in place:

def unique(items):
  seen = set()
  # Iterate backwards so deletions don't disturb upcoming indices
  # (use xrange on Python 2).
  for i in range(len(items)-1, -1, -1):
    it = items[i]
    if it in seen:
      del items[i]
    else:
      seen.add(it)

Iterating backwards over the indices ensures that removing items doesn't affect the iteration.
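A quick usage sketch; note that, as the comments below point out, this variant keeps the last occurrence of each duplicate rather than the first:

items = [1, 2, 1]
unique(items)
print(items)  # -> [2, 1]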

2 Comments

This gives different results from the other solutions (the OP didn't specify which is correct) as regards which duplicate to keep. This solution: [1, 2, 1] -> [2, 1]. Other solutions: [1, 2, 1] -> [1, 2].
I added a clarification about this in the question text.
10

This is the fastest in-place method I've found (assuming a large proportion of duplicates):

def unique(l):
    s = set(); n = 0
    for x in l:
        if x not in s: s.add(x); l[n] = x; n += 1
    del l[n:]

This is 10% faster than Allen's implementation, on which it is based (timed with timeit.repeat, JIT compiled by psyco). It keeps the first instance of any duplicate.

repton-infinity: I'd be interested if you could confirm my timings.

1 Comment

Dictionaries are slightly faster than sets. See my answer stackoverflow.com/questions/89178/#282589
7

Obligatory generator-based variation:

def unique(seq):
  seen = set()
  for x in seq:
    if x not in seen:
      seen.add(x)
      yield x
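Since this returns a generator, wrap it in list() when a materialized list is needed:

>>> list(unique([1, 1, 2, 'a', 'a', 3]))
[1, 2, 'a', 3]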

Comments

7

This may be the simplest way:

from collections import OrderedDict
list(OrderedDict.fromkeys(iterable))

As of Python 3.5, OrderedDict is implemented in C, so this is now the shortest, cleanest, and fastest.

2 Comments

Elegant, but unfortunately about an order of magnitude slower than the fastest solution, and surprisingly one of the slowest solutions in general. The OrderedDict seems to be a real performance killer.
Perhaps if OrderedSet ever becomes a builtin we'd have a very fast solution
5

Taken from http://www.peterbe.com/plog/uniqifiers-benchmark

def f5(seq, idfun=None):  
    # order preserving 
    if idfun is None: 
        def idfun(x): return x 
    seen = {} 
    result = [] 
    for item in seq: 
        marker = idfun(item) 
        # in old Python versions: 
        # if seen.has_key(marker) 
        # but in new ones: 
        if marker in seen: continue 
        seen[marker] = 1 
        result.append(item) 
    return result
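The idfun hook lets you deduplicate by a derived key; for example, case-insensitively:

>>> f5(['A', 'a', 'b', 'B'], idfun=str.lower)
['A', 'b']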

1 Comment

It is slower than the corresponding in-place version (at least for some inputs). See stackoverflow.com/questions/89178/#282589
5

One-liner:

new_list = reduce(lambda x,y: x+[y][:1-int(y in x)], my_list, [])
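The slice trick works because [y][:1-int(y in x)] evaluates to [y] when y is unseen (slice [:1]) and to [] when it is a duplicate (slice [:0]). Note that on Python 3, reduce lives in functools. A more readable equivalent (a sketch):

from functools import reduce  # builtin on Python 2, moved to functools in Python 3

new_list = reduce(lambda acc, y: acc if y in acc else acc + [y], my_list, [])

Both versions are O(n^2), since each step tests membership against the growing result list.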

Comments

4

An in-place one-liner for this:

>>> x = [1, 1, 2, 'a', 'a', 3]
>>> [ item for pos,item in enumerate(x) if x.index(item)==pos ]
[1, 2, 'a', 3]

3 Comments

Hi Mario, how does this work? Please explain. What I understood is that index returns only one value, so is that why it's unique?
The list.index(item) method returns the position of the first item found in the list, and comparing this with the actual position of the item (from enumerate) we can therefore tell whether that item is the first occurring or not, keeping only the first occurring.
Very nice solution. But is it really in-place? When you use list comprehension, isn't a new list being created?
4

This is the fastest one, comparing all the stuff from this lengthy discussion and the other answers given here, referring to this benchmark. It's another 25% faster than the fastest function from the discussion, f8. Thanks to David Kirby for the idea.

def uniquify(seq):
    seen = set()
    seen_add = seen.add  # local binding avoids an attribute lookup per item
    # seen_add(x) returns None, so "not seen_add(x)" is always True and
    # merely records x as a side effect.
    return [x for x in seq if x not in seen and not seen_add(x)]

Some time comparison:

$ python uniqifiers_benchmark.py 
* f8_original 3.76
* uniquify 3.0
* terhorst 5.44
* terhorst_localref 4.08
* del_dups 4.76

4 Comments

I don't see a time comparison with the solutions from the top answers here. In my experience, explicit loops are faster than list comprehensions in CPython (at least, it requires a benchmark for each particular case).
I added the timings above. The main overhead in the presented solutions is the attribute lookup for add, append and so on, but even if you factor that out, the list comprehension is about 25% faster than terhorst_localref.
could you include the complete benchmark code in your answer? I don't see terhorst (or any other relevant code) in the file you linked.
3

You can actually do something really cool in Python to solve this. You can create a list comprehension that would reference itself as it is being built. As follows:

   # remove duplicates...
   def unique(my_list):
       return [x for x in my_list if x not in locals()['_[1]']]

Edit: I removed the "self", and it works on Mac OS X, Python 2.5.1.

The _[1] is Python's "secret" reference to the new list. The above, of course, is a little messy, but you could adapt it to fit your needs as necessary. For example, you can actually write a function that returns a reference to the comprehension; it would look more like:

return [x for x in my_list if x not in this_list()]

2 Comments

The example as given does not compile for me -- the trailing ".__self__" is not valid [[Linux 2.6 w/ Python 2.5.1]]
Holy cow, you're turning Python into Perl with the magic underscore business. Just say no.
2

Do the duplicates necessarily need to be in the list in the first place? There's no overhead for looking elements up, only a little extra overhead when adding elements (though that should be O(1)).

>>> x  = []
>>> y = set()
>>> def add_to_x(val):
...     if val not in y:
...             x.append(val)
...             y.add(val)
...     print x
...     print y
... 
>>> add_to_x(1)
[1]
set([1])
>>> add_to_x(1)
[1]
set([1])
>>> add_to_x(1)
[1]
set([1])
>>> 
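The same idea can be packaged as a small class instead of a pair of module-level variables (a minimal sketch; the class name is made up):

class UniqueList:
    """Append-only list that silently ignores values it has already seen."""
    def __init__(self):
        self.items = []
        self._seen = set()

    def append(self, val):
        if val not in self._seen:
            self._seen.add(val)
            self.items.append(val)

ul = UniqueList()
for v in [1, 1, 2, 'a', 'a', 3]:
    ul.append(v)
print(ul.items)  # -> [1, 2, 'a', 3]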

Comments

2

Remove duplicates and preserve order:

This is a fast 2-liner that leverages built-in functionality of list comprehensions and dicts.

x = [1, 1, 2, 'a', 'a', 3]

tmpUniq = {} # temp variable used below 
results = [tmpUniq.setdefault(i,i) for i in x if i not in tmpUniq]

print(results)
[1, 2, 'a', 3]

The dict.setdefault() method returns the value as well as adding it to the temp dict, directly in the list comprehension. Using built-in methods and the dict's hashing maximizes the efficiency of the process.

Comments

1

O(n) if dict is a hash table, O(n log n) if dict is a tree; simple and fixed. Thanks to Matthew for the suggestion. Sorry, I don't know the underlying types.

def unique(x):
  output = []
  y = {}
  for item in x:
    y[item] = ""

  for item in x:
    if item in y:
      output.append(item)
      del y[item]  # remove the key so later duplicates are skipped

  return output

1 Comment

FYI, you can also do that with a set so you don't have to set it equal to an empty string.
1

Dictionary membership testing in Python is O(1) (has_key was removed in Python 3; use the in operator). Insertion and retrieval from a hash table are also O(1). The function loops through the n items twice, so it is O(n).

def unique(lst):   # renamed from "list" to avoid shadowing the builtin
  s = {}
  output = []
  for x in lst:
    count = 1
    if x in s:     # has_key() was removed in Python 3
      count = s[x] + 1

    s[x] = count
  for x in lst:
    count = s[x]
    if count > 0:
      s[x] = 0
      output.append(x)
  return output

Comments

1

There are some great, efficient solutions here. However, for anyone not concerned with the most efficient O(n) solution, I'd go with the simple one-liner O(n^2) solution (sorted computes the key once per unique element, and each xs.index call is itself O(n)):

def unique(xs):
    return sorted(set(xs), key=lambda x: xs.index(x))

or the more efficient two-liner O(n*log(n)) solution:

def unique(xs):
    positions = dict((e,pos) for pos,e in reversed(list(enumerate(xs))))
    return sorted(set(xs), key=lambda x: positions[x])

3 Comments

That code is difficult to understand, and you say it's less efficient than the other solutions already presented here. So why would you go with it?
I consider this easy to understand; passing a lambda function as the key parameter of sorted is really the canonical way to sort a list in Python. Most of my Python work involves generating reports on lists of statistics, and so to me this seems like the simplest and most Pythonic approach.
While I agree your solution is succinct, the question asked for the fastest algorithm, not the most Pythonic.
1

Here are two recipes from the itertools documentation:

from itertools import groupby, ifilterfalse, imap  # Python 2 names
from operator import itemgetter

def unique_everseen(iterable, key=None):
    "List unique elements, preserving order. Remember all elements ever seen."
    # unique_everseen('AAAABBBCCDAABBB') --> A B C D
    # unique_everseen('ABBCcAD', str.lower) --> A B C D
    seen = set()
    seen_add = seen.add
    if key is None:
        for element in ifilterfalse(seen.__contains__, iterable):
            seen_add(element)
            yield element
    else:
        for element in iterable:
            k = key(element)
            if k not in seen:
                seen_add(k)
                yield element

def unique_justseen(iterable, key=None):
    "List unique elements, preserving order. Remember only the element just seen."
    # unique_justseen('AAAABBBCCDAABBB') --> A B C D A B
    # unique_justseen('ABBCcAD', str.lower) --> A B C A D
    return imap(next, imap(itemgetter(1), groupby(iterable, key)))
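These are the Python 2 versions; on Python 3, substitute itertools.filterfalse for ifilterfalse and the builtin map for imap. Usage, echoing the docstring examples:

>>> list(unique_everseen('AAAABBBCCDAABBB'))
['A', 'B', 'C', 'D']
>>> list(unique_justseen('ABBCcAD', str.lower))
['A', 'B', 'C', 'A', 'D']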

Comments

0

I have no experience with python, but an algorithm would be to sort the list, then remove duplicates (by comparing to previous items in the list), and finally find the position in the new list by comparing with the old list.

Longer answer: http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/52560
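A minimal Python sketch of the described approach (assuming the elements are comparable; the function name is made up):

def unique_via_sort(lst):
    # Sort a copy so that duplicates become adjacent.
    ordered = sorted(lst)
    # Keep each item that differs from its predecessor.
    deduped = [v for i, v in enumerate(ordered) if i == 0 or v != ordered[i - 1]]
    # Restore the original order by position of first appearance.
    deduped.sort(key=lst.index)
    return deduped

print(unique_via_sort([1, 1, 2, 3, 2]))  # -> [1, 2, 3]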

Comments

0
>>> def unique(list):
...   y = []
...   for x in list:
...     if x not in y:
...       y.append(x)
...   return y

1 Comment

To explain why: searching for x in a list structure (y) is O(n), while searching for x in a set (or dictionary) is O(1).
0

If you take out the empty list from the call to set() in Terhorst's answer, you get a little speed boost.

Change: found = set([])
to: found = set()

However, you don't need the set at all.

def unique(items):
    keep = []

    for item in items:
        if item not in keep:
            keep.append(item)

    return keep

Using timeit I got these results:

with set([]) -- 4.97210427363
with set() -- 4.65712377445
with no set -- 3.44865284975

1 Comment

Yeah, when you have little data, I bet the set's internal mechanism is slower than iterating over a list. But if you've got maaaaaaaaaaany elements, I think sets are faster. Or what would be the point of this data structure? ;-)
0
x = [] # Your list  of items that includes Duplicates

# Assuming that your list contains items of only immutable data types

dict_x = {} 

dict_x = {item : item for i, item in enumerate(x) if item not in dict_x.keys()}
# Average time complexity = O(n) * O(1); furthermore, the dict comprehension and the generator-like behaviour of enumerate add a certain efficiency and Pythonic feel to it.

x = dict_x.keys() # if you want your output in list format 

5 Comments

What can go wrong with items of mutable types?
This doesn't work the way you think it does. if item not in dict_x.keys() is checking the keys of the original, empty dict_x, not the dictionary being created. It's always true. The duplicates are removed simply because trying to create a duplicate key is ignored.
Why are you using enumerate()?
@ByteEater Mutable types can't be used as dictionary keys.
@Bramar, maybe he tried to enable mutable types by writing i : item instead of item : item (although a single name for the pair would suffice) and then doing .values() instead of .keys() in both places. But that won't work because of your first comment.
-1
>>> x=[1,1,2,'a','a',3]
>>> y = [ _x for _x in x if not _x in locals()['_[1]'] ]
>>> y
[1, 2, 'a', 3]


"locals()['_[1]']" is the "secret name" of the list being created.

2 Comments

Presence of _[1] local is not guaranteed by language.
"<item> in <list>" is O(n), so this is slow.
-1

I don't know if this one is fast or not, but at least it is simple.

Simply convert it first to a set and then back to a list:

def unique(container):
  return list(set(container))

1 Comment

This does not preserve order.
-2

One pass.

a = [1,1,'a','b','c','c']

new_list = []
prev = None

while 1:
    try:
        i = a.pop(0)
        if i != prev:
            new_list.append(i)
        prev = i
    except IndexError:
        break
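Note this only collapses adjacent duplicates, so it effectively assumes sorted (or at least grouped) input; itertools.groupby expresses the same idea more directly (a sketch):

from itertools import groupby

a = [1, 1, 'a', 'b', 'c', 'c']
new_list = [key for key, _ in groupby(a)]
print(new_list)  # -> [1, 'a', 'b', 'c']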

1 Comment

Requires sorted input, doesn't it?
-2

I haven't done any tests, but one possible algorithm might be to create a second list, and iterate through the first list. If an item is not in the second list, add it to the second list.

x = [1, 1, 2, 'a', 'a', 3]
y = []
for each in x:
    if each not in y:
        y.append(each)

5 Comments

I find your use of the variable name "each" really confusing to read, probably because in many languages it is a keyword. It's much clearer to use item or just i.
'i' to me implies an index - we aren't iterating through indices, we are iterating through objects. I'd prefer item, but I don't see 'each' as bad; just because it is a keyword in another language, why prevent its use here? Syntax highlighting (as shown above) picks it up fine...
Other than AppleScript, what languages use the word 'each' as a keyword?
You should have used a set. This is unlikely to be the fastest.
Marcin: "... while preserving order".
-2
a = [1,2,3,4,5,7,7,8,8,9,9,3,45]

def unique(l):
    ids = {}
    for item in l:
        if item not in ids:   # has_key() was removed in Python 3
            ids[item] = item
    return ids.keys()

print(a)
print(unique(a))

Inserting the elements takes Θ(n); checking whether an element exists takes constant time; and testing all the items also takes Θ(n), so we can see that this solution is Θ(n). Bear in mind that dictionaries in Python are implemented as hash tables.

1 Comment

The questions says "while preserving order". A Python dictionary doesn't preserve order.
