Pandas: get index of each element

Question

I guess this is a duplicate of Find element's index in pandas Series .

This is my dataframe;

      WORD1    CAT1   
    elephant   animal  
        lion   animal
       tiger   animal
      hoopoe    bird 
    hornbill    bird
   sunflower   flower
        rose   flower
     giraffe   animal
       zebra   animal
     sparrow    bird  
        duck   animal

I would like to get the index of each element from 'CAT1';

Let me put it this way;

for d in data['CAT1']:
    print data[data['CAT1'] == d].index[0]
...
0
0
0
3
3
5
5
0
0
3
0

The above returns the index, but falters when there are duplicates. How do I get this rectified?

For future readers of this question, could you update to be clearer about what you actually want as an output? "get the index of each element from 'CAT1'" is ambiguous. Do you want the first index of each distinct entry in CAT1 or do you want to assign each distinct entry a number and replace the text with this number? — LondonRob
– LondonRob, Commented Feb 12, 2014 at 14:40

jonrsharpe · Accepted Answer · 2014-02-12 11:06:24Z

1

You can enumerate in Python to get the indices along with the items:

for i, d in enumerate(data['CAT1']):
     print(i)

If you want to select from WORD1 by CAT1, you could zip them, for example:

birds = [w for w, c in zip(data['WORD1'], data['CAT1']) if c == "bird")]

Note: str.index is a method for finding the index of a sub-string within a string.

answered Feb 12, 2014 at 11:06

jonrsharpe

123k30 gold badges275 silver badges487 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

jonrsharpe Over a year ago

As you've seen, list.index gives you the first index only. It's not entirely clear what you're trying to achieve; have you tried the suggestions in my answer?

richie Over a year ago

@jonrharpe yes. Tried it. Makes sense. But I'm looking for something like this stackoverflow.com/q/18327624/1948860

jonrsharpe Over a year ago

The answers there cover this too, you can use data['CAT1'].get_loc(d) or data[data['CAT1'] == d]

Collectives™ on Stack Overflow

Pandas: get index of each element

1 Answer 1

3 Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

3 Comments

Linked

Related