Replacing values in a data frame in Python

Question

I'm new to python, and trying to learn how to data analysis with it. I have a data frame in python (called "data"). I am looking to recode a variable, GEND, which has three values (1, 2, 3). Using pandas, I read in a csv file using pd.read_csv(). I am trying to replace all instances of "3" in the variable GEND to missing (NaN). However, I can't seem to find out how to do it. So far I've tried a for loop, which doesn't show an error, but doesn't change the variable information:

for value in data.GEND:
if value == 3:
    value = np.nan

I've also tried this, which doesn't show an error, but also doesn't do anything:

data.GEND.loc[3] = np.nan

and this, which works but changes the value of the ID variable to "3", but otherwise correctly changes the value of "3" in the GEND variable to NaN:

data.GEND.replace(to_replace=3, value = nan)

What am I missing here? I'd also like to know how I can do the above but create a new column in the data frame that contains the new information (so I can keep the original values if I mess up).

dting · Accepted Answer · 2015-08-04 22:06:02Z

4

You can use loc to replace the 3's:

df = pd.DataFrame({'GEND':[1,2,1,2,3,1,2,3,1,2,1,2,]})
df.loc[df.GEND == 3, 'GEND'] = np.NaN

Also using where you can obtain the same result:

df.GEND = df.GEND.where(df.GEND != 3)

edited Aug 4, 2015 at 22:06

answered Aug 4, 2015 at 22:01

dting

39.4k10 gold badges98 silver badges117 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

dting Over a year ago

That replaces the 3rd loc with NaN, print out what df.GEND.loc[3] is and you should see what it is doing.

EdChum Over a year ago

@Daniel loc performs label indexing, so it returns just the row where the index is 3

user4326875 Over a year ago

Thanks, guys! This was super frustrating for me and you helped me a lot! The code works now!

user4326875 Over a year ago

The following code worked for me: data.loc[data.GEND == 3, 'GEND'] = np.NaN

Collectives™ on Stack Overflow

Replacing values in a data frame in Python

1 Answer 1

4 Comments

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

4 Comments

Related