Creating a new column from each row in pandas

Question

I'm trying to create a new column and populate it using values from each row. I have a column 'Journey' and the new column is 'Origin'.

def getOrigin(journey):
    if " to " in journey:
        return journey.split(" to ")[0]
    else:
        return "No origin"

df['Origin'] = getOrigin(df.Journey)

print(df['Origin'])

If df.Journey is "America to England", then I'd expect df['Origin'] to be 'America', but instead every row of Origin is "No origin". How do I do this?

Kevin Ramnauth · Accepted Answer · 2018-04-23 20:55:48Z

1

I believe you need to map it like so:

df['Origin'] = df.Journey.applymap(getOrigin)

this should apply your function to every item in the Journey column

answered Apr 23, 2018 at 20:55

Kevin Ramnauth

431 silver badge4 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

cs95 Over a year ago

You can 100% vectorize with a non-loopy solution... although this is an alternative, I'd still recommend 5 other more pertinent ones before getting to this.

Ryan · Accepted Answer · 2018-04-23 21:16:58Z

This solution is less efficient with a lot more code, but as a beginner, easier to understand maybe... Consistent with the way you tried to solve the problem...!

df = pd.DataFrame(data = {'Journey' : ['england to america', 'peru', 'france to china']})

origin = []
def getOrigin(Journey):
    for i in range(len(Journey)):
        if " to " in Journey[i]:
            origin.append(Journey[i].split(" to ")[0])
        else:
            origin.append("No origin")
return origin



df['Origin'] = getOrigin(df['Journey'])

print (df['Origin'])

0      england
1    No origin
2       france
Name: Origin, dtype: object

cs95 · Accepted Answer · 2019-05-29 22:52:34Z

0

`str.extract` + `fillna`

df['Origin'] = df['Journey'].str.extract('^(.*?)(?=\s*to)').fillna('No origin')

`str.split` + `fillna`

df['Origin'] = df['Journey'].str.split(' to').str[0].fillna('No origin')

List comprehension

df['Origin'] = [
    x.split(' to ')[0] if 'to' in x else 'No origin' for x in df['Journey']
]

edited May 29, 2019 at 22:52

answered Apr 23, 2018 at 20:52

cs95

406k106 gold badges744 silver badges794 bronze badges

Collectives™ on Stack Overflow

Creating a new column from each row in pandas

3 Answers 3

1 Comment

Comments

`str.extract` + `fillna`

`str.split` + `fillna`

List comprehension

Comments

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

1 Comment

Comments

str.extract + fillna

str.split + fillna

List comprehension

Comments

Related

`str.extract` + `fillna`

`str.split` + `fillna`