I have a dataframe like this:
import numpy as np
import pandas as pd

df_nba = pd.DataFrame({
    'col1': [
        'name', np.nan, np.nan, 'course', 'eca', 'pages',
        'name', np.nan, np.nan, 'course', 'pages',
        'name', np.nan, np.nan, 'course', 'eca', 'pages',
        'name', np.nan, np.nan, 'course', 'eca', 'pages',
        'name', np.nan, np.nan, 'course', 'pages',
        'name', np.nan, np.nan, 'course', 'eca', 'pages',
    ],
    'col2': [
        'jim', 'California', 'M', 'Biology', 'Biology Club', 1,
        'jim', 'California', 'M', 'Physics', 2,
        'greg', 'Arizona', 'M', 'Geography', 'Jazz Band', 3,
        'greg', 'Arizona', 'M', 'Physics', 'Photography', 4,
        'jesse', 'Washington', 'F', 'Economics', 5,
        'jesse', 'Washington', 'F', 'Literature', 'Photography', 6,
    ],
})
col1 col2
0 name jim
1 NaN California
2 NaN M
3 course Biology
4 eca Biology Club
5 pages 1
6 name jim
7 NaN California
8 NaN M
9 course Physics
10 pages 2
11 name greg
12 NaN Arizona
13 NaN M
14 course Geography
15 eca Jazz Band
16 pages 3
17 name greg
18 NaN Arizona
19 NaN M
20 course Physics
21 eca Photography
22 pages 4
23 name jesse
24 NaN Washington
25 NaN F
26 course Economics
27 pages 5
28 name jesse
29 NaN Washington
30 NaN F
31 course Literature
32 eca Photography
33 pages 6
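For each person, the two rows right after the name row always have a missing label in col1; their col2 values are the person's state and gender. The eca row is optional and does not appear in every record. Can I first fill the missing labels with states and gender, and then reshape the data into a column-wise view with one row per record?
The expected output is: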
    name      states gender      course           eca  pages
0    jim  California      M     Biology  Biology Club      1
1    jim  California      M     Physics           NaN      2
2   greg     Arizona      M   Geography     Jazz Band      3
3   greg     Arizona      M     Physics   Photography      4
4  jesse  Washington      F   Economics           NaN      5
5  jesse  Washington      F  Literature   Photography      6
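
To make the fill-then-reshape idea concrete, here is a minimal sketch of one way it might be done. It rests on assumptions that are not guaranteed by the data: every record starts with a name row, and the two unlabeled rows that follow are always state and then gender, in that order. The helper names `records`, `record_id`, `pos`, `labels`, and `wide` are made up for illustration, and the frame is rebuilt compactly from tuples only so the block runs on its own.

```python
import numpy as np
import pandas as pd

# Rebuild the long-format frame from the question (None = no 'eca' row).
records = [
    ("jim", "California", "M", "Biology", "Biology Club", 1),
    ("jim", "California", "M", "Physics", None, 2),
    ("greg", "Arizona", "M", "Geography", "Jazz Band", 3),
    ("greg", "Arizona", "M", "Physics", "Photography", 4),
    ("jesse", "Washington", "F", "Economics", None, 5),
    ("jesse", "Washington", "F", "Literature", "Photography", 6),
]
rows = []
for name, state, gender, course, eca, pages in records:
    rows += [("name", name), (np.nan, state), (np.nan, gender), ("course", course)]
    if eca is not None:
        rows.append(("eca", eca))
    rows.append(("pages", pages))
df_nba = pd.DataFrame(rows, columns=["col1", "col2"])

# Assumption: each record begins at a 'name' row, so a running count of
# 'name' rows gives a 0-based record id for every row.
record_id = df_nba["col1"].eq("name").cumsum() - 1

# Position of each row inside its record (0 = name, 1 = state, 2 = gender, ...).
pos = df_nba.groupby(record_id).cumcount()

# Fill the missing labels by position (assumed order: state, then gender).
labels = df_nba["col1"]
labels = labels.mask(labels.isna() & pos.eq(1), "states")
labels = labels.mask(labels.isna() & pos.eq(2), "gender")

# Pivot to one row per record; records without an 'eca' row get NaN there.
wide = (
    df_nba.assign(col1=labels, record=record_id)
    .pivot(index="record", columns="col1", values="col2")
    .reindex(columns=["name", "states", "gender", "course", "eca", "pages"])
    .rename_axis(index=None, columns=None)
)
print(wide)
```

Labeling by position within each record, rather than by a plain forward fill, keeps the state and gender rows distinct. If the order of the unlabeled rows can vary, this sketch would mislabel them and a different rule would be needed.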