I'm trying to sort columns for individual patients based on the dates in those columns in R. I made an example data set; however, it does not return dates but long numbers (no idea why). Forgive my perhaps silly way of creating the data frame :)
dd <- data.frame(rbind(
  c(as.POSIXct(as.Date("01/01/2008", format = "%d/%m/%Y")),
    as.POSIXct(as.Date("01/01/2009", format = "%d/%m/%Y")),
    as.POSIXct(as.Date("01/01/2011", format = "%d/%m/%Y")),
    as.POSIXct(as.Date("01/01/2010", format = "%d/%m/%Y"))),
  c(as.POSIXct(as.Date("01/01/2002", format = "%d/%m/%Y")),
    as.POSIXct(as.Date("01/01/2001", format = "%d/%m/%Y")),
    as.POSIXct(as.Date("01/01/2006", format = "%d/%m/%Y")),
    as.POSIXct(as.Date("01/01/2004", format = "%d/%m/%Y")))
))
dd$patient[1] <- 1
dd$patient[2] <- 2
names(dd) <- c("date1", "date2", "date3", "date4", "patient")
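A likely reason for the long numbers: rbind() on plain vectors builds a numeric matrix, and a matrix cannot carry a Date or POSIXct class, so only the underlying counts (seconds or days since 1970-01-01) survive. A minimal sketch of the effect, using the hypothetical names d, m and df purely for illustration:

```r
# Two dates, parsed the same way as in the example above.
d <- as.Date(c("01/01/2008", "01/01/2009"), format = "%d/%m/%Y")

# rbind() on vectors returns a matrix; the Date class is dropped,
# leaving the underlying numbers (days since 1970-01-01).
m <- rbind(d, d)
is.numeric(m)
inherits(m, "Date")

# Passing the vector to data.frame() as a column keeps the class.
df <- data.frame(date1 = d)
class(df$date1)
```

Building the data frame column by column, rather than row by row through rbind(), avoids the conversion.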
What I am after is a list of column names per patient, sorted by the dates within those columns. Thus:
Patient 1 : date1, date2, date4, date3
Patient 2 : date2, date1, date4, date3
EDIT:
One more thing: what if one date is missing? For example:
dd <- data.frame(
  patient = 1:2,
  date1 = as.Date(c("01/01/2008", "01/01/2002"), format = "%d/%m/%Y"),
  date2 = as.Date(c("01/01/2009", "01/01/2001"), format = "%d/%m/%Y"),
  date3 = as.Date(c("01/01/2011", "01/01/2006"), format = "%d/%m/%Y"),
  date4 = as.Date(c("01/01/2010", "01/01/2004"), format = "%d/%m/%Y")
)
dd[2, 2] <- NA
Matthew's answer gives:
> t(apply(dd, 1, function(x) c(x[1], names(x[-1])[order(x[-1])])))
patient
[1,] "1" "date1" "date2" "date4" "date3"
[2,] "2" "date2" "date4" "date3" "date1"
So the column name of the missing data point is included at the end of the sorted list of dates. But I'd like it not to be there at all. Thus:
patient
[1,] "1" "date1" "date2" "date4" "date3"
[2,] "2" "date2" "date4" "date3"
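One possible way to get this, sketched under the assumption that a list (rather than a matrix) is an acceptable result, since the rows now have different lengths: order() accepts na.last = NA, which drops missing values instead of placing them last. The names date_cols and sorted_cols are hypothetical, introduced only for this example.

```r
# Rebuild the example with one missing date for patient 2.
dd <- data.frame(
  patient = 1:2,
  date1 = as.Date(c("01/01/2008", "01/01/2002"), format = "%d/%m/%Y"),
  date2 = as.Date(c("01/01/2009", "01/01/2001"), format = "%d/%m/%Y"),
  date3 = as.Date(c("01/01/2011", "01/01/2006"), format = "%d/%m/%Y"),
  date4 = as.Date(c("01/01/2010", "01/01/2004"), format = "%d/%m/%Y")
)
dd[2, 2] <- NA

# Every column except the patient id holds a date.
date_cols <- setdiff(names(dd), "patient")

# For each row, sort the date column names by date.
# unlist() turns the row into a plain numeric vector (days since
# 1970-01-01), which orders the same way as the dates themselves;
# na.last = NA makes order() leave missing values out entirely.
sorted_cols <- lapply(seq_len(nrow(dd)), function(i) {
  dates <- unlist(dd[i, date_cols])
  date_cols[order(dates, na.last = NA)]
})
names(sorted_cols) <- dd$patient

sorted_cols
```

The result is a list with one character vector per patient, so patient 2 simply has three entries instead of four.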
Why use POSIXct when you don't have a time component? Avoid POSIXct if you don't need H:M:S, or you are likely to run into issues with daylight saving and time zones.
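To illustrate the time zone point, here is a small sketch. It assumes the "America/New_York" zone is available in the local time zone database; the names d, p and shifted are hypothetical. Converting a Date to POSIXct gives midnight UTC, so reading that instant in a zone west of UTC lands on the previous calendar day.

```r
# A plain Date has no time of day and no time zone.
d <- as.Date("01/01/2008", format = "%d/%m/%Y")

# Converting to POSIXct pins it to midnight UTC.
p <- as.POSIXct(d)

# Reading the same instant back in New York time yields the day before.
shifted <- as.Date(p, tz = "America/New_York")
format(shifted)   # "2007-12-31"

# Staying with Date avoids the shift altogether.
format(d)         # "2008-01-01"
```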