I know I've come across this problem before, but I'm having a bit of a mental block at the moment. and as I can't find it on SO, I'll post it here so I can find it next time.
I have a dataframe that contains a field representing an ID label. This label has two parts, an alpha prefix and a numeric suffix. I want to split it apart and create two new fields with these values in.
structure(list(lab = c("N00", "N01", "N02", "B00", "B01", "B02",
"Z21", "BA01", "NA03")), .Names = "lab", row.names = c(NA, -9L
), class = "data.frame")
df$pre<-strsplit(df$lab, "[0-9]+")
df$suf<-strsplit(df$lab, "[A-Z]+")
Which gives
lab pre suf
1 N00 N , 00
2 N01 N , 01
3 N02 N , 02
4 B00 B , 00
5 B01 B , 01
6 B02 B , 02
7 Z21 Z , 21
8 BA01 BA , 01
9 NA03 NA , 03
So, the first strsplit works fine, but the second gives a list, each having two elements, an empty string and the result I want, and stuffs them both into the dataframe column.
How can I select the second sub-element from each element of the list ? (or, is there a better way to do this)
Thanks
Paul.