R problems using rpart with 4000 records and 13 attributes

Posted by josh on Stack Overflow See other posts from Stack Overflow or by josh
Published on 2010-04-21T20:46:55Z Indexed on 2010/04/21 21:13 UTC
Read the original article Hit count: 190

Filed under:

I have attempted to email the author of this package without success, just wondering if anybody else has experienced this.

I am having an using rpart on 4000 rows of data with 13 attributes. I can run the same test on 300 rows of the same data with no issue. When I run on 4000 rows, Rgui.exe runs consistently at 50% cpu and the UI hangs.... it will stay like this for at least 4-5hours if I let it run, and never exit or become responsive.

here is the code I am using both on the 300 and 4000 size subset :

train<-read.csv("input.csv",header=T)
y<-train[,18]
x<-train[,3:17]
library(rpart)
fit<-rpart(y~.,x)

Is this a known limitation of rpart, am I doing something wrong? potential workarounds?

any assistance appreciated

© Stack Overflow or respective owner

Related posts about r