R problems using rpart with 4000 records and 13 attributes
Posted
by josh
on Stack Overflow
See other posts from Stack Overflow
or by josh
Published on 2010-04-21T20:46:55Z
Indexed on
2010/04/21
21:13 UTC
Read the original article
Hit count: 198
r
I have attempted to email the author of this package without success, just wondering if anybody else has experienced this.
I am having an using rpart on 4000 rows of data with 13 attributes. I can run the same test on 300 rows of the same data with no issue. When I run on 4000 rows, Rgui.exe runs consistently at 50% cpu and the UI hangs.... it will stay like this for at least 4-5hours if I let it run, and never exit or become responsive.
here is the code I am using both on the 300 and 4000 size subset :
train<-read.csv("input.csv",header=T)
y<-train[,18]
x<-train[,3:17]
library(rpart)
fit<-rpart(y~.,x)
Is this a known limitation of rpart, am I doing something wrong? potential workarounds?
any assistance appreciated
© Stack Overflow or respective owner