what is the best way to generate fake data for classification problem ?

Posted by Berkay on Stack Overflow See other posts from Stack Overflow or by Berkay
Published on 2010-04-10T00:35:26Z Indexed on 2010/04/10 0:43 UTC
Read the original article Hit count: 490

i'm working on a project and i have a subset of user's key-stroke time data.This means that the user makes n attempts and i will use these recorded attempt time data in various kinds of classification algorithms for future user attempts to verify that the login process is done by the user or some another person. (Simply i can say that this is biometrics)

I have 3 different times of the user login attempt process, ofcourse this is subset of the infinite data.

until now it is an easy classification problem, i decided to use WEKA but as far as i understand i have to create some fake data to feed the classification algorithm.

can i use some optimization algorithms ? or is there any way to create this fake data to get min false positives ?

Thanks

© Stack Overflow or respective owner

Related posts about machine-learning

Related posts about weka