What is the best way to generate fake data for a classification problem?
Posted by Berkay on Stack Overflow
Published on 2010-04-10T00:35:26Z
I'm working on a project and have a subset of a user's keystroke timing data. The user makes n login attempts, and I want to use the recorded timings with various classification algorithms to verify whether a future login attempt was made by that user or by someone else. (Put simply, this is biometrics.)
I have the timings from 3 of the user's login attempts; of course, this is only a subset of the infinite possible data.
So far it looks like an easy classification problem. I decided to use WEKA, but as far as I understand, I have to create some fake data to feed the classification algorithm.
Can I use an optimization algorithm for this? Or is there some other way to create the fake data so that I get the minimum number of false positives?
Thanks
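To make the question concrete, here is a minimal sketch of one possible way to produce such fake data. It is not a recommended method: it fits an independent Gaussian to each keystroke interval of the recorded attempts and samples from it. The timing values, the names `fit_gaussians`, `sample` and `to_arff`, and the choice to model "impostor" attempts as the same distribution with a wider spread are all assumptions made for illustration. Only the ARFF layout follows WEKA's documented file format.

```python
import random
import statistics


def fit_gaussians(attempts):
    """Return a (mean, stdev) pair for each timing feature across the attempts."""
    columns = list(zip(*attempts))
    return [(statistics.mean(c), statistics.stdev(c)) for c in columns]


def sample(params, n, rng, scale=1.0, floor=1.0):
    """Draw n synthetic attempts from independent per-feature Gaussians.

    scale widens the spread (assumption: used here to imitate impostors);
    floor keeps every timing strictly positive.
    """
    return [
        [max(floor, rng.gauss(mu, sigma * scale)) for mu, sigma in params]
        for _ in range(n)
    ]


def to_arff(rows, labels, relation="keystrokes"):
    """Serialize rows and class labels as ARFF text that WEKA can load."""
    n_features = len(rows[0])
    lines = [f"@relation {relation}", ""]
    lines += [f"@attribute t{i} numeric" for i in range(n_features)]
    lines += ["@attribute class {genuine,impostor}", "", "@data"]
    lines += [
        ",".join(f"{v:.1f}" for v in row) + f",{label}"
        for row, label in zip(rows, labels)
    ]
    return "\n".join(lines) + "\n"


# Hypothetical data: 3 recorded attempts, 4 key-to-key intervals each (ms).
attempts = [
    [120.0, 95.0, 210.0, 130.0],
    [131.0, 88.0, 198.0, 142.0],
    [117.0, 101.0, 224.0, 125.0],
]

rng = random.Random(42)  # fixed seed so the sketch is reproducible
params = fit_gaussians(attempts)
genuine = sample(params, 50, rng)               # fake "same user" attempts
impostor = sample(params, 50, rng, scale=4.0)   # assumed impostor model
rows = genuine + impostor
labels = ["genuine"] * 50 + ["impostor"] * 50
arff_text = to_arff(rows, labels)
```

Writing `arff_text` to a `.arff` file gives something WEKA can open. With only 3 real attempts the estimated means and standard deviations are very rough, and the impostor model above is a guess, so any false-positive rate measured on such data says little about real impostors. Timings recorded from other people would be a more trustworthy negative class.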
© Stack Overflow or respective owner