Python, web log data mining for frequent patterns
- by descent
Hello!
I need to develop a tool for web log data mining.
I have many sequences of URLs, each one requested within a single user session (retrieved from web-application logs), and I need to find usage patterns and groups (clusters) of users of the website.
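To make the input concrete, here is a minimal sketch of the simplest kind of pattern counting this could start from: counting contiguous URL sub-paths that occur in at least a given number of sessions. The function name `frequent_paths`, its parameters, and the sample sessions are all made up for illustration. It is a plain n-gram count, not a full sequential pattern mining algorithm and not the method from any particular study.

```python
from collections import Counter


def frequent_paths(sessions, length=2, min_support=2):
    """Count contiguous URL sub-paths of `length`, one vote per session."""
    counts = Counter()
    for urls in sessions:
        # A set ensures a path repeated inside one session is counted once.
        seen = {tuple(urls[i:i + length])
                for i in range(len(urls) - length + 1)}
        counts.update(seen)
    # Keep only paths that appear in at least `min_support` sessions.
    return {path: n for path, n in counts.items() if n >= min_support}


# Hypothetical sessions; real ones would come from parsing the logs.
sessions = [
    ["/home", "/products", "/cart", "/checkout"],
    ["/home", "/products", "/about"],
    ["/products", "/cart", "/checkout"],
]

for path, n in sorted(frequent_paths(sessions).items()):
    print(" -> ".join(path), n)
```

Raising `length` looks for longer paths, and raising `min_support` keeps only the more common ones. Clustering users would be a separate step on top of counts like these.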
I am new to data mining and have been searching Google a lot.
I found some useful information; for example, searching for "Frequent Pattern Mining in Web Log Data" seems to turn up studies of almost exactly this problem.
So my questions are:
Are there any Python-based tools that do what I need, or at least something similar?
Can the Orange toolkit be of any help?
Can reading the book Programming Collective Intelligence be of any help?
What should I Google for, what should I read, and which relatively simple algorithms would work best?
I am very limited in time (about a week), so any help would be greatly appreciated. What I need is a pointer in the right direction and advice on how to accomplish the task in the shortest time.
Thanks in advance!