Python, web log data mining for frequent patterns

Posted by descent on Stack Overflow
Published on 2010-05-27T22:46:01Z Indexed on 2010/05/27 22:51 UTC

Hello!

I need to develop a tool for web log data mining.

I have many sequences of URLs, each requested within a single user session (retrieved from web-application logs). From these I need to identify usage patterns and groups (clusters) of the website's users.
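To make the problem concrete, here is a minimal sketch of one simple approach (not necessarily the best one): counting contiguous URL subsequences (n-grams) that occur in at least a minimum number of sessions. The session data, the function name `frequent_ngrams`, and the `min_support` threshold are all made up for illustration.

```python
from collections import Counter


def frequent_ngrams(sessions, n=2, min_support=2):
    """Return contiguous URL n-grams that occur in at least min_support sessions."""
    support = Counter()
    for urls in sessions:
        # Use a set so each n-gram is counted at most once per session.
        seen = {tuple(urls[i:i + n]) for i in range(len(urls) - n + 1)}
        support.update(seen)
    return {gram: count for gram, count in support.items() if count >= min_support}


# Hypothetical sessions; real ones would be parsed from the application logs.
sessions = [
    ["/", "/products", "/cart", "/checkout"],
    ["/", "/products", "/products/42", "/cart"],
    ["/", "/about"],
    ["/products", "/cart", "/checkout"],
]

patterns = frequent_ngrams(sessions, n=2, min_support=2)
for gram, count in sorted(patterns.items(), key=lambda kv: (-kv[1], kv[0])):
    print(" -> ".join(gram), count)
```

Counting support per session rather than per occurrence keeps one very long session from dominating the result. Proper sequential pattern mining would also allow gaps between the URLs, which this sketch does not.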

I am new to data mining and have been searching Google a lot. I found some useful information; for example, searching for "Frequent Pattern Mining in Web Log Data" turns up studies of almost exactly this problem.

So my questions are:

  1. Are there any Python-based tools that do what I need, or at least something similar?
  2. Can the Orange toolkit be of any help?
  3. Would reading the book Programming Collective Intelligence be of any help?
  4. What should I search for, what should I read, and which relatively simple algorithms would work best?

I have very little time (about a week), so any help would be greatly appreciated. What I need is a pointer in the right direction and advice on how to accomplish the task in the shortest time.

Thanks in advance!

© Stack Overflow or respective owner
