Python, web log data mining for frequent patterns
Posted
by descent
on Stack Overflow
See other posts from Stack Overflow
or by descent
Published on 2010-05-27T22:46:01Z
Indexed on
2010/05/27
22:51 UTC
Read the original article
Hit count: 163
python
|data-mining
Hello!
I need to develop a tool for web log data mining.
Having many sequences of urls, requested in a particular user session (retrieved from web-application logs), I need to figure out the patterns of usage and groups (clusters) of users of the website.
I am new to Data Mining, and now examining Google a lot. Found some useful info, i.e. querying Frequent Pattern Mining in Web Log Data seems to point to almost exactly similar studies.
So my questions are:
- Are there any python-based tools that do what I need or at least smth similar?
- Can Orange toolkit be of any help?
- Can reading the book Programming Collective Intelligence be of any help?
- What to Google for, what to read, which relatively simple algorithms to use best?
I am very limited in time (to around a week), so any help would be extremely precious. What I need is to point me into the right direction and the advice of how to accomplish the task in the shortest time.
Thanks in advance!
© Stack Overflow or respective owner