How to analyse Wikipedia article's data base with R?
Posted
by Tal Galili
on Stack Overflow
See other posts from Stack Overflow
or by Tal Galili
Published on 2010-04-10T20:58:16Z
Indexed on
2010/04/10
21:03 UTC
Read the original article
Hit count: 200
Hi all,
This is a "big" question, that I don't know how to start, so I hope some of you can give me a direction. And if this is not a "good" question, I will close the thread with an apology.
I wish to go through the database of Wikipedia (let's say the English one), and do statistics. For example, I am interested in how many active editors (which should be defined) Wikipedia had at each point of time (let's say in the last 2 years).
I don't know how to build such a database, how to access it, how to know which types of data it has and so on. So my questions are:
- What tools do I need for this (besides basic R) ? MySQL on my computer? RODBC database connection?
- How do you start planning for such a project?
© Stack Overflow or respective owner