How can I scrape specific data from a website
Posted
by
Stoney
on Super User
See other posts from Super User
or by Stoney
Published on 2012-09-12T15:47:45Z
Indexed on
2012/09/12
21:40 UTC
Read the original article
Hit count: 301
I'm trying to scrape data from a website for research.
The urls are nicely organized in an example.com/x format, with x as an ascending number and all of the pages are structured in the same way. I just need to grab certain headings and a few numbers which are always in the same locations. I'll then need to get this data into structured form for analysis in Excel.
I have used wget before to download pages, but I can't figure out how to grab specific lines of text.
Excel has a feature to grab data from the web (Data->From Web) but from what I can see it only allows me to download tables. Unfortunately, the data I need is not in tables.
© Super User or respective owner