Site crawler/spider that tosses results into mysql
Posted
by ian.evans
on Server Fault
See other posts from Server Fault
or by ian.evans
Published on 2010-02-22T15:14:23Z
Indexed on
2010/03/17
16:01 UTC
Read the original article
Hit count: 575
It's been suggested that we use mysql for our site's search as it'd be running on the same server that hosts our web server (nginx) and our db (mysql).
Since not all of our pages are created from the database, it's been suggested that we have a crawler that can crawl the site, and toss the page url and data into mysql and have sphinx index on that.
Does anyone know of an open source spider that has a mysql storing option out of the box.
Thanks.
© Server Fault or respective owner