Site crawler/spider that tosses results into mysql

Posted by ian.evans on Server Fault See other posts from Server Fault or by ian.evans
Published on 2010-02-22T15:14:23Z Indexed on 2010/03/17 16:01 UTC
Read the original article Hit count: 575

Filed under:
|
|
|

It's been suggested that we use mysql for our site's search as it'd be running on the same server that hosts our web server (nginx) and our db (mysql).

Since not all of our pages are created from the database, it's been suggested that we have a crawler that can crawl the site, and toss the page url and data into mysql and have sphinx index on that.

Does anyone know of an open source spider that has a mysql storing option out of the box.

Thanks.

© Server Fault or respective owner

Related posts about sphinxsearch

Related posts about mysql