xml parsing - Looking for an open source web crawler that can crawl API requests and parse XML into CSV
I'm looking for web crawlers that can crawl through an API and parse the XML it returns into an XML or CSV file.
I've been playing around with requests to API feeds, and it would be great if I didn't have to do this manually and could instead run them automatically and edit the data later.
For example, using the API for a site called Eventful, I can request an XML feed of data:
http://api.eventful.com/rest/events/search?app_key=lksbnc8mgtjd4wc5&location=pittsburgh&date=future
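For reference, a request like the one above can be scripted rather than pasted into a browser. This is only a minimal sketch using the Python standard library; the helper names `build_search_url` and `fetch_xml` are made up for illustration, and `YOUR_APP_KEY` is a placeholder for your own key. The query parameters are simply the ones visible in the URL above.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the example URL in the question.
BASE_URL = "http://api.eventful.com/rest/events/search"


def build_search_url(app_key, location, date="future"):
    """Assemble the search URL from its query parameters (hypothetical helper)."""
    query = urlencode({"app_key": app_key, "location": location, "date": date})
    return f"{BASE_URL}?{query}"


def fetch_xml(url, timeout=10):
    """Download the raw XML bytes for a URL (hypothetical helper)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


if __name__ == "__main__":
    # YOUR_APP_KEY is a placeholder, not a real key.
    url = build_search_url("YOUR_APP_KEY", "pittsburgh")
    print(url)
    # xml_bytes = fetch_xml(url)  # uncomment to actually call the API
```

Running this on a schedule (cron or similar) would cover the "automatically" part without needing a full crawler.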
If you inspect the link, you can see there is a ton of XML data sent back.
I thought that since the XML data is already broken down into elements, it wouldn't be difficult to ask a crawler to handle the sorting (e.g. the city element would send its data to the city field in a CSV document).
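To make the element-to-column idea concrete, here is a rough sketch using only Python's standard library (`xml.etree.ElementTree` and `csv`). The sample XML and the element names `event`, `title`, `city_name` and `start_time` are assumptions made up for illustration; the real feed may use different names, so adjust `FIELDS` to match what the API actually returns.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical sample response; the real feed's element names may differ.
SAMPLE_XML = """<search><events>
<event id="E1"><title>Jazz Night</title><city_name>Pittsburgh</city_name><start_time>2014-05-01 20:00:00</start_time></event>
<event id="E2"><title>Art Walk</title><city_name>Pittsburgh</city_name></event>
</events></search>"""

# Each child element named here becomes one CSV column (assumed names).
FIELDS = ["title", "city_name", "start_time"]


def xml_to_csv(xml_text, fields=FIELDS):
    """Return CSV text with one row per <event> and one column per field."""
    root = ET.fromstring(xml_text)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(fields)  # header row
    for event in root.iter("event"):
        # A missing or empty element becomes an empty cell.
        writer.writerow([(event.findtext(name) or "").strip() for name in fields])
    return out.getvalue()


if __name__ == "__main__":
    print(xml_to_csv(SAMPLE_XML))
```

The resulting CSV text can be written to a file and opened directly in Excel, so no Excel-specific library is needed for a simple export.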
I'm wondering if anyone has used an existing open source web crawler to crawl APIs and map the parsed data to an Excel format.
I looked at Nutch but couldn't find any reference in the documentation to sorting an XML response into an Excel document based on the elements returned by the API feed.
Has anyone done this before and can refer me to a program? Specifics would be helpful.
We at http://import.io/ have a free solution similar to Mozenda: you build an API using our web browser, and you can then upload the API to our servers and use it for free. We also offer a crawler and various other features. Check it out and see what you think :)
P.S. I work for import.io, if you didn't guess already.