Recently Zarino and I were pairing on making improvements to a new scraping tool on ScraperWiki. We were working on some code that allows the person using the tool to pick out parts of some scraped data in order to extract a date into a new database column. For processing the data on the server […]
Scraping PDFs: now 26% less unpleasant with ScraperWiki
Got a PDF you want to get data from? Try our easy web interface over at PDFTables.com! Scraping PDFs is a bit like cleaning drains with your teeth. It’s slow, unpleasant, and you can’t help but feel you’re using the wrong tools for the job. Coders try to avoid scraping PDFs if there’s any other option. But […]
