Comments on: The Tyranny of the PDF https://blog.scraperwiki.com/2013/12/the-tyranny-of-the-pdf/ Extract tables from PDFs and scrape the web Thu, 14 Jul 2016 16:12:42 +0000 hourly 1 https://wordpress.org/?v=4.6 By: The Tyranny of the PDF | Frontiers of Journalis... https://blog.scraperwiki.com/2013/12/the-tyranny-of-the-pdf/#comment-1035 Fri, 27 Dec 2013 22:06:44 +0000 https://blog.scraperwiki.com/?p=758220272#comment-1035 […] Why is ScraperWiki so interested in PDF files? Because the world is full of PDF files. The treemap above shows the scale of their dominance. In the treemap the area a segment covers is proportional to the number of examples we found.  […]

]]>