Meet @henry__morris! He’s the inspirational serial entrepreneur who set up PiC and upReach. They’re amazing businesses that focus on social mobility. We interviewed him for PDFTables.com. He’s been using it to convert delegate lists that come as PDF into Excel and then into his Apple iPhone. It’s his preferred personal Customer Relationship Management (CRM) system, it’s […]
Data Business Models
If it sometimes feels like the data business is full of buzzwords and hipster technical jargon, then that’s probably because it is. But don’t panic! I’ve been at loads of hip and non-hip data talks here and there and, buzzwords aside, I’ve come across four actual categories of data business model in this hip data […]
Scraping guides: Excel spreadsheets
Following on from the CSV scraping guide, we’ve now added one about scraping Excel spreadsheets. You can get to them from the documentation page. The Excel scraping guide is available in Ruby, Python and PHP. Just as with all documentation, you can choose which language at the top right of the page. As with CSV files, at first […]
Ruby screen scraping tutorials
Mark Chapman has been busy translating our Python web scraping tutorials into Ruby. The set now includes three tutorials on how to write basic screen scrapers, plus extra ones on using .ASPX pages, Excel files and CSV files. We’ve also installed some extra Ruby modules – spreadsheet and FastCSV – to make them possible. These Ruby scraping […]