Meet @henry__morris! He’s the inspirational serial entrepreneur who set up PiC and upReach. They’re amazing businesses that focus on social mobility. We interviewed him for PDFTables.com. He’s been using it to convert delegate lists that come as PDFs into Excel and then into his Apple iPhone. It’s his preferred personal Customer Relationship Management (CRM) system, it’s […]
Which car should I (not) buy? Find out, with the ScraperWiki MOT website…
I am finishing up my MSc Data Science placement at ScraperWiki and, by extension, my MSc Data Science (Computing Specialism) programme at Lancaster University. My project was to build a website to enable users to investigate the MOT data. This week the result of that work, the ScraperWiki MOT website, went live. The aim of […]
Over a billion public PDFs
You can get a guesstimate for the number of PDFs in the world by searching for filetype:pdf on a web search engine. These are the results I got in August 2015 – follow the links to see for yourself. Number of PDFs: Google 1.8 billion, Bing 84 million. Number of Excel files: Google 14 million, Bing 6 […]
The Royal Statistical Society Conference–Exeter 2015
ScraperWiki have been off to the Royal Statistical Society Conference in Exeter to discuss our wares with the delegates. The conference was very friendly, with senior RSS staff coming to see how we were doing through the week. We shared the exhibitor space in the fine atrium of The Forum at Exeter University with Wiley, […]
Civil Service People Survey – Faster, Better, Cheaper
The Civil Service is one of the UK’s largest employers. Every year it asks every civil servant what they think of their employer: UK plc. For Sir Jeremy Heywood the survey matters. In his blog post “Why is the People Survey Important?” he says “The survey is one of the few ways we can objectively […]
Horizon 2020–Project TIMON
ScraperWiki are members of a new EU Horizon 2020 project: TIMON, “Enhanced real time services for optimized multimodal mobility relying on cooperative networks and open data”. This is a 3.5-year project that commenced in June 2015, whose objectives are: to improve road safety; to provide greater transport flexibility in terms of journey planning across multiple modes […]
Number of prescriptions by location
There are 211 clinical commissioning groups (CCGs) across England dispensing a range of medications every day. These CCGs have demographic factors that could affect how much medication is dispensed. Therefore I thought it would be interesting to compare the number of items dispensed in CCGs across England for a number of different medications, using the Clinical […]
Branded and Generic medication compared
According to the Office of Health Economics for the Association of the British Pharmaceutical Industry (ABPI), the total medicines bill in the UK was £13.6 billion in 2011 and £10.8 billion of this was spent on branded medication. Prescribers such as GPs are encouraged to prescribe generic medicine instead of its branded version. This is […]
We’re hiring! Technical Architect
We’ve lots of interesting projects on – with clients like the United Nations and the Cabinet Office, and with our own work building products such as PDFTables.com. Currently we’re after a Technical Architect; full details are on our jobs page. We’re a small company, so roles depend on individual people. Get in touch if something doesn’t […]
Case study: Enrique Cocero getting political data from PDFs
Political strategy is international now. Enrique Cocero works from Madrid for his consultancy 7-50 Electoral Math, using data to understand voters and candidates in election campaigns across the world. He’s struggled with PDFs for a long time, and recently found PDF Tables via a Google search. He says: I used to have nightmares – I’m […]