Open University academic, pragmatic data junkie and wannabe web technology appropriation artist.
Scrapers and views (150)
-
Type scraper Language python Status Protected Tony Hirst / All Party Groups (backup...)
148 lines of code. No rows of data yet.
Created 9 months, 3 weeks ago.
-
Type scraper Language python Status Public Tony Hirst / PDF Scraper Intro
126 lines of code. 476 rows of data.
Created 9 months, 4 weeks ago.
-
Type view Language python Status Public Tony Hirst / OpenCorporates Trawler Timeline Test
43 lines of code.
Created 10 months ago.
-
Type scraper Language python Status Public Tony Hirst / IW Council Contracts
99 lines of code. 230 rows of data.
Created 10 months ago.
Scrape of the Isle of Wight Council Contracts pages:http://www.iwight.com/contracts/ Still to do – ponder policy for weekly scrape – trawl the whole database each time?
-
Type scraper Language python Status Public Tony Hirst / IW CC
61 lines of code. 787 rows of data.
Created 10 months, 1 week ago.
-
Type scraper Language python Status Public Tony Hirst / BGMEA Factories
198 lines of code. 7,692 rows of data.
Created 10 months, 1 week ago.
Scraper of BGMEA (Bangladesh Garment Manufacturers and Exporters Association) members list http://www.bgmea.com.bd/member/memberlist/ For more information see: http://schoolofdata.org/2013/05/18/data-expedition-mapping-the-garment-factories/
-
Type view Language html Status Public Tony Hirst / Google Spreadsheet Query
192 lines of code.
Created 10 months, 2 weeks ago.
-
Type scraper Language python Status Public Tony Hirst / IW poll notices scrape
125 lines of code. 4,237 rows of data.
Created 10 months, 3 weeks ago.
Scrape Isle of WIght notices of election for 2013 local election.
-
Type scraper Language python Status Public Tony Hirst / Opencharities charity comparison test
26 lines of code. 170 rows of data.
Created 10 months, 4 weeks ago.
-
Type view Language python Status Public Tony Hirst / KML merge test
41 lines of code.
Created 11 months ago.
QUick hack at trying to generate a single KML file containing boundary data for wards within a council area using data from MySociety/Mapit. Usage: grab ID from local council MapIt page (eg 65791 from http://mapit.mysociety.org/area/65791.html) then call: https://views.scraperwiki.com/run/kml_merge_test/?key=65791 (I guess there's a risk of banging up against MapIt API throttle/usage limit?)
-
Type scraper Language python Status Public Tony Hirst / purehelp.no helper
143 lines of code. 605 rows of data.
Created 11 months, 1 week ago.
-
Type scraper Language python Status Public Tony Hirst / Demo PDF parser
53 lines of code. 75 rows of data.
Created 11 months, 2 weeks ago.
-
Type view Language python Status Public Tony Hirst / OpenCorporates Director Timeline
103 lines of code.
Created 11 months, 3 weeks ago.
-
Type scraper Language python Status Protected Tony Hirst / OpenCorporates Director History
81 lines of code. 96 rows of data.
Created 11 months, 3 weeks ago.
Has 1 secret query-string environment variable.
-
Type scraper Language python Status Protected Tony Hirst / eduCrunchbase
76 lines of code. 524 rows of data.
Created 11 months, 3 weeks ago.
Has 1 secret query-string environment variable.
-
Type scraper Language python Status Public Tony Hirst / edu Crunchbase POINTLESS scraper
49 lines of code. No rows of data yet.
Created 11 months, 3 weeks ago.
-
Type scraper Language python Status Protected Tony Hirst / Opencorporates Trademark Trawler
76 lines of code. 59 rows of data.
Created 1 year ago.
Has 2 secret query-string environment variables.
-
Type view Language python Status Public Tony Hirst / OpenCorporates Trawler gexf
94 lines of code.
Created 1 year, 1 month ago.
-
Type view Language python Status Public Tony Hirst / EU Horse imports Sankey Diagram
336 lines of code.
Created 1 year, 1 month ago.
-
Type scraper Language python Status Protected Tony Hirst / opencorporates trawler
299 lines of code. 73,603 rows of data.
Created 1 year, 1 month ago.
Has 2 secret query-string environment variables.