Scrapers and views (15)
-
Type scraper Language python Status Public Ben Harris / Clinical commissioning groups
35 lines of code. No rows of data yet.
Created 1 year, 1 month ago.
-
Type scraper Language python Status Public Ben Harris / Local Healthwatch organisations
28 lines of code. No rows of data yet.
Created 1 year, 1 month ago.
Another scraper collecting details of local Healthwatch organisations. This is complementary to the other one and collects only email and website URLs (which requires scraping individual organisation pages). It should be possible to join the two scrapers' tables on "name".
-
Type view Language python Status Protected Ben Harris / Routeing Guide map
9 lines of code.
Created 1 year, 11 months ago.
This simple view just serves up a single map in PNG format. It is protected because ScraperWiki doesn't support rolling back changes to views.
-
Type view Language python Status Protected Ben Harris / Routeing Guide browser
155 lines of code.
Created 1 year, 11 months ago.
This view is protected because ScraperWiki doesn't support rolling back changes to views .
-
Type scraper Language python Status Public Ben Harris / ATOC Routeing Guide
193 lines of code. 7,558 rows of data.
Created 1 year, 11 months ago.
This scraper converts the ATOC National Routeing (sic) Guide into a potentially-useful database. The tables groups and routing_points contain information from the “pink pages” (section B), namely which stations are members of which groups, and which routeing points are associated with each station. The permitted_routes table is based on the “yellow pages” (section C), and contains, for each pair of routeing points, records describing each of the permitted lists of maps between them, or the ...
-
Type scraper Language python Status Public Ben Harris / WGA: designated NI bodies
17 lines of code. 116 rows of data.
Created 2 years, 2 months ago.
The Whole of Government Accounts cover a wide variety of public bodies. This scraper gets the current list of Northern Ireland bodies, which at present is in the Whole of Government Accounts (Designation of Bodies) Order (Northern Ireland) 2012. Happily, the order (like all current UK legislation) is available in a somewhat-semantic XML format, which makes extracting information easy.
-
Type scraper Language python Status Public Ben Harris / Scottish public authorities
20 lines of code. 607 rows of data.
Created 2 years, 9 months ago.
The Scottish Information Commissioner provides a convenient list of Schedule 1 Scottish public authorities. This simple scraper imports that list into ScraperWiki.
-
Type scraper Language python Status Public Ben Harris / NI Public Bodies
30 lines of code. 89 rows of data.
Created 2 years, 9 months ago.
This scraper rips the names of public bodies out of the Public Bodies & Public Appointments Annual Report 2010/2011. It is the source of a corresponding page in FOIwiki .
-
Type view Language python Status Protected Ben Harris / FOIwiki ScraperSync
254 lines of code.
Created 2 years, 10 months ago.
This is a slightly generalised mechanism for propagating changes from a scraped dataset listing public authorities into a corresponding page in FOIwiki. Some syncs that currently work: NHS foundation trusts Whole of Government Accounts It's still pretty wobbly — check any changes it proposes with "Show changes" before saving. More thorough documentation is on FOIwiki. This view is protected because ScaperWiki doesn't support rolling back changes to views.
-
Type scraper Language python Status Public Ben Harris / ICO Decision Notice bodies
8 lines of code. 1,312 rows of data.
Created 2 years, 10 months ago.
This is a trivial little scraper to rip the list of public authorities from the ICO's list of decision notices. For more comprehensive scrapers of the same dataset, try the ico tag.
-
Type scraper Language python Status Public Ben Harris / WhatDoTheyKnow bodies
31 lines of code. 19,750 rows of data.
Created 2 years, 10 months ago.
WhatDoTheyKnow provides a nice list of public bodies in CSV format. While this is lovely, having the list in a ScraperWiki database is even better.
-
Type scraper Language python Status Public Ben Harris / WGA: designated bodies
24 lines of code. 2,424 rows of data.
Created 2 years, 10 months ago.
The Whole of Government Accounts cover a wide variety of public bodies. This scraper gets the current list, which at present is in the Whole of Government Accounts (Designation of Bodies) Order 2012. Happily, the order (like all current UK legislation) is available in a somewhat-semantic XML format, which makes extracting information easy.
-
Type scraper Language python Status Public Ben Harris / South Cambs parishes
32 lines of code. 86 rows of data.
Created 2 years, 10 months ago.
This scraper tries to extract useful information about parishes in South Cambridgeshire from the district council's Web site. Currently it just gets the name of each parish and the email address of the parish clerk.
-
Type scraper Language python Status Public Ben Harris / Names of NHS Foundation Trusts
15 lines of code. No rows of data yet.
Created 2 years, 10 months ago.
This scraper just gets the names of all the NHS Foundation Trusts from the site of the NHSFT regulator, Monitor.
-
Type scraper Language python Status Public Dave Hughes / Cambridge Botanical Gardens Daily Temperatures
33 lines of code. 3,917 rows of data.
Created 3 years, 4 months ago.
University of Cambridge Botanical Gardens temperature data (2000-2010) This scraper provides daily minimum, maximum, and mean temperatures from the weather monitoring station at the botanical gardens of the University of Cambridge for the years 2000 (from January) to 2010 (up to September). The source is an Excel spreadsheet provided in response to a Freedom of Information (FOI) request. As such the data is static and not expected to change.