I like to scrape, I run the site edms.org.uk (which doesn't use scraperwiki). I can be found on twitter @mazadillon
Scrapers and views (15)
-
Type scraper Language python Status Public Matt / OCC Chart Archive
32 lines of code. 108,969 rows of data.
Created 9 months ago.
-
Type view Language python Status Protected Matt / Yelvertoft RSS
25 lines of code.
Created 1 year, 1 month ago.
-
Type scraper Language python Status Public Matt / Yelvertoft Scraper
13 lines of code. 21 rows of data.
Created 1 year, 1 month ago.
-
Type scraper Language python Status Public Matt / Import HOL File
11 lines of code. No rows of data yet.
Created 1 year, 2 months ago.
-
Type scraper Language python Status Protected Matt / mg Councillor Scraper
30 lines of code. 133 rows of data.
Created 1 year, 3 months ago.
Scrapes councillor information from council websites that run the moderngov system.
-
Type scraper Language python Status Protected Matt / Welsh Assembly Members
23 lines of code. 60 rows of data.
Created 1 year, 4 months ago.
Scrape a list of Welsh Assembly members from the Welsh Assembly website.
-
Type scraper Language python Status Public Matt / Email Alert Scraper
3 lines of code. No rows of data yet.
Created 2 years, 2 months ago.
-
Type view Language python Status Public Matt / Irish President's Engagements
17 lines of code.
Created 2 years, 5 months ago.
View what the Irish President has been up to, when and where.
-
Type view Language python Status Public Matt / NMR AI Codes
33 lines of code.
Created 2 years, 5 months ago.
Search for NMR bull AI codes
-
Type view Language python Status Public Matt / OCC Chart CSV
32 lines of code.
Created 2 years, 5 months ago.
Displays the current chart data, use the query string chart=singles or chart=albums to choose which chart to load.
-
Type scraper Language python Status Public Matt / OCC Charts
25 lines of code. 24,800 rows of data.
Created 2 years, 5 months ago.
Scrapes the official albums and singles charts from the official charts company.
-
Type scraper Language python Status Public Matt / NMR AI Codes
70 lines of code. 7,589 rows of data.
Created 2 years, 5 months ago.
This scrapes the list of AI codes from the National Milk Records website at http://www.nmr.co.uk/ai-codes it downloads each PDF file and converts it by running pdf2txt.py locally, then it scans for the information contained.
-
Type view Language php Status Public Matt / UCB Airplay Charts
24 lines of code.
Created 2 years, 6 months ago.
-
Type scraper Language php Status Public Matt / UCB Playlists
30 lines of code. 326,081 rows of data.
Created 2 years, 7 months ago.
This scrapes the "played yesterday" playlists for the three radio stations produced by United Christian Broadcasters (UCB). It lists the date and time each song was played, along with the station, artist and song title.
-
Type scraper Language php Status Public Matt / Irish President Engagements
131 lines of code. 6,075 rows of data.
Created 2 years, 8 months ago.
This collects data on the Irish President's engagements, including historical data going back to 1997. All of the information is scraped from the President's website