Publication: ScraperWiki Tutorial
Date
2012-11-03
Abstract
The objective of the workshop, or rather hackathon, was to get the data into a structured format and join it with data from other sources, while giving an overview and showing by example what is possible with scraping. Thomas covered identifying targets for web scraping and navigating the complexity of different types of web pages, presenting the material in several half-hour and hour-long modules that catered to different audiences.
Keywords
data collection, scraping, data cleaning, structured data, BigClean