Publication:
ScraperWiki Tutorial

Date
2012-11-03
Abstract

The objective of the workshop, or rather hackathon, was to get the data into a structured format and join it with data from other sources, while giving an overview of scraping and showing by example what it makes possible. Thomas identified targets for web scraping, showed how to navigate the complexity of different types of web pages, and presented this material in several half-hour and hour-long modules that catered to different audiences.

Keywords
data collection, data cleaning, structured data, scraping, BigClean