Publication in Code4Lib Journal Issue 38
Our article “Web-Scraping for Non-Programmers: Introducing OXPath for Digital Library Metadata Harvesting” has been published in the Code4Lib Journal (Issue 38). Thanks to our co-author Jan Steinberg from GESIS! For a full list of publications, see our group’s publication list.
New interview in Inside out magazine
The latest issue of TH Köln’s Inside out magazine (in German) features an interview with Prof. Schaer about his work on long-tail web search.
We would like to welcome our new colleague Mandy Neumann, who joined us last week. She will be working on the Smart Harvesting II project.
New Project: Smart Harvesting II
Within the Smart Harvesting project, we aim to develop a ‘smart’ set of tools and workflows that allow non-programmers to create a rich set of web scrapers for compiling online bibliographies from freely available web resources.