New Project: STELLA
We are happy to annouce that DFG - German Research Foundation has accepted our grant application for the STELLA project (Infrastructure for Living Labs). Together with our partner of GESIS and ZB MED we will build up a new evaluation environment for retrieval and recommender systems. These online evaluation will will differ considerably from classical TREC studies and will enable researchers to use an evaluation method that was previously reserved only for industrial research.
Smart Harvesting II presentation and Hands-on Lab on 107th German Librarian's Day
Mandy Neumann introduced the DFG-funded Smart Harvesting II project to the attendants of the 107th German Librarian’s Day in Berlin. For this purpose she gave a presentation that included an overall view of the project, its objectives, the work packages already completed, and an outlook on future work. In addition Mandy Neumann and Christopher Michels from the University of Trier led a Hands-on Lab, in which the participants were introduced to OXPath on the basis of a concrete example and enabled to design their own expressions for their specific application cases.
Mandy Neumann participated in the JCDL 2018
From 3rd to 6th June Mandy Neumann participated in the Joint Conference on Digital Libraries (JCDL 2018) in Fort Worth, TX, USA. You can find our paper Prioritizing and Scheduling Conferences for Metadata Harvesting in dblp besides all other accepted papers on
2018.jcdl.org now. Also have a look at the presentation that was held.
Paper accepted for JCDL 2018 available at the ArXiv
We are happy to announce that our paper we wrote together with Christopher and Ralf from dblp on “ Prioritizing and Scheduling Conferences for Metadata Harvesting in dblp” was accepted the Joint Conference on Digital Libraries (JCDL 2018). As usual we deposited a preprint of the paper at arXiv.org.
Poster presentation at ID@NRW 2018
Mandy Neumann will present her PhD proposal related to the Smart Harvesting II project at the conference Innovationstag Digitalisierung NRW 2018 at the Rheinische Fachhochschule Köln on 1st of March 2018. The main goal of the ID@NRW 2018 is to provide a forum for PhD students and scientists from the Graduate Institute NRW to discuss their research projects.
New Project: PRIOR - PRepublicatIOn Radar
We are pleased to announce that the Google Digital News Initiative has approved funding for a prototype project we will carry out in collaboration with the SMC Lab. PRIOR, the PRepublicatIOn Radar, will be an integrated tool for science journalists to keep up to date with the latest scientific research, enabling them to detect and filter potentially interesting studies in a diverse set of scientific journals. Find out more about PRIOR on our project page.
New Project: ESUPOL - The Influence for Web Search Engines on Political Opinion Formations
The Ministry of Culture and Science of the German State of North Rhine-Westphalia has approved funding for a state-wide graduate institute on “Digital Societies”. Philipp Schaer (Professor for Information Retrieval at TH Köln, University of Applied Sciences) and Sven-Oliver Proksch (Cologne Center for Comparative Politics) will conduct an interdisciplinary project on the influence of search engines on political opinion formation. The project will collect large amounts of web data from various search engines and analyze them using natural language processing and investigate the effects on opinion formation using laboratory experiments.
Publication in Code4Lib Journal Vol. 38
We got an article published in the Code4Lib Journal (Issue 38): “ Web-Scraping for Non-Programmers: Introducing OXPath for Digital Library Metadata Harvesting”. Thanks to our co-author Jan Steinberg from GESIS! For a full list of publications check our group’s publication list.
New interview in Inside out magazine
In the latest issue of TH Köln’s Inside out magazine (in German) an interview with Prof. Schaer about his work on long tail web search is featured.
We would like to welcome our new colleague Mandy Neumann who joined us last week. She is going to work in the Smart Harvesting II project.
New Project: Smart Harvesting II
Within the Smart Harvesting project we would like to develop a ‘smart’ set of tools and workflows to allow non-programmers to build a rich set of web scrapers to build online bibliographies out of freely available web resources.