New Project: PRIOR - PRepublicatIOn Radar
We are pleased to announce that the Google Digital News Initiative has approved funding for a prototype project we will carry out in collaboration with the SMC Lab. PRIOR, the PRepublicatIOn Radar, will be an integrated tool for science journalists to keep up to date with the latest scientific research, enabling them to detect and filter potentially interesting studies in a diverse set of scientific journals. Find out more about PRIOR on our project page.
New Project: ESUPOL - The Influence of Web Search Engines on Political Opinion Formation
The Ministry of Culture and Science of the German State of North Rhine-Westphalia has approved funding for a state-wide graduate institute on “Digital Societies”. Within this institute, Philipp Schaer (Professor for Information Retrieval at TH Köln, University of Applied Sciences) and Sven-Oliver Proksch (Cologne Center for Comparative Politics) will conduct an interdisciplinary project on the influence of search engines on political opinion formation. The project will collect large amounts of web data from various search engines, analyze them using natural language processing, and investigate the effects on opinion formation in laboratory experiments.
Publication in Code4Lib Journal Vol. 38
Our article “Web-Scraping for Non-Programmers: Introducing OXPath for Digital Library Metadata Harvesting” has been published in the Code4Lib Journal (Issue 38). Thanks to our co-author Jan Steinberg from GESIS! For a full list of publications, check our group’s publication list.
New interview in Inside out magazine
The latest issue of TH Köln’s Inside out magazine (in German) features an interview with Prof. Schaer about his work on long tail web search.
We would like to welcome our new colleague Mandy Neumann, who joined us last week. She is going to work in the Smart Harvesting II project.
New Project: Smart Harvesting II
Within the Smart Harvesting project, we aim to develop a ‘smart’ set of tools and workflows that allow non-programmers to create a rich set of web scrapers for building online bibliographies out of freely available web resources.