Projects

PIXLS - Preprint Information eXtraction for Life Sciences

There is a large number of different preprint servers used by the research community, which differ both technically and in terms of content. In the PIXLS project we would like to systematically unlock the previously neglected information source preprint servers, make them more accessible through value-added services and ensure the reusability of the metadata and full texts obtained.

Duration

2023 - 2025

Partners

ZB MED - Information Centre for Life Sciences

Funding Organization

DFG - Deutsche Forschungsgemeinschaft

RESIRE - Reproducibility and simulation of interactive retrieval experiments

The main idea of the RESIRE project is to describe actual users of IR systems by quantitative modeling of their behavior and their content-related decisions during retrieval sessions. These model parameters can then form the basis for comparing two interactive IR experiments with regard to reproducibility, and can also be used for simulating interactive IR, e. g. for evaluating new systems or system variants without (or before) carrying out actual user experiments.

Duration

2023 - 2026

Partners

University of Duisburg-Essen

Funding Organization

DFG - Deutsche Forschungsgemeinschaft

STELLA II - Infrastructures for Living Labs

The STELLA project provides an innovative technology and methodology infrastructure that allows information providers to evaluate their information systems with the actual users of their web platforms. By incorporating the Living Lab principle, the systems can be evaluated iteratively and continuously, thus taking a big step towards a "Continues Evaluation".

Duration

2023 - 2025

Partners

GESIS - Leibniz Institute for the Social Sciences
ZB MED - Information Centre for Life Sciences

Funding Organization

DFG - Deutsche Forschungsgemeinschaft

Dissertations in Information Science - Analysis of a heterogeneous discipline

Information science is characterized as a very heterogeneous and multidisciplinary scientific discipline. In this project, various methods will be used to extract and created a corpus thath will contain only dissertations that are dissertations relevant to information science.

Duration

2022-09 - 2023-02

Funding Organization

German National Library - DH Fellowship

JoIE - Journalistic Information Extraction

The project Journalistic Information Extraction (JoIE) aims to address the problem of information extraction from unstructured sources, that are relevant for (data) journalism. Based on the two state-of-the-art tools Workbench and Fonduer, a solution will be developed that can handle the different web data sources and makes them usable for journalism by putting them into a structured form.

Duration

2020 - 2023

Partners

Science Media Center

Funding Organization

Klaus Tschira Stiftung

STELLA - Infrastructures for Living Labs

The STELLA project aims to create an evaluation infrastructure that allows to evaluate search and recommendation services within productive web-based search systems with real users. STELLA provides an integrated e-Research environment that allows researchers in the field of information retrieval and recommendation services to conduct studies with real users in real environments. The experimental set-ups differ considerably from classical TREC studies, which can only be carried out offline, or also from user studies, which only allow laboratory experiments, and thus enable researchers to use an evaluation method that was previously reserved only for industrial research or the operators of large online platforms.

Duration

2018 - 2022

Partners

GESIS - Leibniz Institute for the Social Sciences
ZB MED - Information Centre for Life Sciences

Funding Organization

DFG - Deutsche Forschungsgemeinschaft

ESUPOL - Einfluss von Suchmaschinen auf die politische Meinungsbildung

In this project, the question of how search engines can influence political opinion-forming and political issues is to be addressed, and what influence factors such as 'filter bubbles', collaborative filtering and the lack of users' search or media competence have on these processes.

Duration

2018 - 2022

Partners

University of Cologne

Funding Organization

Ministerium für Kultur und Wissenschaft NRW

PRIOR - PRepublicatIOn Radar

PRIOR, the PRepublicatIOn Radar, will be an integrated tool for science journalists to keep up with the latest scientific research in important domains of knowledge. It will enable them to detect and filter potentially interesting studies in a diverse set of scientific journals. The challenge is to deal with unstructured and heterogeneous incoming information types. PRIOR will extract, harmonize and process new embargoed research publications to allow searching, browsing and filtering. The prototype will work with two modules: a data extraction and harmonization framework as well as a web-based user interface to find new and filter relevant scientific publications.

Duration

2017-03 - 2018-03

Partners

Science Media Center

Funding Organization

Google Digital News Initiative

Smart Harvesting II

Within the Smart Harvesting project we would like to develop a 'smart' set of tools and workflows to allow non-programmers to build a rich set of web scrapers to build online bibliographies out of freely available web resources.

Duration

2016 - 2019

Partners

dblp - Computer Science Bibliography @ University of Trier
GESIS - Leibniz Institute for the Social Sciences

Funding Organization

DFG - Deutsche Forschungsgemeinschaft

Information Retrieval Research Group

IR Research Group

Technische Hochschule Köln

PIXLS - Preprint Information eXtraction for Life Sciences

RESIRE - Reproducibility and simulation of interactive retrieval experiments

STELLA II - Infrastructures for Living Labs

Dissertations in Information Science - Analysis of a heterogeneous discipline

JoIE - Journalistic Information Extraction

STELLA - Infrastructures for Living Labs

ESUPOL - Einfluss von Suchmaschinen auf die politische Meinungsbildung

PRIOR - PRepublicatIOn Radar

Smart Harvesting II