SePaMiM: Sequential Pattern Mining and Pattern Matching of Disease and Treatment Courses for Clinical Cancer Registries

Motivation

With the establishment of the clinical state cancer registries under §65c SGB V and the introduction of the nationwide ADT-GEKID basic data set, data on the treatment and course of cancer are now collected as well. Because these data are complex, evaluating them without suitable procedures and tools currently requires a disproportionate amount of manual effort. The SePaMiM project investigates to what extent suitable IT systems can support the analysis of complex treatment and disease courses from registry data.

Goal

SePaMiM aims to support epidemiologists in clinical cancer registries in the analysis of disease and treatment course data by means of suitable IT systems.

Technologies

There are two main approaches to analysing sequential data. The first searches the course data for sequences that match a given pattern (pattern matching). The second comes from data mining and uses algorithms that find interesting or frequently occurring patterns in the sequence data. SePaMiM pursues both approaches.

First, the staff of the clinical cancer registries are to be supported in selecting patient cohorts on the basis of specific treatment courses. This should enable them to answer concrete questions, for example in the context of a quality conference.
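Cohort selection by treatment course can be illustrated with a minimal Python sketch. It is not the project's implementation, and it is not written in the Clinical Quality Language named in the publications. The patient IDs, the event labels, and the functions `matches` and `select_cohort` are hypothetical. The sketch assumes that each patient's course is an ordered list of event labels and that a pattern matches when its steps occur in the same order, with other events allowed in between.

```python
from typing import Dict, List, Sequence


def matches(course: Sequence[str], pattern: Sequence[str]) -> bool:
    """Return True if `pattern` occurs in `course` as an ordered subsequence."""
    events = iter(course)
    # `step in events` consumes the iterator up to the first hit,
    # so later steps can only match later events.
    return all(step in events for step in pattern)


def select_cohort(courses: Dict[str, List[str]], pattern: Sequence[str]) -> List[str]:
    """Return the sorted IDs of all patients whose course matches the pattern."""
    return sorted(pid for pid, course in courses.items() if matches(course, pattern))


# Hypothetical example data, not taken from any registry.
courses = {
    "P1": ["diagnosis", "surgery", "chemotherapy", "radiotherapy"],
    "P2": ["diagnosis", "chemotherapy", "surgery"],
    "P3": ["diagnosis", "surgery", "radiotherapy"],
}

# Which patients received radiotherapy at some point after surgery?
cohort = select_cohort(courses, ["surgery", "radiotherapy"])
print(cohort)  # ['P1', 'P3']
```

A real query would probably also need time constraints between events and attributes such as diagnosis codes, which this sketch leaves out.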

Building on this, artificial intelligence methods, in particular from the field of data mining, will then be used to discover patterns in the course data automatically. Sequential pattern mining is one such family of methods: its algorithms find frequent or otherwise interesting patterns in sequence data.
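The idea of frequent sequential patterns can be sketched in a few lines of Python. This is a brute-force enumeration for illustration only, not an established algorithm such as PrefixSpan or GSP, and not the method used in the project. The example data and the function `frequent_patterns` are hypothetical. The support of a pattern is taken here to be the number of patients whose course contains it as an ordered subsequence.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, Iterable, Sequence, Tuple

Pattern = Tuple[str, ...]


def frequent_patterns(
    courses: Iterable[Sequence[str]], min_support: int, max_length: int = 3
) -> Dict[Pattern, int]:
    """Count ordered subsequences up to `max_length` and keep the frequent ones."""
    support: Counter = Counter()
    for course in courses:
        seen = set()
        for length in range(1, max_length + 1):
            # combinations() keeps the original event order within each tuple.
            seen.update(combinations(course, length))
        # Using a set means each patient counts at most once per pattern.
        support.update(seen)
    return {p: n for p, n in support.items() if n >= min_support}


# Hypothetical example data, not taken from any registry.
courses = [
    ["diagnosis", "surgery", "chemotherapy", "radiotherapy"],
    ["diagnosis", "chemotherapy", "surgery"],
    ["diagnosis", "surgery", "radiotherapy"],
]

patterns = frequent_patterns(courses, min_support=2, max_length=2)
for pattern, count in sorted(patterns.items(), key=lambda item: (-item[1], item[0])):
    print(count, " -> ".join(pattern))
```

The number of candidate subsequences grows quickly with course length, which is why dedicated mining algorithms prune the search space instead of enumerating everything.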

Publications
SePaMiM – an online tool for analyzing course-of-disease data in German cancer registries using CQL

Kolja Blohm, David Korfkamp, Christian Lüpkes, Andreas Hein; 68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e.V. (GMDS); 2023

The Clinical Quality Language as a tool to support data analysis in German clinical cancer registries

Kolja Blohm, David Korfkamp, Joachim Hübner, Florian Oesterling, Stefanie Schulze, Andreas Hein; GMS Medizinische Informatik, Biometrie und Epidemiologie; 08/2024

Clustering breast cancer patients based on their course of treatment

Kolja Blohm, David Korfkamp, Christian Lüpkes; Gesundheit – gemeinsam. Kooperationstagung der GMDS, DGSMP, DGEpi, DGMS und der DGPH; 2024

Partners
Landeskrebsregister Nordrhein-Westfalen
www.landeskrebsregister.nrw
Epidemiologisches Krebsregister Niedersachsen, OFFIS CARE GmbH
www.krebsregister-niedersachsen.de

Duration

Start: 01.04.2021
End: 31.03.2024

Related projects

CARLOS

Cancer Registry Lower-Saxony

MUSTANG

Multidimensional Statistical Data Analysis Engine

VersKiK

Care, care demand and care needs of persons after cancer in childhood or adolescence