In the joint effort against cancer, all parties in the German healthcare system are obligated to report diagnostics, treatments, progression, and follow-up information on tumor patients to the respective cancer registries. Given Germany's federal structure, the oncological basis dataset (oBDS) serves as the legally required national standard for oncological reporting. However, the use of different documentation software solutions leads to semantic and technical heterogeneity in the data, which complicates the establishment of research networks and collective data analysis. In this feasibility study, we evaluated whether all oBDS characteristics can be transferred to the standardized vocabularies, a metadata repository of the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). A total of 17,844 oBDS expressions were mapped, automatically or manually, to standardized concepts of the OMOP CDM. In a second step, we converted real patient data retrieved from the Hamburg Cancer Registry to the new terminologies. Using our pipeline, we transformed 1,773,373 cancer-related data elements to the OMOP CDM. Mapping the oBDS to the standardized vocabularies of the OMOP CDM promotes the semantic interoperability of oncological data in Germany. Moreover, it enables participation in network studies of the Observational Health Data Sciences and Informatics (OHDSI) community through federated analysis beyond the level of individual countries.
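The mapping step summarized in the abstract can be pictured as a lookup from each source expression to a standard concept. The following is only a minimal sketch under stated assumptions, not the pipeline used in the study: the field names, source values, and concept IDs are hypothetical placeholders, and the helpers `map_element` and `mapping_coverage` are invented for illustration. The one convention taken from the OMOP CDM itself is that source values without a standard concept receive `concept_id` 0.

```python
from dataclasses import dataclass

# OMOP CDM convention: source values with no standard concept get concept_id 0.
UNMAPPED_CONCEPT_ID = 0

# Hypothetical lookup table: (source field, source value) -> standard concept_id.
# Field names, values, and IDs are placeholders, not real oBDS or OMOP entries.
MAPPING = {
    ("laterality", "L"): 9000001,
    ("laterality", "R"): 9000002,
    ("therapy_intent", "K"): 9000003,
}


@dataclass(frozen=True)
class MappedElement:
    source_field: str
    source_value: str
    concept_id: int


def map_element(field, value, mapping=MAPPING):
    """Map one source data element; fall back to 0 when no mapping exists."""
    # Normalize whitespace and case of the value before the lookup.
    key = (field, value.strip().upper())
    return MappedElement(field, value, mapping.get(key, UNMAPPED_CONCEPT_ID))


def mapping_coverage(elements):
    """Return the fraction of elements that received a standard concept."""
    if not elements:
        return 0.0
    mapped = sum(1 for e in elements if e.concept_id != UNMAPPED_CONCEPT_ID)
    return mapped / len(elements)


if __name__ == "__main__":
    records = [("laterality", "L"), ("laterality", "X"), ("therapy_intent", "k")]
    elements = [map_element(f, v) for f, v in records]
    for e in elements:
        print(e)
    print("coverage:", mapping_coverage(elements))
```

Keeping unmapped values at concept ID 0 rather than dropping them preserves the source information and makes mapping coverage easy to measure when assessing feasibility.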