The latest news from The Hyve on Open Source solutions for bioinformatics

Recent Posts

OMOP data curation: Automatic Mapping of Drug Concepts to RxNorm

February 02, 2017 | By Maxim Moinat

An important part of ETL to the OMOP CDM is mapping the source concepts to the standard concepts in the OMOP vocabulary. This can be as straight forward as mapping ICD-10 codes to SNOMED or as difficult as mapping a national drug vocabulary to RxNorm.

Drug mapping is challenging because, in contrast to other mappings, it has multiple components that have to be mapped. These components are ingredient, dose form and strength.

The goal of this EMIF project was to automate the drug mapping procedure for a Danish dataset of drugs. The dataset contained the drug name, the ATC code, the dose form (in Danish) and a numeric strength as well as a unit. An example is given below:

vnr Name ATC Dose form Strength strnum strunit
148 Loratadin "HEXAL" R06AX13 tabletter 10 mg 10 MG


The poster below, as presented during the OHDSI symposium 2016, describes the mapping procedure and the results. We managed to map 91% of the drugs to a drug in the RxNorm vocabulary.

Learn more about our OHDSI and OMOP services or contact us to see how we can help you leverage real world health data!

Mapping of drug concepts into the OMOP datamodel

Click here to read the full size PDF of the poster.