TY - JOUR PY - 2023// TI - Exploring named entity recognition and relation extraction for ontology and medical records integration JO - Informatics in medicine unlocked A1 - da Silva, Diego Pinheiro A1 - da Rosa Fröhlich, William A1 - de Mello, Blanda Helena A1 - Vieira, Renata A1 - Rigo, Sandro José SP - e101381 EP - e101381 VL - 43 IS - N2 - The available natural language data in electronic health records is of noteworthy interest to health research and development. Nevertheless, their manual analysis is not feasible and poses a challenge to accessing valuable information in these records. This paper presents an approach to automatically extract information from these unstructured medical records using Domain Entity Recognition and Relation Extraction, structuring the results through a domain ontology. We developed our work in the oncology domain, an attention-demanding field. The main contribution of this work lies in integrating multiple resources in a complete methodology to accomplish this task. We developed a new entity and relation annotated dataset of medical evolutions in Brazilian Portuguese, containing 1622 documents, 146,769 entities, and 111,716 relations. We attained 78.24 % accuracy for entity and relation extraction in the exams domain. Healthcare specialists evaluated the approach regarding entity recognition and relation extraction positively and considered the methodology valuable to health professionals.
Language: en
LA - en SN - 2352-9148 UR - http://dx.doi.org/10.1016/j.imu.2023.101381 ID - ref1 ER -