University of Limerick
Browse

DCM-CNER: A dual-channel model for clinical named entity recognition based on embedded ConvNet and gated dilated CNN

Download (1.97 MB)
journal contribution
posted on 2024-08-14, 13:27 authored by Lin Shi, Wenyan Zhou, Yafeng Wu, Na Yuan, Xiangzhen Zang, Zhanlin Ji, Ivan GanchevIvan Ganchev

As the volume of Chinese electronic medical records (EMRs) experiences an explosive growth, the application of clinical named entity recognition (CNER) technology becomes crucial for the effective utilization of EMR data and practical implementation of evidence-based medicine. While mainstream models excel in capturing global contextual information, their feature extraction mechanisms tend to be unidimensional, limiting their information retrieval capabilities. To address this issue, this paper proposes a Dual-Channel Model for CNER (DCM-CNER), which enhances feature extraction through the introduction of a customized convolutional stack, denoted as emConvNet (embedded ConvNet), for local feature extraction. The model further addresses the vanishing gradient issue by employing a newly designed Gated Dilated Convolutional Neural Network (GDCNN) module with a residual structure. Sequential and contextual information is extracted from the text using a Bidirectional Long Short-Term Memory (BiLSTM) layer. Additionally, a multi-head bilinear attention mechanism is introduced for parallel dynamic feature fusion. Experimental results demonstrate the superiority of the proposed DCM-CNER model, in comparison to the existing mainstream models and state-of-the-art models, achieving F1 scores of 94.15%, 85.26%, and 84.21% on the CCKS2017, CCKS2019, and CLUENER2020 datasets, respectively, thereby validating its effectiveness in performing the task of Chinese CNER in EMRs.


History

Publication

IEEE Access, vol. 12, pp. 97726-97738

Publisher

Institute of Electrical and Electronics Engineers

Other Funding information

This work was supported in part by the National Key Research and Development Program of China under Grant 2017YFE0135700; in part by Bulgarian National Science Fund (BNSF) under Grant ; and in part by the Telecommunications Research Centre (TRC) of the University of Limerick, Ireland

Department or School

  • Electronic & Computer Engineering

Usage metrics

    University of Limerick

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC