[1]王海鹏,杜方,宋丽娟,等.融合单词级段信息的中文医疗命名实体识别[J].计算机技术与发展,2024,34(06):110-117.[doi:10.20165/j.cnki.ISSN1673-629X.2024.0091]
 WANG Hai-peng,DU Fang,SONG Li-juan,et al.Research on Chinese Medical Named Entity Recognition Integrating Word-level Segment Information[J].,2024,34(06):110-117.[doi:10.20165/j.cnki.ISSN1673-629X.2024.0091]
点击复制

融合单词级段信息的中文医疗命名实体识别()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
34
期数:
2024年06期
页码:
110-117
栏目:
人工智能
出版日期:
2024-06-10

文章信息/Info

Title:
Research on Chinese Medical Named Entity Recognition Integrating Word-level Segment Information
文章编号:
1673-629X(2024)06-0110-08
作者:
王海鹏1杜方1宋丽娟1李婷2
1. 宁夏大学 信息工程学院,宁夏 银川 750021;2. 宁夏大学 数学统计学院,宁夏 银川 750021
Author(s):
WANG Hai-peng1DU Fang1SONG Li-juan1LI Ting2
1. School of Information Engineering,Ningxia University,Yinchuan 750021,China;2. School of Mathematics and Statistics,Ningxia University,Yinchuan 750021,China
关键词:
命名实体识别深度神经网络词级信息段级信息中文医疗信息处理
Keywords:
named entity recognition deep neural network word - level information segment - level information Chinese medical information processing
分类号:
TP391
DOI:
10.20165/j.cnki.ISSN1673-629X.2024.0091
摘要:
中文医疗命名实体识别(Named Entity Recognition,NER)是医学领域的一项基础任务,在知识图谱等许多下游任务中起着重要的作用。 常用的 NER 方法可分为基于词级信息和基于段级信息,已有研究表明两种信息融合能取得更好的性能。 目前,词级信息和段级信息融合的方法在中文医疗 NER 任务中还未被充分研究,且现有的融合方法为段中的每个单词赋予相同的权重,不考虑单词的不同贡献。 而医疗实体中每个单词和实体(段)有着不同的相关性,忽略这种相关性的差异将影响医疗 NER 的性能。 基于此,通过分析中文医疗实体特性,提出了一种单词级段信息抽取方法(Word-Level Segment Information Extraction,WL-SIE)。 该方法为实体中的每个单词分配一个权重矩阵集,学习单词与实体之间的关联信息,在与实体词组交互之后输出不同的单词级段信息。 在 CCKS2017 和 CMeEE 中文临床 NER 数据集上的实验结果表明,WL-SIE 方法较对比方法在 F1 值上提升了 3% ~ 5% ,特别是在实体样本不均衡场景下和长实体识别任务上表现出了优异的性能。
Abstract:
Chinese medical named entity recognition (NER) is a fundamental task in the field of medicine and plays an important role in various downstream tasks such as knowledge graphs. NER methods can generally be categorized into two types:word-level information and segment-level information. Some studies have shown that the fusion of the two types of information achieves better performance.However,the integration of word-level and segment-level information has not been thoroughly studied in Chinese medical NER task.Meanwhile,in the existing integration methods,each word in the segment is assigned with equivalent weight,which do not consider the different contribution of the word. Moreover,each word and entity ( segment) in medical entities have different correlations,ignoring these differences in correlations will decrease the performance of medical NER. Based on this, we propose a word - level segment information extraction method called WL-SIE by analyzing the characteristics of Chinese medical entities. This method assigns a weight matrix set to each word in the entity to learn different associative information between words and entities,and outputs word-level segment information after interacting with entity phrases. Experimental results on the CCKS2017 and CMeEE Chinese clinical NER datasets dem-onstrate that the WL-SIE method improves the F1 score by 3% to 5% compared to comparative methods,particularly in scenarios with imbalanced entity samples and long entity recognition tasks,showing outstanding performance.

相似文献/References:

[1]林舒都,邵曦.基于i-vector和深度学习的说话人识别[J].计算机技术与发展,2017,27(06):66.
 LIN Shu-du,SHAO Xi. Speaker Recognition with i-vector and Deep Learning[J].,2017,27(06):66.
[2]邓烜堃,万 良,马彦勤.深度稀疏修正神经网络在股票预测中的应用[J].计算机技术与发展,2018,28(09):199.[doi:10.3969/ j. issn.1673-629X.2018.09.041]
 DENG Xuan-kun,WAN Liang,MA Yan-qin.Application of Deep Sparse Modified Neural Network in Stock Forecasting[J].,2018,28(06):199.[doi:10.3969/ j. issn.1673-629X.2018.09.041]
[3]秦牧轩,荆晓远,吴 飞.基于公共空间嵌入的端到端深度零样本学习[J].计算机技术与发展,2018,28(11):44.[doi:10.3969/ j. issn.1673-629X.2018.11.010]
 QIN Mu-xuan,JING Xiao-yuan,WU Fei.End-to-end Deep Zero-shot Learning Based on Co-space Embedding[J].,2018,28(06):44.[doi:10.3969/ j. issn.1673-629X.2018.11.010]
[4]郎 波,樊一娜.基于深度神经网络的个性化学习行为评价方法[J].计算机技术与发展,2019,29(07):6.[doi:10. 3969 / j. issn. 1673-629X. 2019. 07. 002]
 LANG Bo,FAN Yi-na.Personalized Learning Behavior Evaluation Method Based on Deep Neural Network[J].,2019,29(06):6.[doi:10. 3969 / j. issn. 1673-629X. 2019. 07. 002]
[5]马文斌,夏国恩.基于深度神经网络的客户流失预测模型[J].计算机技术与发展,2019,29(09):76.[doi:10. 3969 / j. issn. 1673-629X. 2019. 09. 015]
 MA Wen-bin,XIA Guo-en.Customer Churn Prediction Model Based on Deep Neural Network[J].,2019,29(06):76.[doi:10. 3969 / j. issn. 1673-629X. 2019. 09. 015]
[6]陈 琛,刘小云,方玉华.融合注意力机制的电子病历命名实体识别[J].计算机技术与发展,2020,30(10):216.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 038]
 CHEN Chen,LIU Xiao-yun,FANG Yu-hua.Named Entity Recognition in Electronic Medical Record Introducing Attention Mechanisms[J].,2020,30(06):216.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 038]
[7]贾欣齐,李 睿,张志成,等.DenseNet-GRU:直肠癌 CT 影像分类的深度神经网络模型[J].计算机技术与发展,2021,31(03):111.[doi:10. 3969 / j. issn. 1673-629X. 2021. 03. 019]
 JIA Xin-qi,LI Rui,ZHANG Zhi-cheng,et al.DenseNet-GRU:A Deep Neural Network Model for CT Image Classification of Rectal Cancer[J].,2021,31(06):111.[doi:10. 3969 / j. issn. 1673-629X. 2021. 03. 019]
[8]王卫红,吕红燕,曹玉辉,等.基于 BERT 的混合神经网络实体识别方法[J].计算机技术与发展,2021,31(08):100.[doi:10. 3969 / j. issn. 1673-629X. 2021. 08. 017]
 WANG Wei-hong,LYU Hong-yan,CAO Yu-hui,et al.A Hybrid Neural Network Entity Recognition Method Based on BERT Model[J].,2021,31(06):100.[doi:10. 3969 / j. issn. 1673-629X. 2021. 08. 017]
[9]利向晴,夏国恩,张显全,等.基于深度神经网络权重集成的客户流失预测[J].计算机技术与发展,2021,31(10):18.[doi:10. 3969 / j. issn. 1673-629X. 2021. 10. 004]
 LI Xiang-qing,XIA Guo-en,ZHANG Xian-quan,et al.Customer Churn Prediction Based on Deep Neural Network Weight Ensemble[J].,2021,31(06):18.[doi:10. 3969 / j. issn. 1673-629X. 2021. 10. 004]
[10]王 俊,王修来*,栾伟先,等.基于 BERT 模型的科研人才领域命名实体识别[J].计算机技术与发展,2021,31(11):21.[doi:10. 3969 / j. issn. 1673-629X. 2021. 11. 004]
 WANG Jun,WANG Xiu-lai*,LUAN Wei-xian,et al.Research on Named Entity Recognition of Scientific Research Talents Field Based on BERT Model[J].,2021,31(06):21.[doi:10. 3969 / j. issn. 1673-629X. 2021. 11. 004]

更新日期/Last Update: 2024-06-10