[1]王卫红,吕红燕,曹玉辉,等.基于 BERT 的混合神经网络实体识别方法[J].计算机技术与发展,2021,31(08):100-105.[doi:10. 3969 / j. issn. 1673-629X. 2021. 08. 017]
 WANG Wei-hong,LYU Hong-yan,CAO Yu-hui,et al.A Hybrid Neural Network Entity Recognition Method Based on BERT Model[J].,2021,31(08):100-105.[doi:10. 3969 / j. issn. 1673-629X. 2021. 08. 017]
点击复制

基于 BERT 的混合神经网络实体识别方法()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
31
期数:
2021年08期
页码:
100-105
栏目:
网络与安全
出版日期:
2021-08-10

文章信息/Info

Title:
A Hybrid Neural Network Entity Recognition Method Based on BERT Model
文章编号:
1673-629X(2021)08-0100-06
作者:
王卫红吕红燕曹玉辉霍 峥
河北经贸大学 信息技术学院,河北 石家庄 050061
Author(s):
WANG Wei-hongLYU Hong-yanCAO Yu-huiHUO Zheng
School of Information Technology,Hebei University of Economics and Business,Shijiazhuang 050061,China
关键词:
命名实体识别BERT 模型卷积神经网络双向长短期记忆网络条件随机场
Keywords:
named entity recognitionBERT modelconvolutional neural networkbi-directional long short-term memoryconditionalrandom field
分类号:
TP391
DOI:
10. 3969 / j. issn. 1673-629X. 2021. 08. 017
摘要:
针对命名实体识别方法中语义分析不足及准确率较低的问题, 提出一种基于 BERT 模型的混合神经网络实体识别方法。 对命名实体识别研究现状进行了调查与分析, 发现? 现有命名实体识别研究中存在数据分析与特征提取不充分导致准确率较低的问题。 利用 BERT 预训练语言模型动态生成字的语义向量,丰富其文本特征。使用卷积神经网 络(convolutional neural network,CNN)模型再次抽取语义特征,实现语义的自动抽取,二者联合作为下一步的输入向量。 采用引入注意力机制的双向长短时记忆网络(bi-directional long short-term memory,BiLSTM)获取单个字在字符级别上前后两个方向上的信息。 通过条件随机场(conditional random field,CRF)模型解码序列标签,得到全局最优标注序列。 在《人民日报》和 MSRA 两个数据集上的实验结果表明,该方法相比于其他模型,能有效地获取语义信息,在准确率、召回率和 F1值上均有所提升。
Abstract:
Aiming at the problem of insufficient semantic analysis and low accuracy in named entity recognition method, a hybrid neural network entity recognition method based? ?on BERT model is proposed. The research status of named entity recognition was investigated and an alyzed,and it was found that the problem of low accuracy resulted from insufficient data analysis and feature extraction existed in the research of named entity recognition. The semantic vector of the word is generated dynamically by using BERT pre-training language model to enrich its text features. The semantic features are extracted again using the convolutional neural network (CNN) model to realize the automatic semantic extraction,and the two are combined as the next step of the input vector. BiLSTM is used to obtain the information of a single word in two directions before and after the character level. The conditional random field (CRF) model was used to decode the sequence tags and obtain the global optimal labeling sequence. Experiments on two data sets of People’s Daily and MSRA show that compared with other models, the proposed method can effectively obtain semantic information, and it is improved in accuracy, recall rate and F1 value.

相似文献/References:

[1]陈 琛,刘小云,方玉华.融合注意力机制的电子病历命名实体识别[J].计算机技术与发展,2020,30(10):216.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 038]
 CHEN Chen,LIU Xiao-yun,FANG Yu-hua.Named Entity Recognition in Electronic Medical Record Introducing Attention Mechanisms[J].,2020,30(08):216.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 038]
[2]周亦敏,黄 俊.基于 BERT 的学术合作者推荐研究[J].计算机技术与发展,2021,31(03):45.[doi:10. 3969 / j. issn. 1673-629X. 2021. 03. 008]
 ZHOU Yi-min,HUANG Jun.Research on BERT-based Academic Collaborator Recommendation[J].,2021,31(08):45.[doi:10. 3969 / j. issn. 1673-629X. 2021. 03. 008]
[3]潘理虎,赵彭彭,龚大立,等.煤矿事故案例命名实体识别方法研究[J].计算机技术与发展,2022,32(02):154.[doi:10. 3969 / j. issn. 1673-629X. 2022. 02. 025]
 PAN Li-hu,ZHAO Peng-peng,GONG Da-li,et al.Combined ALBERT for Named Entity Recognition in Coal Mine Accident Cases[J].,2022,32(08):154.[doi:10. 3969 / j. issn. 1673-629X. 2022. 02. 025]
[4]尚福华,蒋毅文,曹茂俊.一种增强的多粒度特征融合语义匹配模型[J].计算机技术与发展,2022,32(07):28.[doi:10. 3969 / j. issn. 1673-629X. 2022. 07. 005]
 SHANG Fu-hua,JIANG Yi-wen*,CAO Mao-jun.An Enhanced Multi Granularity Feature Fusion Model for Semantic Matching[J].,2022,32(08):28.[doi:10. 3969 / j. issn. 1673-629X. 2022. 07. 005]
[5]范禹辰,刘相坤,朱建生,等.基于 BERT 的服务网站 Web 攻击检测研究[J].计算机技术与发展,2022,32(08):168.[doi:10. 3969 / j. issn. 1673-629X. 2022. 08. 027]
 FAN Yu-chen,LIU Xiang-kun,ZHU Jian-sheng,et al.Research on Web Attack Detection of Service Website Based on BERT[J].,2022,32(08):168.[doi:10. 3969 / j. issn. 1673-629X. 2022. 08. 027]
[6]刘华玲,孙 毅.基于实体识别和信息融合的知识图谱研究[J].计算机技术与发展,2022,32(09):107.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 017]
 LIU Hua-ling,SUN Yi.Knowledge Graph Based on Entity Recognition and Information Fusion--A Case Study of COVID-19[J].,2022,32(08):107.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 017]
[7]孙安亮,时宏伟,王金策.基于字符与单词嵌入的航空安全命名实体识别[J].计算机技术与发展,2022,32(09):148.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 023]
 SUN An-liang,SHI Hong-wei,WANG Jin-ce.Named Entity Recognition Based on Character and Word Embedding in Aviation Safety[J].,2022,32(08):148.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 023]
[8]杜睿山,陈思路,刘文豪.基于岩石文本信息的命名实体识别[J].计算机技术与发展,2022,32(09):188.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 029]
 DU Rui-shan,CHEN Si-lu,LIU Wen-hao.Named Entity Recognition Based on Rock Text Information[J].,2022,32(08):188.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 029]
[9]胡慧婷,李建平,董振荣,等.基于 BERT 模型的教育技术学领域实体抽取[J].计算机技术与发展,2022,32(10):164.[doi:10. 3969 / j. issn. 1673-629X. 2022. 10. 027]
 HU Hui-ting,LI Jian-ping,DONG Zhen-rong,et al.Named Entity Recognition Method in Educational Technology Field Based on BERT[J].,2022,32(08):164.[doi:10. 3969 / j. issn. 1673-629X. 2022. 10. 027]
[10]罗 峦,夏骄雄.融合 ERNIE 与改进 Transformer 的中文 NER 模型[J].计算机技术与发展,2022,32(10):120.[doi:10. 3969 / j. issn. 1673-629X. 2022. 10. 020]
 LUO Luan,XIA Jiao-xiong.Research on Chinese Named Entity Recognition Combining ERNIE with Improved Transformer[J].,2022,32(08):120.[doi:10. 3969 / j. issn. 1673-629X. 2022. 10. 020]
[11]王 俊,王修来*,栾伟先,等.基于 BERT 模型的科研人才领域命名实体识别[J].计算机技术与发展,2021,31(11):21.[doi:10. 3969 / j. issn. 1673-629X. 2021. 11. 004]
 WANG Jun,WANG Xiu-lai*,LUAN Wei-xian,et al.Research on Named Entity Recognition of Scientific Research Talents Field Based on BERT Model[J].,2021,31(08):21.[doi:10. 3969 / j. issn. 1673-629X. 2021. 11. 004]

更新日期/Last Update: 2021-08-10