[1]何阳宇,易晓宇,唐 亮,等.基于BLSTM-ATT的老挝语军事领域实体关系抽取[J].计算机技术与发展,2021,31(05):31-37.[doi:10. 3969 / j. issn. 1673-629X. 2021. 05. 006]
 ,,et al.LaoEntityRelationExtractioninMilitaryDomainBasedonBLSTM andAttentionMechanism[J].,2021,31(05):31-37.[doi:10. 3969 / j. issn. 1673-629X. 2021. 05. 006]
点击复制

基于BLSTM-ATT的老挝语军事领域实体关系抽取()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
31
期数:
2021年05期
页码:
31-37
栏目:
大数据分析与挖掘
出版日期:
2021-05-10

文章信息/Info

Title:
LaoEntityRelationExtractioninMilitaryDomainBasedonBLSTM andAttentionMechanism
文章编号:
1673-629X(2021)05-0031-07
作者:
何阳宇1易晓宇1唐 亮1易绵竹1李宏欣12
1.解放军战略支援部队信息工程大学洛阳校区,河南洛阳471003
2.密码科学技术国家重点实验室,北京100878
Author(s):
HEYang-yuYIXiao-yuTANGLiangYIMian-zhuLIHong-xin
1.LuoyangCampus,PLAStrategicSupportForceInformationEngineeringUniversity,Luoyang471003,China
2.StateKeyLaboratoryofCryptology,Beijing100878,China
关键词:
双向长短期记忆网络多头注意力老挝语军事领域实体关系抽取
Keywords:
bidirectionallongshort-termmemorynetworksmulti-headattentionLaomilitarydomainentityrelationextractio
分类号:
TP391
DOI:
10. 3969 / j. issn. 1673-629X. 2021. 05. 006
摘要:
为了对互联网上大量的老挝语军事类文本进行结构化分析,该文提出了一种基于双向长短期记忆网络和多头自注意力机制的军事领域实体关系抽取方法。针对老挝语语料匮乏问题,提出了“硬匹配”和“软匹配”的思想,在完成语料获取和预处理的基础上,利用预定义的关系词表进行“硬匹配”,之后再通过词典匹配和相似度计算相结合的方法进行“软匹配”,以提高关系类型的泛化能力,进而自行构建了关系抽取标注语料库;然后,通过分析老挝语语言特点,融入了词、词性、实体类型、相对位置关系等特征进行模型训练,并设置了四轮针对不同变量的对比实验,验证了不同的神经网络模型、注意力机制、嵌入的特征以及语料规模对抽取效果的影响程度,实验结果表明融合双向长短期记忆网络和多头自注意力的方法对老挝语军事领域实体关系抽取具有更好的性能。
Abstract:
InordertoconductastructuredanalysisofalargenumberofLaomilitarytextsontheInternet,weproposeamethodforextractingentityrelationshipinmilitarydomainbasedonabidirectionalLSTM networkandmulti-headself-attentionmechanism.InviewofthelackofLaocorpus,theideaof“hardmatching”and“softmatching”isproposed.Basedonthecompletionofcorpusacquisitionandpreprocessing,thepre-definedrelationvocabularyisusedfor“hardmatching”,andthenthemethodcombineddictionarymatchingwithsimilaritycalculationisusedtoperform “softmatching”toimprovethegeneralizationabilityofrelationtypes,thenarelationextractioncorpusisbuiltonitsown.ByanalyzingthecharacteristicsofLaolanguage,thefeaturessuchasword,partofspeech,entitytypeandrelativepositionrelationshipareincorporatedformodeltraining,andfourroundsofcomparativeexperimentsfordifferentvariablesaresetuptoverifytheeffectofdifferentneuralnetworkmodels,attentionmechanisms,embeddedfeaturesandcorpussizeontheextractioneffect.TheexperimentshowsthatthefusionofthebidirectionalLSTM networkandmulti-headself-attentionmechanismhavebetterperformanceforLaoentityrelationextractioninmilitarydomain.

相似文献/References:

[1]周 康,曲卫东,杨艺琛.基于增强 BiLSTM 的网络文章核心实体识别[J].计算机技术与发展,2021,31(01):7.[doi:10. 3969 / j. issn. 1673-629X. 2021. 01. 002]
 ZHOU Kang,QU Wei-dong,YANG Yi-chen.Core Entity Recognition of Web Articles Based on Enhanced BiLSTM[J].,2021,31(05):7.[doi:10. 3969 / j. issn. 1673-629X. 2021. 01. 002]
[2]王卫红,吕红燕,曹玉辉,等.基于 BERT 的混合神经网络实体识别方法[J].计算机技术与发展,2021,31(08):100.[doi:10. 3969 / j. issn. 1673-629X. 2021. 08. 017]
 WANG Wei-hong,LYU Hong-yan,CAO Yu-hui,et al.A Hybrid Neural Network Entity Recognition Method Based on BERT Model[J].,2021,31(05):100.[doi:10. 3969 / j. issn. 1673-629X. 2021. 08. 017]
[3]孙安亮,时宏伟,王金策.基于字符与单词嵌入的航空安全命名实体识别[J].计算机技术与发展,2022,32(09):148.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 023]
 SUN An-liang,SHI Hong-wei,WANG Jin-ce.Named Entity Recognition Based on Character and Word Embedding in Aviation Safety[J].,2022,32(05):148.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 023]
[4]胡慧婷,李建平,董振荣,等.基于 BERT 模型的教育技术学领域实体抽取[J].计算机技术与发展,2022,32(10):164.[doi:10. 3969 / j. issn. 1673-629X. 2022. 10. 027]
 HU Hui-ting,LI Jian-ping,DONG Zhen-rong,et al.Named Entity Recognition Method in Educational Technology Field Based on BERT[J].,2022,32(05):164.[doi:10. 3969 / j. issn. 1673-629X. 2022. 10. 027]
[5]毛宏亮,艾孜尔古丽,陈德刚.基于多头注意力的电网调度领域命名实体识别[J].计算机技术与发展,2023,33(02):181.[doi:10. 3969 / j. issn. 1673-629X. 2023. 02. 027]
 MAO Hong-liang,Azragul,CHEN De-gang.Named Entity Recognition in Grid Dispatch Domain Based on Multi-headed Attention[J].,2023,33(05):181.[doi:10. 3969 / j. issn. 1673-629X. 2023. 02. 027]
[6]李春生,田梦晴,张可佳.基于 Bi-LSTM 网络的管道异常数据检测方法[J].计算机技术与发展,2023,33(06):215.[doi:10. 3969 / j. issn. 1673-629X. 2023. 06. 032]
 LI Chun-sheng,TIAN Meng-qing,ZHANG Ke-jia.Pipeline Anomaly Data Detection Method Based on CNN and Bi-LSTM Network[J].,2023,33(05):215.[doi:10. 3969 / j. issn. 1673-629X. 2023. 06. 032]

更新日期/Last Update: 2020-05-10