[1]王蕾 杨季文.基于属性标记的专有名词自动识别研究[J].计算机技术与发展,2006,(11):195-198.
 WANG Lei,YANG Ji-wen.Recognition of Chinese Proper Noun Based on Attribute Tag[J].,2006,(11):195-198.
点击复制

基于属性标记的专有名词自动识别研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2006年11期
页码:
195-198
栏目:
应用开发研究
出版日期:
1900-01-01

文章信息/Info

Title:
Recognition of Chinese Proper Noun Based on Attribute Tag
文章编号:
1673-629X(2006)11-0195-04
作者:
王蕾 杨季文
苏州大学计算机科学与技术学院
Author(s):
WANG Lei YANG Ji-wen
School of Computer Science and Technology, Suzhou University
关键词:
中文专有名词识别未登录词识别属性标注基于转换的错误驱动学习方法
Keywords:
Chinese proper noun rceognitionunknown words recognitionattribute tagtransfomation-bascd error drive learning
分类号:
TP391.1
文献标志码:
A
摘要:
提出了一种新的基于属性标记的专有名词统一识别方法。其基本思想是:根据专有名词的成词特点,利用标注语料库,设定词语属性作为标准属性重新进行标注,在此语料基础上进行专有名词成词结构、成词环境的实例提取.并采用基于转换的错误驱动方法对提取的实例进行适用规则提取,在提取的实例和规则的基础上进行属性标注,是一种基于转换的错误驱动规则自学习方法与基于实例的学习方法相结合的基于浅层句法分析的一种新的识别专有名词的方法。实验证明该方法在测试样本集上准确率达到95.3%.召回率达到92.5%.是一种有效的专有名词识别方法
Abstract:
Introduces a new method to identify the Chinese proper noun. It is based on attribute tag, The basic thinking is : according the characteristics about the Chinese proper noun compages, using label corpus, enact the words attribute to be the standard attribute and relabeled it. Based on the corpus,distilling the Chinese proper noun instances about compares configuration and compages environnwnt, using the transfomiation - based error- drive learning method to distill the fit regulation. Doing attribute label based on the instance and regulation which just distilled is the method combined the transfonnatkion- based error - drive learning and instance - based learning. Experiments proved this method ratio of nicety aehieved 95.3 % on testing stylebooks, the ratio of recall achied, 92.5 %,so it is an effcetive method to identify Chinese proper noun

相似文献/References:

[1]钟锋 罗燕京 杨曦 李虎.一种基于合并策略的机构名称切分方法[J].计算机技术与发展,2008,(05):12.
 ZHONG Feng,LUO Yan-jing,YANG Xi,et al.An Organization Name Segmentation Approach Based on Combination Strategy[J].,2008,(11):12.

备注/Memo

备注/Memo:
王蕾(1980-),女,河南开封人,硕士研究生,主要从事中文信息处理,杨季文,教授,主要从事中文信息处理
更新日期/Last Update: 1900-01-01