体育赛事命名实体识别研究
摘要:
为了准确地从中文文本中识别出复杂体育赛事命名实体,提出了一种基于双层条件随机场模型的命名实体识别方法.该方法首先在低层条件随机场模型中识别出简单体育赛事命名实体,然后在高层条件随机场模型中识别出嵌套了简单体育赛事命名实体的复杂命名实体如赛事名、参赛球队名和比赛场馆名.在对大规模真实语料进行的开放测试中,赛事名、参赛球队名和比赛场馆名识别的F值分别达到97.09%,97.81%和98.03%.
In order to accurately recognize the complex sports events named entities in Chinese text,this paper presents a method of named entity recognition based on cascaded conditional random fields.In the proposed method,simple named entities are firstly recognized by lower model and then complex named entities nesting simple sports events named entity such as event name,team name and venue name are recognized by higher model.In open test on large-scale corpus,its F-measure of event name,team name and venue name is 97.09%,97.81% and 98.03%.
作者:
谷川 宋旭
机构地区:
安阳师范学院软件学院 安阳师范学院计算机与信息工程学院
出处:
《betway官方app 学报:自然科学版》 CAS 北大核心 2015年第4期163-167,共5页
基金:
国家自然科学基金(60875081) 河南省基础与前沿技术研究计划项目(112300410182)
关键词:
命名实体识别 体育赛事领域 双层条件随机场
named entity recognition sports events filed cascaded conditional random fields
分类号:
TP391.11 [自动化与计算机技术—计算机应用技术]