Recognition method of Vietnamese named entity based on conditional random fields

PAN Qing-qing, ZHOU Feng, YU Zheng-tao, GUO Jian-yi, XIAN Yan-tuan   

  School of Information Engineering and Automation, Kunming University of Science and Technology,
A named entity recognition method based on the conditional random fields (CRF) model is proposed for the linguistic characteristics of Vietnamese. The method takes words and parts of speech as features, defines feature templates over them, and applies the CRF algorithm. Vietnamese news text is selected as the corpus and annotated with six entity types, including place names, person names and organization names, and the annotated corpus is then used to train a Vietnamese entity recognition model. Experimental results show that the method achieves an entity recognition accuracy of 83.73%.

Key words: machine learning, feature selection, conditional random fields, Vietnamese named entity recognition
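
As an illustration of the word and part-of-speech features described in the abstract, the following minimal Python sketch builds one feature dictionary per token from a context window. The window size, the feature names (`w[-1]`, `p[0]`, and so on), the padding symbol and the toy sentence with its tags are all assumptions made for this example; the abstract does not state the actual feature templates used.

```python
from typing import Dict, List, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag)

PAD = "<PAD>"  # hypothetical placeholder for positions outside the sentence


def token_features(sentence: List[Token], i: int, window: int = 1) -> Dict[str, str]:
    """Collect word and POS features around position i (window size is an assumption)."""
    features: Dict[str, str] = {}
    for offset in range(-window, window + 1):
        j = i + offset
        if 0 <= j < len(sentence):
            word, pos = sentence[j]
        else:
            word, pos = PAD, PAD
        features[f"w[{offset}]"] = word  # word feature at this relative offset
        features[f"p[{offset}]"] = pos   # part-of-speech feature at this offset
    # Combined feature joining the current word with its part of speech.
    word, pos = sentence[i]
    features["w[0]|p[0]"] = f"{word}|{pos}"
    return features


def sentence_features(sentence: List[Token]) -> List[Dict[str, str]]:
    """Return one feature dictionary per token in the sentence."""
    return [token_features(sentence, i) for i in range(len(sentence))]


# Toy, made-up sentence; the POS tags are illustrative only.
example: List[Token] = [
    ("Ông", "N"),
    ("Nguyễn_Văn_A", "Np"),
    ("đến", "V"),
    ("Hà_Nội", "Np"),
]

features = sentence_features(example)
```

Feature dictionaries of this kind, paired with one entity label per token, are the usual input for training a linear-chain CRF sequence labeller.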

