J4
• Articles • Previous Articles Next Articles
WANG Lei,CHEN Zhi-ping,LI Zhi-cheng
Received:
Revised:
Online:
Published:
Contact:
Abstract: Since varied training data sources are not profitable for the learning of optimal model parameters, then a novel text information extraction algorithm based on hidden Markov model with multiple templates is proposed, which makes use of the information of format and list separators to segment text, and then extracts text information through combining theparameters of releasing probability for universal training, using multiple form templates to train the parameters of initial probability and transition probability for hidden Markov mode. Experimental results show better performance in precision and recall over simple hidden Markov model.
Key words: text block , multiple templates, hidden markov model, text information extraction
WANG Lei,CHEN Zhi-ping,LI Zhi-cheng . Using text blocks based on multiple templates hidden markov model for text information extraction[J].J4, 2006, 41(3): 19-24 .
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: http://lxbwk.njournal.sdu.edu.cn/EN/
http://lxbwk.njournal.sdu.edu.cn/EN/Y2006/V41/I3/19
Cited