您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(理学版)》

J4 ›› 2012, Vol. 47 ›› Issue (5): 38-42.

• 电子技术与信息 • 上一篇    下一篇

基于语义分析的微博搜索

刘晓华1,2,韦福如2,段亚娟3,周明2   

  1. 1.哈尔滨工业大学计算机科学与技术学院, 黑龙江 哈尔滨 150001;
    2.微软亚洲研究院, 北京 100080; 3.中国科技大学计算机科学与技术学院, 安徽 合肥 230026
  • 收稿日期:2011-11-30 出版日期:2012-05-20 发布日期:2012-06-01
  • 作者简介:刘晓华(1976- ),男,研究员,博士,研究方向为自然语言处理. Email:Lxh5147@126.com

Semantic search of microblogs

LIU Xiao-hua1,2, WEI Fu-ru2, DUAN Ya-juan3, ZHOU Ming2   

  1. 1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, Heilongjiang, China;
    2. Microsoft Research Asia, Beijing 100080, China; 3. School of Computer Science and Technology,
    University of Science and Technology of China, Hefei 230026, Anhui, China
  • Received:2011-11-30 Online:2012-05-20 Published:2012-06-01

摘要:

 提出构建基于语义分析的微博搜索以帮助用户从海量的、书写通常不规范的微博中有效地获取信息。和现有的微博搜索引擎不同,基于语义分析的微博搜索利用一系列的自然语言处理和文本挖掘模块从微博中获取各类兴趣点,例如命名实体、事件、情感等。这些兴趣点进一步被索引,以支持分类浏览和高级搜索。本文讨论了微博语义搜索所面临的挑战及对策,介绍了一种参考实现框架及相关的语义分析技术,特别是面向微博的语义角色标注技术。

关键词: 微博;搜索引擎;语义搜索;语义角色标注

Abstract:

To obtain efficient information from a  huge number of microblogs which are short and often informally written, a search engine based on semantic analysis for microblogs semantic search was  proposed.  Unlike current microblogs search engines, it conducts a serials of natural language processings and text minings for microblogs to get interesting points such as named entities, events and opinions, that  are further indexed, and thus  two brand new scenarios are enabled, i.e., classifiction browsing and advanced search. The challenges and their possible solutions, a reference implementation framework, and related core semantic computing technologies, e.g., semantic role labeling, were presented.

Key words: microblogs; search engine; semantic search; semantic role labeling

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!