Select to translate
Journal|[J]ACM Transactions on Knowledge Discovery from DataVolume 12, Issue 5. 2018. PP 1-36
Employing Semantic Context for Sparse Information Extraction Assessment
Abstract / 摘要
A huge amount of texts available on the World Wide Web presents an unprecedented opportunity for information extraction (IE). One important assumption in IE is that frequent extractions are more likely to be correct. Sparse IE is hence a challenging task because no matter how big a corpus is, there are extractions supported by only a small amount of evidence in the corpus. However, there is limited research on sparse IE, especially in the assessment of the validity of sparse IEs. Motivated by this, we introduce a lightweight, explicit semantic approach for assessing sparse IE.1 We first use a large semantic network consisting of millions of concepts, entities, and attributes to explicitly model the context of any semantic relationship. Second, we learn from three semantic contexts using different base classifiers to select an optimal classification model for assessing sparse extractions. Finally, experiments show that as compared with several state-of-the-art approaches, our approach can significantly improve the F -score in the assessment of sparse extractions while maintaining the efficiency.
Indexed by / 核心评价
《中国学术期刊(光盘版)》电子杂志社有限公司KDN平台基础技术由KBASE 11.0提供