TY - GEN
T1 - Using corpus statistics on entities to improve semi-supervised relation extraction from the Web
AU - Rosenfeld, Benjamin
AU - Feldman, Ronen
PY - 2007
Y1 - 2007
AB - Many errors produced by unsupervised and semi-supervised relation extraction (RE) systems stem from incorrect recognition of the entities that participate in the relations. This is especially true for systems that do not use separate named-entity recognition components, instead relying on general-purpose shallow parsing. Such systems have greater applicability, because they are able to extract relations that contain attributes of unknown types. However, this generality comes at a cost in accuracy. In this paper, we show how to use corpus statistics to validate and correct the arguments of extracted relation instances, improving the overall RE performance. We test the methods on SRES, a self-supervised Web relation extraction system. We also compare the performance of corpus-based methods to the performance of validation and correction methods based on supervised NER components.
UR - http://www.scopus.com/inward/record.url?scp=80053498006&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:80053498006
SN - 9781932432862
T3 - ACL 2007 - Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics
SP - 600
EP - 607
BT - ACL 2007 - Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics
T2 - 45th Annual Meeting of the Association for Computational Linguistics, ACL 2007
Y2 - 23 June 2007 through 30 June 2007
ER -