Abstract
This paper studies entity linking (EL) in Web tables, which aims to link the string mentions in table cells to their referent entities in a knowledge base. Two main problems exist in previous studies: 1) contextual information is not well utilized in mention-entity similarity computation; 2) the assumption on entity coherence that all entities in the same row or column are highly related to each other is not always correct. In this paper, we propose NPEL, a new Neural Paired Entity Linking framework, to overcome the above problems. In NPEL, we design a deep learning model with different neural networks and an attention mechanism, to model different kinds of contextual information of mentions and entities, for mention-entity similarity computation in Web tables. NPEL also relaxes the above assumption on entity coherence by a new paired entity linking algorithm, which iteratively selects two mentions with the highest confidence for EL. Experiments on real-world datasets exhibit that NPEL has the best performance compared with state-of-the-art baselines in different evaluation metrics.
- Chandra Sekhar Bhagavatula, Thanapon Noraset, and Doug Downey. 2015. TabEL: Entity Linking in Web Tables. In ISWC, Part I. 425–441.Google Scholar
- Michael Cafarella, Alon Halevy, Hongrae Lee, Jayant Madhavan, Cong Yu, Daisy Zhe Wang, and Eugene Wu. 2018. Ten Years of WebTables. PVLDB 11, 12 (2018), 2140–2149.Google ScholarDigital Library
- Michael J Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, and Yang Zhang. 2008. WebTables: Exploring the Power of Tables on the Web. PVLDB 1, 1 (2008), 538–549.Google ScholarDigital Library
- Jiaoyan Chen, Ernesto Jiménez-Ruiz, Ian Horrocks, and Charles Sutton. 2019. ColNet: Embedding the Semantics of Web Tables for Column Type Prediction. In AAAI, Vol. 33. 29–36.Google Scholar
- Jiaoyan Chen, Ernesto Jiménez-Ruiz, Ian Horrocks, and Charles Sutton. 2019. Learning Semantic Annotations for Tabular Data. In IJCAI. 2088–2094.Google Scholar
- Xiang Deng, Huan Sun, Alyssa Lees, You Wu, and Cong Yu. 2022. TURL: Table Understanding through Representation Learning. ACM SIGMOD Record 51, 1 (2022), 33–40.Google ScholarDigital Library
- Vasilis Efthymiou, Oktie Hassanzadeh, Mariano Rodriguez-Muro, and Vassilis Christophides. 2017. Matching Web Tables with Knowledge Base Entities: From Entity Lookups to Entity Embeddings. In ISWC, Part I. 260–277.Google Scholar
- Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. 2000. Learning to Forget: Continual Prediction with LSTM. Neural Computation 12, 10 (2000), 2451–2471.Google ScholarDigital Library
- Alex Graves. 2013. Generating Sequences with Recurrent Neural Networks. arXiv preprint arXiv:1308.0850(2013).Google Scholar
- Gaëlle Hignette, Patrice Buche, Juliette Dibie-Barthélemy, and Ollivier Haemmerlé. 2009. Fuzzy Annotation of Web Data Tables Driven by a Domain Ontology. In ESWC. 638–653.Google Scholar
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural computation 9, 8 (1997), 1735–1780.Google ScholarDigital Library
- Yusra Ibrahim, Mirek Riedewald, and Gerhard Weikum. 2016. Making Sense of Entities and Quantities in Web Tables. In CIKM. 1703–1712.Google Scholar
- Sujay Kumar Jauhar, Peter Turney, and Eduard Hovy. 2016. Tables as Semi-Structured Knowledge for Question Answering. In ACL, Volume 1: Long Papers. 474–483.Google Scholar
- Diederik P Kingma and Jimmy Ba. 2014. ADAM: A METHOD FOR STOCHASTIC OPTIMIZATION. arXiv preprint arXiv:1412.6980(2014).Google Scholar
- Thomas N Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In ICLR.Google Scholar
- Benno Kruit, Peter Boncz, and Jacopo Urbani. 2019. Extracting Novel Facts from Tables for Knowledge Graph Completion. In ISWC, Part I. 364–381.Google Scholar
- Oliver Lehmberg and Christian Bizer. 2017. Stitching Web Tables for Improving Matching Quality. PVLDB 10, 11 (2017), 1502–1513.Google ScholarDigital Library
- Oliver Lehmberg, Dominique Ritze, Robert Meusel, and Christian Bizer. 2016. A Large Public Corpus of Web Tables Containing Time and Context Metadata. In WWW, Companion Volume. 75–76.Google Scholar
- Girija Limaye, Sunita Sarawagi, and Soumen Chakrabarti. 2010. Annotating and Searching Web Tables using Entities, Types and Relationships. PVLDB 3, 1-2 (2010), 1338–1347.Google ScholarDigital Library
- Xusheng Luo, Kangqi Luo, Xianyang Chen, and Kenny Q Zhu. 2018. Cross-lingual Entity Linking for Web Tables. In AAAI. 362–369.Google Scholar
- Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. In ICLR, Workshop.Google Scholar
- Varish Mulwad, Tim Finin, and Anupam Joshi. 2013. Semantic Message Passing for Generating Linked Data from Tables. In ISWC, Part I. 363–378.Google Scholar
- Emir Muñoz, Aidan Hogan, and Alessandra Mileo. 2014. Using Linked Data to Mine RDF from Wikipedia’s Tables. In WSDM. 533–542.Google Scholar
- Minh C Phan, Aixin Sun, Yi Tay, Jialong Han, and Chenliang Li. 2018. Pair-Linking for Collective Entity Disambiguation: Two Could Be Better Than All. IEEE Transactions on Knowledge and Data Engineering 31, 7(2018), 1383–1396.Google ScholarCross Ref
- Dominique Ritze, Oliver Lehmberg, Yaser Oulabi, and Christian Bizer. 2016. Profiling the Potential of Web Tables for Augmenting Cross-Domain Knowledge Bases. In WWW. 251–261.Google Scholar
- Huan Sun, Hao Ma, Xiaodong He, Wen-tau Yih, Yu Su, and Xifeng Yan. 2016. Table Cell Search for Question Answering. In WWW. 771–782.Google Scholar
- Kunihiro Takeoka, Masafumi Oyamada, Shinji Nakadai, and Takeshi Okadome. 2019. Meimei: An Efficient Probabilistic Approach for Semantically Annotating Tables. In AAAI, Vol. 33. 281–288.Google Scholar
- Zhiruo Wang, Haoyu Dong, Ran Jia, Jia Li, Zhiyi Fu, Shi Han, and Dongmei Zhang. 2021. TUTA: Tree-based Transformers for Generally Structured Table Pre-training. In SIGKDD. 1780–1790.Google ScholarDigital Library
- Tianxing Wu, Shengjia Yan, Zhixin Piao, Liang Xu, Ruiming Wang, and Guilin Qi. 2016. Entity Linking in Web Tables with Multiple Linked Knowledge Bases. In JIST. 239–253.Google Scholar
- Shuo Zhang and Krisztian Balog. 2020. Web Table Extraction, Retrieval, and Augmentation: A Survey. ACM Transactions on Intelligent Systems and Technology 11, 2(2020), 1–35.Google ScholarDigital Library
- Si Zhang, Hanghang Tong, Jiejun Xu, and Ross Maciejewski. 2019. Graph Convolutional Networks: A Comprehensive Review. Computational Social Networks 6, 1 (2019), 11.Google ScholarCross Ref
- Ziqi Zhang. 2017. Effective and efficient semantic table interpretation using tableminer+. Semantic Web 8, 6 (2017), 921–957.Google ScholarDigital Library
Index Terms
- NPEL: Neural Paired Entity Linking in Web Tables
Recommendations
Entity linking leveraging: automatically generated annotation
COLING '10: Proceedings of the 23rd International Conference on Computational LinguisticsEntity linking refers entity mentions in a document to their representations in a knowledge base (KB). In this paper, we propose to use additional information sources from Wikipedia to find more name variations for entity linking task. In addition, as ...
Web personal name disambiguation based on reference entity tables mined from the web
WIDM '09: Proceedings of the eleventh international workshop on Web information and data managementAmbiguous personal names are common on the Web, which pose a challenge for many different tasks. The traditional disambiguation employs the clustering methods. However, without reference entity tables, the clustering method can only identify whether two ...
Leveraging Entity Linking to Enhance Entity Recognition in Microblogs
IC3K 2015: Proceedings of the International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge ManagementThe Web of Data provides abundant knowledge wherein objects or entities are described by means of properties
and their relationships with other objects or entities. This knowledge is used extensively by the research
community for Information Extraction ...
Comments