skip to main content
research-article
Free Access
Just Accepted

NPEL: Neural Paired Entity Linking in Web Tables

Authors Info & Claims
Online AM:19 March 2024Publication History
Skip Abstract Section

Abstract

This paper studies entity linking (EL) in Web tables, which aims to link the string mentions in table cells to their referent entities in a knowledge base. Two main problems exist in previous studies: 1) contextual information is not well utilized in mention-entity similarity computation; 2) the assumption on entity coherence that all entities in the same row or column are highly related to each other is not always correct. In this paper, we propose NPEL, a new Neural Paired Entity Linking framework, to overcome the above problems. In NPEL, we design a deep learning model with different neural networks and an attention mechanism, to model different kinds of contextual information of mentions and entities, for mention-entity similarity computation in Web tables. NPEL also relaxes the above assumption on entity coherence by a new paired entity linking algorithm, which iteratively selects two mentions with the highest confidence for EL. Experiments on real-world datasets exhibit that NPEL has the best performance compared with state-of-the-art baselines in different evaluation metrics.

References

  1. Chandra Sekhar Bhagavatula, Thanapon Noraset, and Doug Downey. 2015. TabEL: Entity Linking in Web Tables. In ISWC, Part I. 425–441.Google ScholarGoogle Scholar
  2. Michael Cafarella, Alon Halevy, Hongrae Lee, Jayant Madhavan, Cong Yu, Daisy Zhe Wang, and Eugene Wu. 2018. Ten Years of WebTables. PVLDB 11, 12 (2018), 2140–2149.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Michael J Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, and Yang Zhang. 2008. WebTables: Exploring the Power of Tables on the Web. PVLDB 1, 1 (2008), 538–549.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Jiaoyan Chen, Ernesto Jiménez-Ruiz, Ian Horrocks, and Charles Sutton. 2019. ColNet: Embedding the Semantics of Web Tables for Column Type Prediction. In AAAI, Vol.  33. 29–36.Google ScholarGoogle Scholar
  5. Jiaoyan Chen, Ernesto Jiménez-Ruiz, Ian Horrocks, and Charles Sutton. 2019. Learning Semantic Annotations for Tabular Data. In IJCAI. 2088–2094.Google ScholarGoogle Scholar
  6. Xiang Deng, Huan Sun, Alyssa Lees, You Wu, and Cong Yu. 2022. TURL: Table Understanding through Representation Learning. ACM SIGMOD Record 51, 1 (2022), 33–40.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Vasilis Efthymiou, Oktie Hassanzadeh, Mariano Rodriguez-Muro, and Vassilis Christophides. 2017. Matching Web Tables with Knowledge Base Entities: From Entity Lookups to Entity Embeddings. In ISWC, Part I. 260–277.Google ScholarGoogle Scholar
  8. Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. 2000. Learning to Forget: Continual Prediction with LSTM. Neural Computation 12, 10 (2000), 2451–2471.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Alex Graves. 2013. Generating Sequences with Recurrent Neural Networks. arXiv preprint arXiv:1308.0850(2013).Google ScholarGoogle Scholar
  10. Gaëlle Hignette, Patrice Buche, Juliette Dibie-Barthélemy, and Ollivier Haemmerlé. 2009. Fuzzy Annotation of Web Data Tables Driven by a Domain Ontology. In ESWC. 638–653.Google ScholarGoogle Scholar
  11. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural computation 9, 8 (1997), 1735–1780.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yusra Ibrahim, Mirek Riedewald, and Gerhard Weikum. 2016. Making Sense of Entities and Quantities in Web Tables. In CIKM. 1703–1712.Google ScholarGoogle Scholar
  13. Sujay Kumar Jauhar, Peter Turney, and Eduard Hovy. 2016. Tables as Semi-Structured Knowledge for Question Answering. In ACL, Volume 1: Long Papers. 474–483.Google ScholarGoogle Scholar
  14. Diederik P Kingma and Jimmy Ba. 2014. ADAM: A METHOD FOR STOCHASTIC OPTIMIZATION. arXiv preprint arXiv:1412.6980(2014).Google ScholarGoogle Scholar
  15. Thomas N Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In ICLR.Google ScholarGoogle Scholar
  16. Benno Kruit, Peter Boncz, and Jacopo Urbani. 2019. Extracting Novel Facts from Tables for Knowledge Graph Completion. In ISWC, Part I. 364–381.Google ScholarGoogle Scholar
  17. Oliver Lehmberg and Christian Bizer. 2017. Stitching Web Tables for Improving Matching Quality. PVLDB 10, 11 (2017), 1502–1513.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Oliver Lehmberg, Dominique Ritze, Robert Meusel, and Christian Bizer. 2016. A Large Public Corpus of Web Tables Containing Time and Context Metadata. In WWW, Companion Volume. 75–76.Google ScholarGoogle Scholar
  19. Girija Limaye, Sunita Sarawagi, and Soumen Chakrabarti. 2010. Annotating and Searching Web Tables using Entities, Types and Relationships. PVLDB 3, 1-2 (2010), 1338–1347.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Xusheng Luo, Kangqi Luo, Xianyang Chen, and Kenny Q Zhu. 2018. Cross-lingual Entity Linking for Web Tables. In AAAI. 362–369.Google ScholarGoogle Scholar
  21. Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. In ICLR, Workshop.Google ScholarGoogle Scholar
  22. Varish Mulwad, Tim Finin, and Anupam Joshi. 2013. Semantic Message Passing for Generating Linked Data from Tables. In ISWC, Part I. 363–378.Google ScholarGoogle Scholar
  23. Emir Muñoz, Aidan Hogan, and Alessandra Mileo. 2014. Using Linked Data to Mine RDF from Wikipedia’s Tables. In WSDM. 533–542.Google ScholarGoogle Scholar
  24. Minh C Phan, Aixin Sun, Yi Tay, Jialong Han, and Chenliang Li. 2018. Pair-Linking for Collective Entity Disambiguation: Two Could Be Better Than All. IEEE Transactions on Knowledge and Data Engineering 31, 7(2018), 1383–1396.Google ScholarGoogle ScholarCross RefCross Ref
  25. Dominique Ritze, Oliver Lehmberg, Yaser Oulabi, and Christian Bizer. 2016. Profiling the Potential of Web Tables for Augmenting Cross-Domain Knowledge Bases. In WWW. 251–261.Google ScholarGoogle Scholar
  26. Huan Sun, Hao Ma, Xiaodong He, Wen-tau Yih, Yu Su, and Xifeng Yan. 2016. Table Cell Search for Question Answering. In WWW. 771–782.Google ScholarGoogle Scholar
  27. Kunihiro Takeoka, Masafumi Oyamada, Shinji Nakadai, and Takeshi Okadome. 2019. Meimei: An Efficient Probabilistic Approach for Semantically Annotating Tables. In AAAI, Vol.  33. 281–288.Google ScholarGoogle Scholar
  28. Zhiruo Wang, Haoyu Dong, Ran Jia, Jia Li, Zhiyi Fu, Shi Han, and Dongmei Zhang. 2021. TUTA: Tree-based Transformers for Generally Structured Table Pre-training. In SIGKDD. 1780–1790.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Tianxing Wu, Shengjia Yan, Zhixin Piao, Liang Xu, Ruiming Wang, and Guilin Qi. 2016. Entity Linking in Web Tables with Multiple Linked Knowledge Bases. In JIST. 239–253.Google ScholarGoogle Scholar
  30. Shuo Zhang and Krisztian Balog. 2020. Web Table Extraction, Retrieval, and Augmentation: A Survey. ACM Transactions on Intelligent Systems and Technology 11, 2(2020), 1–35.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Si Zhang, Hanghang Tong, Jiejun Xu, and Ross Maciejewski. 2019. Graph Convolutional Networks: A Comprehensive Review. Computational Social Networks 6, 1 (2019), 11.Google ScholarGoogle ScholarCross RefCross Ref
  32. Ziqi Zhang. 2017. Effective and efficient semantic table interpretation using tableminer+. Semantic Web 8, 6 (2017), 921–957.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. NPEL: Neural Paired Entity Linking in Web Tables

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Asian and Low-Resource Language Information Processing
        ACM Transactions on Asian and Low-Resource Language Information Processing Just Accepted
        ISSN:2375-4699
        EISSN:2375-4702
        Table of Contents

        Copyright © 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Online AM: 19 March 2024
        • Accepted: 5 March 2024
        • Revised: 8 February 2024
        • Received: 30 October 2023
        Published in tallip Just Accepted

        Check for updates

        Qualifiers

        • research-article
      • Article Metrics

        • Downloads (Last 12 months)63
        • Downloads (Last 6 weeks)41

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader