Username   Password       Forgot your password?  Forgot your username? 

Entity Disambiguation with Markov Logic Network Knowledge Graphs

Volume 13, Number 8, December 2017, pp. 1293-1303
DOI: 10.23940/ijpe.17.08.p11.12931303

Jiangtao Ma, Tao Wei, Yaqiong Qiao, Yongzhong Huang, Weibo Xie, Chaoqin Zhang,Yanjun Wang, Rui Zhang

aState Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou 450002, China
bZhengzhou University of Light Industry, Zhengzhou 450002, China                                                                                                                             
cNational Digital Switching System Engineering & Technological R&D Center, Zhengzhou 451000, China
dHenan Institute of Engineering, Computer College, Zhengzhou 451000, China
eNorth China University of Water Resources and Electric Power, Zhengzhou 450002, China

(Submitted on July 25, 2017; Revised on August 30, 2017; Accepted on September 15, 2017)

(This paper was presented at the Third International Symposium on System and Software Reliability.)


Disambiguating named entities is an important problem in natural language processing, knowledge base, question answering systems. In the paper, we propose a Markov logic network knowledge graph solution for solving entity resolution problem. First, we employ knowledge graph to represent the entity relationship between linked entities in the knowledge base. Then, we utilize MLN to inference the inconsistent relationship in the knowledge graph, and disambiguate the entities in the process of entity disambiguation. As far as we know, inferencing with MLN is a first attempt for entity disambiguation in the knowledge graph. We evaluate the proposed solution with three real world knowledge bases and compare it with four baseline solutions. The experimental results demonstrate that our solution is 7% higher than other baseline methods with F1 measure. We also test our scheme and compare entity resolution systems on four datasets with three knowledge base corpora. Extensive experiments show that our solution achieves higher precision and recall than baseline solutions.


References: 30

      1. J. Biega, E. Kuzey, and F.M. Suchanek, “Inside YAGO2s: A Transparent Information Extraction Architecture,” in proceedings of the 22nd International World Wide Web Conference (WWW), pp. 325–328, Rio Janeiro, Brazil, May 2013
      2. K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor, “Freebase: A Collaboratively Created Graph Database for Structuring Human Knowledge,” in proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 1247–1250, Vancouver, Canada, June 2008
      3. J.D. Carroll and J.-J. Chang, “Analysis of Individual Differences in Multidimensional Scaling Via an N-way Generalization of “Eckart-Young” Decomposition,” Psychometrika. vol.35, no. 35, pp. 283–319, July 1970
      4. X. Cheng and D. Roth, “Relational Inference for Wikification,” in proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1787–1796, Washington, USA, August 2013
      5. P. Christen, “Automatic Training Example Selection for Scalable Unsupervised Record Linkage,” in proceedings of the 12th Advances in Knowledge Discovery and Data Mining, Pacific-Asia Conference, (PAKDD), pp. 511–518, Osaka, Japan, May 2008
      6. W.W. Cohen, and J. Richman, “Learning to Match and Cluster Large High-dimensional Data Sets for Data Integration,” in proceedings of the Eighth International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 475–480, Alberta, Canada, July 2002
      7. M. Cornolti, P. Ferragina and M. Ciaramita, “A Framework for Benchmarking Entity-Annotation Systems,” in proceedings of the 22nd International World Wide Web Conference (WWW), pp. 249–260, Rio Janeiro, Brazil, May 2013
      8. S. Cucerzan, “Large-Scale Named Entity Disambiguation Based on Wikipedia Data,” in proceedings of the Conference on Empirical Methods in Natural Language Processing Conference and Computational Natural Language Learning Joint Meeting following ACL 2007 (EMNLP-CoNLL 2007), pp. 708–716, Prague, Czech Republic, June 2007
      9. X. Dong, E. Gabrilovich, G. Heitz, W. Horn, N. Lao, K. Murphy, T. Strohmann, S. Sun and W. Zhang, “Knowledge Vault: A Web-scale Approach to Probabilistic Knowledge Fusion,” in proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 601–610, New York, NY, USA, August 2014
      10. F. Erxleben, M. Günther, M. Krötzsch, J. Mendez and D. Vrandecic, “Introducing Wikidata to the Linked Data Web,” in proceedings of the 13th International Semantic Web Conference (ISWC), pp. 50–65, Riva del Garda, Italy, October 2014
      11. P. Ferragina and U. Scaiella, “Fast and Accurate Annotation of Short Texts with Wikipedia Pages,” IEEE Software. vol. 29, no. 1, pp. 70–75, January 2012
      12. T.N. Herzog, F.J. Scheuren and W.E. Winkler, “Data Quality and Record Linkage Techniques,” Springer, 2007
      13. J. Hoffart, M.A. Yosef, I. Bordino, H. Fürstenau, M. Pinkal, M. Spaniol, B. Taneva, S. Thater and G. Weikum, “Robust Disambiguation of Named Entities in Text,” in proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 782–792, Edinburgh, UK, July 2011
      14. T.G. Kolda and B.W. Bader, “Tensor Decompositions and Applications,” SIAM Review, vol. 51, no. 3, pp. 455–500, May 2009
      15. S. Lacoste-Julien, K. Palla, A. Davies, G. Kasneci, T. Graepel and Z. Ghahramani, “SIGMa: Simple Greedy Matching for Aligning Large Knowledge Bases,” in proceedings of the 19th International Conference on Knowledge Discovery and Data Mining (KDD), pp. 572–580, Chicago, USA, August 2013
      16. T. Lee, Z. Wang, H. Wang and S. Hwang, “Web Scale Taxonomy Cleansing,” in proceedings of the VLDB Endowment 2011, vol. 4, no. 12, pp. 1295–1306, September 2011
      17. J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P.N. Mendes, S. Hellmann, M. Morsey, P. van Kleef, S. Auer and C. Bizer, “DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia,” Semantic Web, vol. 6, no. 2, pp. 167–195, August 2015
      18. F. Mahdisoltani, J. Biega and F.M. Suchanek, “YAGO3: A Knowledge Base from Multilingual Wikipedias,” online proceedings of the Seventh Biennial Conference on Innovative Data Systems Research (CIDR), Asilomar, CA, USA, January 2015
      19. A. McCallum and B. Wellner, “Conditional Models of Identity Uncertainty with Application to Noun Coreference,” in proceedings of the 17th International Conference on Neural Information Processing Systems (NIPS), pp. 905–912, Vancouver, Br. Columbia, Canada, December 2004
      20. P.N. Mendes, M. Jakob, A. Garcia-Silva and C. Bizer, “DBpedia Spotlight: Shedding Light on the Web Of Documents,” in proceedings the 7th International Conference on Semantic Systems (I-SEMANTICS), pp. 1–8, Graz, Austria, September 2011
      21. D.N. Milne and I.H. Witten, “Learning to Link with Wikipedia,” in proceedings of the 17th Conference on Information and Knowledge Management (CIKM), pp. 509–518, Napa Val. California, USA, October 2008
      22. F. Piccinno and P. Ferragina, “From TagME to WAT: A New Entity Annotator,” in proceedings of the First International Workshop on Entity Recognition & disambiguation (ERD '14), pp. 55–62, Gold Coast, Queensland, Australia, July 2014
      23. L. Ratinov, D. Roth, D. Downey and M. Anderson, “Local and Global Algorithms for Disambiguation to Wikipedia,” in proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (HLT '11), pp. 1375–1384, Portland, Oregon, June 2011
      24. M. Richardson and P. Domingos, “Markov Logic Networks,” Machine Learning, vol. 62, no. 1, pp. 107–136, February 2006
      25. S. Sarawagi and A. Bhamidipaty, “Interactive Deduplication Using Active Learning,” in proceedings of the Eighth International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 269–278, Edmonton, Alberta, Canada, July 2002
      26. P. Singla and P.M. Domingos, “Entity Resolution with Markov Logic,” in proceedings of the 6th International Conference on Data Mining (ICDM), pp. 572–582, Hong Kong, China, December 2006
      27. S. Tejada, C.A. Knoblock and S. Minton, “Learning Domain-Independent String Transformation Weights for High Accuracy Object Identification,” in proceedings of the Eighth International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 350–359, Edmonton, Alberta, Canada, July 2002
      28. R. Usbeck, M. Röder, A.-C. Ngonga Ngomo, C. Baron, A. Both, M. Brümmer, D. Ceccarelli, M. Cornolti, D. Cherix, B. Eickmann, P. Ferragina, C. Lemke, A. Moro, R. Navigli, F. Piccinno, G. Rizzo, H. Sack, R. Speck, R. Troncy, J. Waitelonis and L. Wesemann, “GERBIL: General Entity Annotator Benchmarking Framework,” in proceedings of the 24th International Conference on World Wide Web (WWW’15), pp. 1133–1143, Florence, Italy, May 2015
      29. W. Wu, H. Li, H. Wang and K.Q. Zhu, “Probase: A Probabilistic Taxonomy for Text Understanding,” in proceedings of the International Conference on Management of Data (SIGMOD), pp. 481–492, Scottsdale, AZ, USA, May 2012
      30. S. Zwicklbauer, C. Seifert and M. Granitzer, “DoSeR - A Knowledge-Base-Agnostic Framework for Entity Disambiguation Using Semantic Embeddings,” in proceedings of the Semantic Web. Latest Advances and New Domains: 13th International Conference (ESWC 2016), pp. 182–198, Heraklion, Crete, Greece, May 2016


          Click here to download the paper.

          Please note : You will need Adobe Acrobat viewer to view the full articles.Get Free Adobe Reader

          This site uses encryption for transmitting your passwords.