Novel knowledge-based system with relation detection and textual evidence for question answering research

PLoS One. 2018 Oct 3;13(10):e0205097. doi: 10.1371/journal.pone.0205097. eCollection 2018.

Abstract

With the development of large-scale knowledge bases (KBs), knowledge-based question answering (KBQA) has become an important research topic in recent years. The key task in KBQA is relation detection, which is the process of finding a compatible answer type for a natural language question and generating its corresponding structured query over a KB. However, existing systems often rely on shallow probabilistic methods, which are less expressive than deep semantic representation methods. In addition, since KBs are still far from complete, it is necessary to develop a new strategy that leverages unstructured resources outside of KBs. In this work, we propose a novel Question Answering method with Relation Detection and Textual Evidence (QARDTE). First, to address the semantic gap problem in relation detection, we use bidirectional long-short term memory networks with different levels of abstraction to better capture sentence structures. Our model achieves improved results with robustness against a wide diversity of expressions and questions with multiple relations. Moreover, to help compensate for the incompleteness of KBs, we utilize external unstructured text to extract additional supporting evidence and combine this evidence with relation information during the answer re-ranking process. In experiments on two well-known benchmarks, our system achieves F1 values of 0.558 (+2.8%) and 0.663 (+5.7%), which are state-of-the-art results that show significant improvement over existing KBQA systems.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Knowledge Bases*
  • Natural Language Processing*
  • Quality Improvement
  • Semantics

Grants and funding

This research is supported by (1) National Natural Science Foundation of China Grant No. 61773229, Funding Body: National Natural Science Foundation of China, http://www.nsfc.gov.cn/, author received: HTZ; (2) Shenzhen Science and Technology Project Grant No. CYZZ20150408152315667, Funding Body: Shenzhen Science, Technology and Innovation Commission, http://www.szsti.gov.cn, author received: CZZ; (3) Basic Scientific Research Program of Shenzhen City Grant No. JCYJ20160331184440545, Funding Body: Shenzhen Science, Technology and Innovation Commission, http://www.szsti.gov.cn, author received: HTZ; and (4) Cross fund of Graduate School at Shenzhen, Tsinghua University Grant No. JC20140001, http://www.sz.tsinghua.edu.cn/, Funding Body: Graduate School at Shenzhen, Tsinghua University, author received: HTZ. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. One of our authors (CZZ) affiliates to Giiso Information Technology Co., Ltd. The funder provided support in the form of salaries for authors HTZ, ZYF, YJ, CZZ and JYC, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.