Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs

Recent years have seen increasingly complex question-answering on knowledge bases (KBQA) involving logical, quantitative, and comparative reasoning over KB subgraphs. Neural Program Induction (NPI) is a pragmatic approach toward modularizing the reasoning process by translating...

Full description

Bibliographic Details
Main Authors: Saha, Amrita, Ansari, Ghulam Ahmed, Laddha, Abhishek, Sankaranarayanan, Karthik, Chakrabarti, Soumen
Format: Article
Language:English
Published: The MIT Press 2019-11-01
Series:Transactions of the Association for Computational Linguistics
Online Access:https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00262
id doaj-3a9591e65064424ea1ed096376db075d
record_format Article
spelling doaj-3a9591e65064424ea1ed096376db075d2020-11-25T03:16:37ZengThe MIT PressTransactions of the Association for Computational Linguistics2307-387X2019-11-01718520010.1162/tacl_a_00262Complex Program Induction for Querying Knowledge Bases in the Absence of Gold ProgramsSaha, AmritaAnsari, Ghulam AhmedLaddha, AbhishekSankaranarayanan, KarthikChakrabarti, Soumen Recent years have seen increasingly complex question-answering on knowledge bases (KBQA) involving logical, quantitative, and comparative reasoning over KB subgraphs. Neural Program Induction (NPI) is a pragmatic approach toward modularizing the reasoning process by translating a complex natural language query into a multi-step executable program. While NPI has been commonly trained with the ‘‘gold’’ program or its sketch, for realistic KBQA applications such gold programs are expensive to obtain. There, practically only natural language queries and the corresponding answers can be provided for training. The resulting combinatorial explosion in program space, along with extremely sparse rewards, makes NPI for KBQA ambitious and challenging. We present Complex Imperative Program Induction from Terminal Rewards (CIPITR), an advanced neural programmer that mitigates reward sparsity with auxiliary rewards, and restricts the program space to semantically correct programs using high-level constraints, KB schema, and inferred answer type. CIPITR solves complex KBQA considerably more accurately than key-value memory networks and neural symbolic machines (NSM). For moderately complex queries requiring 2- to 5-step programs, CIPITR scores at least 3× higher F1 than the competing systems. On one of the hardest class of programs (comparative reasoning) with 5–10 steps, CIPITR outperforms NSM by a factor of 89 and memory networks by 9 times. 1https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00262
collection DOAJ
language English
format Article
sources DOAJ
author Saha, Amrita
Ansari, Ghulam Ahmed
Laddha, Abhishek
Sankaranarayanan, Karthik
Chakrabarti, Soumen
spellingShingle Saha, Amrita
Ansari, Ghulam Ahmed
Laddha, Abhishek
Sankaranarayanan, Karthik
Chakrabarti, Soumen
Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
Transactions of the Association for Computational Linguistics
author_facet Saha, Amrita
Ansari, Ghulam Ahmed
Laddha, Abhishek
Sankaranarayanan, Karthik
Chakrabarti, Soumen
author_sort Saha, Amrita
title Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_short Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_full Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_fullStr Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_full_unstemmed Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_sort complex program induction for querying knowledge bases in the absence of gold programs
publisher The MIT Press
series Transactions of the Association for Computational Linguistics
issn 2307-387X
publishDate 2019-11-01
description Recent years have seen increasingly complex question-answering on knowledge bases (KBQA) involving logical, quantitative, and comparative reasoning over KB subgraphs. Neural Program Induction (NPI) is a pragmatic approach toward modularizing the reasoning process by translating a complex natural language query into a multi-step executable program. While NPI has been commonly trained with the ‘‘gold’’ program or its sketch, for realistic KBQA applications such gold programs are expensive to obtain. There, practically only natural language queries and the corresponding answers can be provided for training. The resulting combinatorial explosion in program space, along with extremely sparse rewards, makes NPI for KBQA ambitious and challenging. We present Complex Imperative Program Induction from Terminal Rewards (CIPITR), an advanced neural programmer that mitigates reward sparsity with auxiliary rewards, and restricts the program space to semantically correct programs using high-level constraints, KB schema, and inferred answer type. CIPITR solves complex KBQA considerably more accurately than key-value memory networks and neural symbolic machines (NSM). For moderately complex queries requiring 2- to 5-step programs, CIPITR scores at least 3× higher F1 than the competing systems. On one of the hardest class of programs (comparative reasoning) with 5–10 steps, CIPITR outperforms NSM by a factor of 89 and memory networks by 9 times. 1
url https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00262
work_keys_str_mv AT sahaamrita complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms
AT ansarighulamahmed complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms
AT laddhaabhishek complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms
AT sankaranarayanankarthik complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms
AT chakrabartisoumen complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms
_version_ 1724635117312278528