Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs

Recent years have seen increasingly complex question-answering on knowledge bases (KBQA) involving logical, quantitative, and comparative reasoning over KB subgraphs. Neural Program Induction (NPI) is a pragmatic approach toward modularizing the reasoning process by translating...

Full description

Bibliographic Details
Main Authors:	Saha, Amrita, Ansari, Ghulam Ahmed, Laddha, Abhishek, Sankaranarayanan, Karthik, Chakrabarti, Soumen
Format:	Article
Language:	English
Published:	The MIT Press 2019-11-01
Series:	Transactions of the Association for Computational Linguistics
Online Access:	https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00262

id	doaj-3a9591e65064424ea1ed096376db075d
record_format	Article
spelling	doaj-3a9591e65064424ea1ed096376db075d2020-11-25T03:16:37ZengThe MIT PressTransactions of the Association for Computational Linguistics2307-387X2019-11-01718520010.1162/tacl_a_00262Complex Program Induction for Querying Knowledge Bases in the Absence of Gold ProgramsSaha, AmritaAnsari, Ghulam AhmedLaddha, AbhishekSankaranarayanan, KarthikChakrabarti, Soumen Recent years have seen increasingly complex question-answering on knowledge bases (KBQA) involving logical, quantitative, and comparative reasoning over KB subgraphs. Neural Program Induction (NPI) is a pragmatic approach toward modularizing the reasoning process by translating a complex natural language query into a multi-step executable program. While NPI has been commonly trained with the ‘‘gold’’ program or its sketch, for realistic KBQA applications such gold programs are expensive to obtain. There, practically only natural language queries and the corresponding answers can be provided for training. The resulting combinatorial explosion in program space, along with extremely sparse rewards, makes NPI for KBQA ambitious and challenging. We present Complex Imperative Program Induction from Terminal Rewards (CIPITR), an advanced neural programmer that mitigates reward sparsity with auxiliary rewards, and restricts the program space to semantically correct programs using high-level constraints, KB schema, and inferred answer type. CIPITR solves complex KBQA considerably more accurately than key-value memory networks and neural symbolic machines (NSM). For moderately complex queries requiring 2- to 5-step programs, CIPITR scores at least 3× higher F1 than the competing systems. On one of the hardest class of programs (comparative reasoning) with 5–10 steps, CIPITR outperforms NSM by a factor of 89 and memory networks by 9 times. 1https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00262
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Saha, Amrita Ansari, Ghulam Ahmed Laddha, Abhishek Sankaranarayanan, Karthik Chakrabarti, Soumen
spellingShingle	Saha, Amrita Ansari, Ghulam Ahmed Laddha, Abhishek Sankaranarayanan, Karthik Chakrabarti, Soumen Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs Transactions of the Association for Computational Linguistics
author_facet	Saha, Amrita Ansari, Ghulam Ahmed Laddha, Abhishek Sankaranarayanan, Karthik Chakrabarti, Soumen
author_sort	Saha, Amrita
title	Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_short	Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_full	Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_fullStr	Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_full_unstemmed	Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs
title_sort	complex program induction for querying knowledge bases in the absence of gold programs
publisher	The MIT Press
series	Transactions of the Association for Computational Linguistics
issn	2307-387X
publishDate	2019-11-01
description	Recent years have seen increasingly complex question-answering on knowledge bases (KBQA) involving logical, quantitative, and comparative reasoning over KB subgraphs. Neural Program Induction (NPI) is a pragmatic approach toward modularizing the reasoning process by translating a complex natural language query into a multi-step executable program. While NPI has been commonly trained with the ‘‘gold’’ program or its sketch, for realistic KBQA applications such gold programs are expensive to obtain. There, practically only natural language queries and the corresponding answers can be provided for training. The resulting combinatorial explosion in program space, along with extremely sparse rewards, makes NPI for KBQA ambitious and challenging. We present Complex Imperative Program Induction from Terminal Rewards (CIPITR), an advanced neural programmer that mitigates reward sparsity with auxiliary rewards, and restricts the program space to semantically correct programs using high-level constraints, KB schema, and inferred answer type. CIPITR solves complex KBQA considerably more accurately than key-value memory networks and neural symbolic machines (NSM). For moderately complex queries requiring 2- to 5-step programs, CIPITR scores at least 3× higher F1 than the competing systems. On one of the hardest class of programs (comparative reasoning) with 5–10 steps, CIPITR outperforms NSM by a factor of 89 and memory networks by 9 times. 1
url	https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00262
work_keys_str_mv	AT sahaamrita complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms AT ansarighulamahmed complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms AT laddhaabhishek complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms AT sankaranarayanankarthik complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms AT chakrabartisoumen complexprograminductionforqueryingknowledgebasesintheabsenceofgoldprograms
_version_	1724635117312278528

Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs

Similar Items