From Natural Language Processing to Neural Databases

Thorne, James; Yazdani, Majid; Saeidi, Marzieh; Silvestri, Fabrizio; Riedel, Sebastian; Halevy, Alon

doi:10.14778/3447689.3447706

In recent years, neural networks have shown impressive performance gains on long-standing AI problems, such as answering queries from text and machine translation. These advances raise the question of whether neural nets can be used at the core of query processing to derive answers from facts, even when the facts are expressed in natural language. If so, it is conceivable that we could relax the fundamental assumption of database management, namely, that our data is represented as fields of a pre-defined schema. Furthermore, such technology would enable combining information from text, images, and structured data seamlessly. This paper introduces neural databases, a class of systems that use NLP transformers as localized answer derivation engines. We ground the vision in NeuralDB, a system for querying facts represented as short natural language sentences. We demonstrate that recent natural language processing models, specifically transformers, can answer select-project-join queries if they are given a set of relevant facts. However, they cannot scale to non-trivial databases, nor can they answer set-based and aggregation queries. Based on these insights, we identify specific research challenges that must be addressed to build neural databases. Some of the challenges require drawing upon the rich literature in data management, and others pose new research opportunities to the NLP community. Finally, we show that with preliminary solutions, NeuralDB can already answer queries over thousands of sentences with very high accuracy.

From Natural Language Processing to Neural Databases / Thorne, James; Yazdani, Majid; Saeidi, Marzieh; Silvestri, Fabrizio; Riedel, Sebastian; Halevy, Alon. - In: PROCEEDINGS OF THE VLDB ENDOWMENT. - ISSN 2150-8097. - 14:6(2021), pp. 1033-1039. [10.14778/3447689.3447706]



Publication year: 2021

Keywords: nlp; neural databases; natural language inference

Type: 01 Journal publication::01a Journal article
		

Files attached to this record

File	Size	Format
Thorne_From-natural_2021.pdf (open access; note: DOI 10.14778/3447689.3447706; type: publisher's version, published with the publisher's layout; license: Creative Commons)	447.86 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1507612
