DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction

NeurIPS 2023 · Mohammadreza Pourreza, Davood Rafiei

There is currently a significant gap between the performance of fine-tuned models and prompting approaches using Large Language Models (LLMs) on the challenging task of text-to-SQL, as evaluated on datasets such as Spider. To improve the reasoning of LLMs on this task, we study the effectiveness of decomposing it into smaller sub-tasks. In particular, we show that breaking the generation problem down into sub-problems and feeding the solutions of those sub-problems into LLMs can significantly improve their performance. Our experiments with three LLMs show that this approach consistently improves their simple few-shot performance by roughly 10%, pushing the accuracy of LLMs towards the state of the art (SOTA) or surpassing it. On the holdout test set of Spider, the previous SOTA execution accuracy was 79.9%, and the new SOTA achieved by our approach at the time of writing is 85.3%. Our in-context learning approach beats many heavily fine-tuned models by at least 5%. Additionally, when evaluated on the BIRD benchmark, our approach achieved an execution accuracy of 55.9%, setting a new SOTA on its holdout test set.
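As a rough illustration of the decomposition described above, the sketch below chains four prompting stages (schema linking, query classification and decomposition, SQL generation, and self-correction), feeding each stage's output into the next prompt. This is a minimal sketch only: the prompt wording, function names, and the `call_llm` callable are illustrative placeholders, not the paper's actual prompts or code.

```python
# Hypothetical sketch of a decomposed text-to-SQL prompting pipeline in the spirit of
# DIN-SQL: schema linking -> classification/decomposition -> SQL generation -> self-correction.
# `call_llm` is a placeholder for any prompt-in, text-out LLM client; prompts are illustrative.
from typing import Callable

LLMCall = Callable[[str], str]  # maps a prompt string to the model's text response


def schema_linking(call_llm: LLMCall, question: str, schema: str) -> str:
    """Ask the model which tables, columns, and values the question refers to."""
    prompt = (
        "Identify the tables, columns, and values from the schema that are needed "
        f"to answer the question.\nSchema:\n{schema}\nQuestion: {question}\nLinks:"
    )
    return call_llm(prompt)


def classify_and_decompose(call_llm: LLMCall, question: str, links: str) -> str:
    """Label the query difficulty and, if complex, break it into sub-questions."""
    prompt = (
        "Classify the question as EASY, NON-NESTED, or NESTED, and if it is complex, "
        f"break it into sub-questions.\nSchema links: {links}\nQuestion: {question}\nPlan:"
    )
    return call_llm(prompt)


def generate_sql(call_llm: LLMCall, question: str, links: str, plan: str, schema: str) -> str:
    """Generate SQL, feeding the solutions of the earlier sub-tasks into the prompt."""
    prompt = (
        f"Schema:\n{schema}\nSchema links: {links}\nPlan: {plan}\n"
        f"Write the SQL query for: {question}\nSQL:"
    )
    return call_llm(prompt)


def self_correct(call_llm: LLMCall, question: str, sql: str, schema: str) -> str:
    """Ask the model to review and fix small bugs in its own SQL (self-correction step)."""
    prompt = (
        f"Schema:\n{schema}\nQuestion: {question}\nCandidate SQL: {sql}\n"
        "Fix any bugs in the SQL above and return only the corrected query:"
    )
    return call_llm(prompt)


def din_sql_style_pipeline(call_llm: LLMCall, question: str, schema: str) -> str:
    """Chain the sub-tasks: each stage's output is fed into the next prompt."""
    links = schema_linking(call_llm, question, schema)
    plan = classify_and_decompose(call_llm, question, links)
    sql = generate_sql(call_llm, question, links, plan, schema)
    return self_correct(call_llm, question, sql, schema)
```

In practice, `call_llm` would wrap whichever chat-completion client is available, and each stage's prompt would also carry the few-shot demonstrations that in-context learning relies on.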


Results from the Paper


Task         Dataset   Model             Metric                        Value   Global Rank
Text-To-SQL  BIRD*     DIN-SQL + GPT-4   Execution Accuracy % (Test)   55.90   #8
Text-To-SQL  BIRD*     DIN-SQL + GPT-4   Execution Accuracy % (Dev)    50.72   #8
Text-To-SQL  Spider    DIN-SQL + GPT-4   Exact Match Accuracy (Test)   60      #6
Text-To-SQL  Spider    DIN-SQL + GPT-4   Execution Accuracy (Test)     85.3    #3

*BIRD: BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation
