Synthetic training data for Text-to-SQL
Improve model performance with custom Text-to-SQL datasets
Generate synthetic text-to-SQL training datasets tailored to your database and unique data challenges with Gretel's compound AI system, Gretel Navigator.

Improve model performance with custom Text-to-SQL datasets
Introducing the #1 trending dataset on Hugging Face
The Text-to-SQL dataset — an open-source high-quality training dataset — designed using Gretel Navigator was the #1 trending dataset on Hugging Face, receiving 200+ likes and 1k+ downloads in one week.
Based on LLM-as-a-judge comparison of gretelai/synthetic_text_to_sql with the b-mc2/sql-create-context dataset, an extension of the popular Spider dataset, Gretel's dataset consistently scores higher on:
Compliance with SQL standards: +54.5%
SQL correctness: +34.5%
Adherence to instructions: +8.5%

Text-to-SQL stats
The largest and most diverse synthetic Text-to-SQL dataset available to-date
This dataset includes a comprehensive array of SQL tasks: data definition, retrieval, manipulation, analytics & reporting.