Synthetic training data for Text-to-SQL
Improve model performance with custom Text-to-SQL datasets
Generate synthetic text-to-SQL training datasets tailored to your database and unique data challenges with Gretel's compound AI system, Gretel Navigator.
Text-to-SQL synthetic training dataset results
Introducing the #1 trending dataset on Hugging Face
Gretel's Text-to-SQL dataset, an open-source, high-quality training dataset designed with Gretel Navigator, was the #1 trending dataset on Hugging Face, receiving 200+ likes and 1k+ downloads in one week.
In an LLM-as-a-judge comparison of gretelai/synthetic_text_to_sql with the b-mc2/sql-create-context dataset, an extension of the popular Spider dataset, Gretel's dataset consistently scores higher on:
Compliance with SQL standards: +54.5%
SQL correctness: +34.5%
Adherence to instructions: +8.5%
Text-to-SQL stats
The largest and most diverse synthetic Text-to-SQL dataset available to date
This dataset covers a comprehensive array of SQL tasks: data definition, retrieval, manipulation, and analytics & reporting.
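To make the four task categories concrete, here is a minimal sketch using Python's built-in sqlite3 module. The orders table, its columns, and every query are hypothetical illustrations, not records from the gretelai/synthetic_text_to_sql dataset.

```python
import sqlite3

# Hypothetical schema and queries, one per task category.
# None of these are taken from the gretelai/synthetic_text_to_sql dataset.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Data definition: create a table.
cur.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, region TEXT, amount REAL)")

# Data manipulation: insert rows, then update some of them.
cur.executemany(
    "INSERT INTO orders (region, amount) VALUES (?, ?)",
    [("EU", 100.0), ("EU", 50.0), ("US", 200.0)],
)
cur.execute("UPDATE orders SET amount = amount * 2 WHERE region = 'US'")

# Data retrieval: filter rows by a condition.
eu_orders = cur.execute(
    "SELECT id, amount FROM orders WHERE region = 'EU' ORDER BY id"
).fetchall()

# Analytics and reporting: aggregate per group.
totals = cur.execute(
    "SELECT region, SUM(amount) FROM orders GROUP BY region ORDER BY region"
).fetchall()

conn.close()
```

A text-to-SQL training example would pair a natural-language request (for instance, "total order amount per region") with a statement like the last one.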
Contact us to start designing custom Text-to-SQL datasets.
Resources
Get Started
Ready to try Gretel?
Make your job easier instantly.
Get started in just a few clicks with a free account.
- Join the Synthetic Data Community
Join our Discord to connect with the Gretel team and engage with our community.
- Read our docs
Set up your environment and connect to our SDK.