Synthetic training data for Text-to-SQL

Improve model performance with custom Text-to-SQL datasets

Generate synthetic text-to-SQL training datasets tailored to your database and unique data challenges with Gretel's compound AI system, Gretel Navigator.
Text-to-SQL synthetic training dataset results

Introducing the #1 trending dataset on Hugging Face

The Text-to-SQL dataset — an open-source high-quality training dataset — designed using Gretel Navigator was the #1 trending dataset on Hugging Face, receiving 200+ likes and 1k+ downloads in one week.
Based on LLM-as-a-judge comparison of gretelai/synthetic_text_to_sql with the b-mc2/sql-create-context dataset, an extension of the popular Spider dataset, Gretel's dataset consistently scores higher on:

Compliance with SQL standards: +54.5%

SQL correctness: +34.5%

Adherence to instructions: +8.5%

Gretel's #1 trending dataset
Text-to-SQL stats

The largest and most diverse synthetic Text-to-SQL dataset available to-date

This dataset includes a comprehensive array of SQL tasks: data definition, retrieval, manipulation, analytics & reporting.
Text-to-SQL stats

Contact us to start designing custom Text-to-SQL datasets.

Get Started

Ready to try Gretel?

Make your job easier instantly.
Get started in just a few clicks with a free account.