Yev Meyer

Sample-to-Dataset: Generate Rich Datasets from Limited Samples Using Data Designer

Seed to succeed: use the sample-to-dataset workflow to create diverse, large-scale synthetic datasets tailored to your needs with nothing but a few samples.
Read more...

Introducing world's largest synthetic open-source Text-to-SQL dataset

Gretel releases largest open source Text-to-SQL dataset to accelerate AI model training
Read more...

The explosion of small language models (SLMs) and license confusion

Rapid SLM releases highlight the need for clarity on licenses + lineage, which are crucial for enterprises navigating open-weight models and synthetic data ownership
Read more...