Synthetics
GSM-Symbolic: Analyzing LLM Limitations in Mathematical Reasoning and Potential Solutions
What The Recent Paper on LLM Reasoning Got Right—And What It Missed.
Read more...Machine Learning Accuracy Using Synthetic Data
Can synthetic data really be used in machine learning? We explore the utility of synthetic data created from popular datasets and tested on popular ML algorithms.
Read more...Introducing world's largest synthetic open-source Text-to-SQL dataset
Gretel releases largest open source Text-to-SQL dataset to accelerate AI model training
Read more...Transforms and Synthetics on Relational Databases
A walkthrough of our new multi-table transform and multi-table synthetics notebooks, which can be used independently or simultaneously.
Read more...Test Data Generation: Uses, Benefits, and Tips
Test data generation is the process of creating new data that replicates an original dataset. Here’s how developers and data engineers use it.
Read more...Generate Synthetic Databases with Gretel Relational
Introducing Gretel Relational, enabling organizations to generate high-quality synthetic databases while preserving cross-table relationships.
Read more...Fine-Tuning CodeLlama on Gretel's Synthetic Text-to-SQL Dataset using Amazon SageMaker JumpStart
Fine-tune CodeLlama with Gretel's Synthetic Text-to-SQL on BIRDBench, achieving a 36% relative improvement in EX and 38% in VES.
Read more...Teaching AI to Think: A New Approach with Synthetic Data and Reflection
Gretel's synthetic GSM8k dataset shows an 84% improvement for AI Reasoning tasks vs synthetic data generated without reflection.
Read more...What Is Data Simulation?
Data simulation is the process of using large quantities of data to predict events and validate models. Get the full data simulation definition.
Read more...Gretel announces partnership with Databricks to improve Enterprise AI performance
Gretel partners with Databricks to seamlessly integrate synthetic data workflows and improve model performance for Enterprise AI.
Read more...