Alex Watson

2025: The Year Synthetic Data Goes Mainstream

How synthetic data is transforming enterprise AI in 2025 by addressing privacy, fine-tuning, and scaling challenges.
Read more...

How to Create Synthetic Data at High Quality for Fine-Tuning LLMs

Gretel Navigator’s synthetic data generation outperformed OpenAI's GPT-4 by 25.6%, surpassed Llama3-70b by 48.1%, and exceeded human expert-curated data by 73.6%.
Read more...

Gretel Unlocks PII Detection with Synthetic Financial Document Dataset

Gretel releases a new synthetic financial document dataset to empower AI developers in building customized and highly performant sensitive data detection systems.
Read more...

Teaching AI to Think: A New Approach with Synthetic Data and Reflection

Gretel's synthetic GSM8k dataset shows an 84% improvement for AI Reasoning tasks vs synthetic data generated without the Reflection technique.
Read more...

Privacy-preserving AI development with Azure & Gretel

Leveraging Gretel's privacy-preserving synthetic data generation platform to fine-tune Azure OpenAI Service models in the financial domain.
Read more...

Synthetic Data and the Data-centric Machine Learning Life Cycle

Gretel's synthetic data platform overcomes challenges across the data-centric machine learning life cycle to enable AI and ML solutions.
Read more...

Synthesizing Private Patient Data with Gretel: A Step-by-Step Guide

Create privacy-safe synthetic patient data with Gretel, ensuring compliance, secure sharing, and actionable insights for AI and machine learning in healthcare.
Read more...

GSM-Symbolic: Analyzing LLM Limitations in Mathematical Reasoning and Potential Solutions

What The Recent Paper on LLM Reasoning Got Right—And What It Missed.
Read more...

Addressing Concerns of Model Collapse from Synthetic Data in AI

How thoughtful, high-quality synthetic data generation, rather than 'indiscriminate' use, can prevent model collapse.
Read more...
Copyright © 2022 Gretel.ai

How to Generate Synthetic Data: Tools and Techniques to Create Interchangeable Datasets

Synthetic data is algorithmically generated data that mirrors the statistical properties of the dataset it’s based on. Learn how to make high-quality synthetic data.
Read more...