Data science
Red Teaming Synthetic Data Models
How we implemented a practical attack on a synthetic data model to validate its ability to protect sensitive information under different parameter settings.
Machine Learning Accuracy Using Synthetic Data
Can synthetic data really be used in machine learning? We explore the utility of synthetic data created from popular datasets and tested on popular ML algorithms.
What Is Data Simulation?
Data simulation is the process of using large quantities of data to predict events and validate models. Get the full data simulation definition.
How to Generate Synthetic Data: Tools and Techniques to Create Interchangeable Datasets
Synthetic data is algorithmically generated data that mirrors the statistical properties of the dataset it’s based on. Learn how to make high-quality synthetic data.
Optimize the Llama-2 Model with Gretel’s Text SQS
How Gretel's data quality analysis tools for evaluating generated text can help you optimize the performance of LLMs, like the Llama-2 model.
How to Safely Query Enterprise Data with Langchain Agents + SQL + OpenAI + Gretel
How combining agent-based methods, LLMs, and synthetic data enables natural language queries for databases and data warehouses, sans SQL.
Gretel GPT Sentiment Swap
Let’s fine-tune and prompt a large language model to swap the sentiment of product reviews!
Comprehensive Data Cleaning for AI and ML
Learn to prepare tabular data for AI and ML with an end-to-end data cleaning workflow.
Generate time-series data with Gretel’s new DGAN model
Announcing the open beta release of our DGAN model type.
Community Insights: Overcoming Medical Class Imbalance with Synthetic Data
An interview with one of Gretel's users on why medical practitioners turn to synthetic data to overcome challenges with clinical data.