Introducing Gretel Benchmark

Benchmark is your toolkit to evaluate any synthetic data algorithm on any production dataset

Updated October 6, 2022

What is Gretel Benchmark?

Today we’re announcing the release of Gretel Benchmark, a Python library for comparing any models that generate synthetic data. It provides a set of standardized tests that evaluate those algorithms on synthetic data quality, runtime, and other machine learning use cases.

Getting started

Want to jump right in? You can use Gretel Benchmark by installing the `gretel-trainer` package.

Get started with the quickstart Benchmark notebook and Benchmark documentation.

Keep reading to learn about important features and the detailed Benchmark evaluation report.

Models

Your custom models

It’s easy to define custom models, so you can compare any synthetic data generation algorithm in Benchmark, not just Gretel models.

To provide your own model implementation, define a Python class that meets this interface:

import pandas as pd

class MyCustomModel:
    def train(self, source: str, **kwargs) -> None:
        # your training code here
        ...

    def generate(self, **kwargs) -> pd.DataFrame:
        # your generation code here
        ...

Learn more here about creating your custom model. Make sure to install any third-party libraries you use as dependencies wherever you are running Benchmark.
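
As a minimal sketch of the interface above, the following class simply resamples rows from the source file with replacement. It assumes that `source` is a path to a CSV file and that pandas is installed; the class name `BootstrapModel` and the `num_records` and `seed` keywords are hypothetical choices for illustration, not part of Benchmark.

```python
import pandas as pd


class BootstrapModel:
    """Toy baseline: 'generates' data by resampling training rows."""

    def train(self, source: str, **kwargs) -> None:
        # Assumption: `source` is a path to a CSV file.
        self._training_data = pd.read_csv(source)

    def generate(self, **kwargs) -> pd.DataFrame:
        # Hypothetical keywords: `num_records` (defaults to the training
        # size) and `seed` (for reproducible sampling).
        num_records = kwargs.get("num_records", len(self._training_data))
        sampled = self._training_data.sample(
            n=num_records, replace=True, random_state=kwargs.get("seed")
        )
        return sampled.reset_index(drop=True)
```

Because every output row is a copy of a real row, a model like this offers no privacy protection. It is only useful as a simple baseline to compare real synthetic data models against.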

Gretel models

We’ve also made it easy for you to use Gretel models in Benchmark. Here’s a nifty summary of all the available Gretel models with default configurations:

GretelAuto

This model will automatically pick the best solution between Gretel LSTM and Gretel CTGAN for your dataset (see more below on the two models). This can be helpful if you want the Gretel engine to select the best model based on the characteristics of your dataset.

Gretel LSTM

This model handles a variety of synthetic data tasks, including time-series, tabular, and text data. Gretel LSTM is generally most useful for datasets with a few thousand records and upward. Datasets can include a mix of categorical, continuous, and numerical values.

Gretel CTGAN

This model works well for high-dimensional, largely numeric data. You can use Gretel CTGAN for datasets with more than 20 columns and/or 50,000 rows.
Data requirements: Not ideal if the dataset contains free text fields.

Gretel GPT

This model is useful for natural language or plain text datasets such as reviews, tweets, and conversations.
Data requirements: Dataset must be single-column.

Gretel Amplify

This model is great for generating lots of data quickly.
Note: Gretel Amplify is not a neural network model but instead uses statistical means to generate large amounts of data from an input dataset. The Synthetic Data Quality Score (SQS) for data generated using Gretel Amplify may be lower.

You can also easily modify the Gretel model configs with:

class CustomizedLSTM(GretelModel):
    config = {...} # define configuration here

Find out more about how to use Gretel model classes in the Benchmark documentation.
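
The model guidance above can be condensed into a rule-of-thumb helper. This is only a sketch of the written recommendations (more than 20 columns or 50,000 rows for Gretel CTGAN, single-column text for Gretel GPT, speed for Gretel Amplify); the function `suggest_gretel_model` and its parameters are hypothetical, and it is not how GretelAuto makes its selection.

```python
def suggest_gretel_model(
    rows: int,
    cols: int,
    has_free_text: bool = False,
    prioritize_speed: bool = False,
) -> str:
    """Map dataset traits to a model name using the guidance in this post."""
    if prioritize_speed:
        # Amplify is described as the option for lots of data, quickly.
        return "Gretel Amplify"
    if cols == 1 and has_free_text:
        # GPT requires a single-column natural language dataset.
        return "Gretel GPT"
    if (cols > 20 or rows > 50_000) and not has_free_text:
        # CTGAN is suggested for wide or long datasets without free text.
        return "Gretel CTGAN"
    # LSTM is the general-purpose option.
    return "Gretel LSTM"


print(suggest_gretel_model(rows=41188, cols=21))
print(suggest_gretel_model(rows=10016, cols=1, has_free_text=True))
```

A helper like this is only a starting point. Running Benchmark on your own data is the way to confirm which model actually performs best.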

Data

Benchmark allows you to compare the synthetic data quality and runtime of multiple models (whether custom or Gretel models) on multiple datasets.

To use your own data in Benchmark, you can follow the instructions for `make_dataset` in the docs or check out the Benchmark notebook.

If you need test data, we also provide a list of publicly available datasets that are popular for synthetic data use cases like those in finance, e-commerce, healthcare, and more. You can view and select datasets in Benchmark using these functions:

def list_gretel_datasets(
    datatype: Optional[Union[Datatype, str]] = None,
    tags: Optional[List[str]] = None,
) -> List[Dataset]:
    """Returns a list of Gretel-curated datasets matching the specified datatype and tags.

    Uses “and” semantics, i.e. only returns datasets that match all supplied values.
    `datatype` (optional): Datatype to filter on.
    `tags` (optional): Tags to filter on. Various tags are applied to Gretel-curated datasets.
    """

def get_gretel_dataset(name: str) -> Dataset:
    """Fetches a Gretel-curated dataset from Gretel’s S3 bucket.

    `name` (required): The name of the dataset.
    Raises an exception if no dataset exists with the supplied name.
    """
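
To illustrate the “and” semantics described for `list_gretel_datasets`, here is a small stand-in filter over an in-memory catalog. It does not call Benchmark: `CatalogEntry`, `CATALOG`, and `filter_catalog` are hypothetical names, the dataset names and datatypes are taken from the report table in this post, and the tags are made up for the example.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Optional


@dataclass(frozen=True)
class CatalogEntry:
    """Hypothetical dataset record; not Benchmark's Dataset class."""

    name: str
    datatype: str
    tags: FrozenSet[str]


# Names and datatypes come from the report table; tags are illustrative.
CATALOG = [
    CatalogEntry("bank_marketing_small", "tabular_mixed", frozenset({"finance", "marketing"})),
    CatalogEntry("dow_jones_index", "time_series", frozenset({"finance"})),
    CatalogEntry("telco_customer_churn", "tabular_mixed", frozenset({"telecom"})),
]


def filter_catalog(
    datatype: Optional[str] = None, tags: Optional[List[str]] = None
) -> List[CatalogEntry]:
    """Keep entries matching the datatype (if given) AND every requested tag."""
    wanted = set(tags or [])
    return [
        entry
        for entry in CATALOG
        if (datatype is None or entry.datatype == datatype)
        and wanted <= entry.tags  # subset test: all requested tags present
    ]


matches = filter_catalog(datatype="tabular_mixed", tags=["finance"])
print([entry.name for entry in matches])
```

With no arguments the filter returns everything, and each additional argument can only narrow the result, which is the behavior the docstring describes.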

Evaluations

We created a set of standard tests that make up the Gretel Benchmark evaluations on algorithms for synthetic data generation. The Benchmark report shows:

Data type
Data shape
Synthetic Data Quality Score (SQS): an evaluation, developed by Gretel, of the quality of synthetic data. Learn more about SQS on our SQS FAQ.
Train time
Generate time
Total runtime
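
The three timing columns can be approximated for any model that follows the `train`/`generate` interface. The harness below is a sketch under that assumption. `time_model` and `SleepyModel` are hypothetical names, and Benchmark's own measurements may include overhead that this does not capture.

```python
import time
from typing import Any, Dict


def time_model(model: Any, source: str) -> Dict[str, float]:
    """Time train() and generate() separately and report their sum."""
    start = time.perf_counter()
    model.train(source)
    train_time = time.perf_counter() - start

    start = time.perf_counter()
    model.generate()
    generate_time = time.perf_counter() - start

    return {
        "train_time_sec": train_time,
        "generate_time_sec": generate_time,
        "total_time_sec": train_time + generate_time,
    }


class SleepyModel:
    """Placeholder model that only sleeps, so there is something to time."""

    def train(self, source: str, **kwargs) -> None:
        time.sleep(0.02)

    def generate(self, **kwargs) -> None:
        time.sleep(0.01)


timings = time_model(SleepyModel(), "unused.csv")
print(timings)
```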

The Benchmark report

Want to evaluate Gretel models on your industry use case? For a quick and easy look into how different Gretel models perform on popular machine learning datasets, check out our Benchmark report below. When you run Benchmark, you’ll also see an evaluation report like this one.

You can use a Benchmark report like the one shown here to evaluate which Gretel model is best for your synthetic data goals. For example, Gretel LSTM consistently generates synthetic data with a high Synthetic Data Quality Score (SQS) on multiple types of tabular or complex data. As seen in the results below, Gretel CTGAN is great for particularly long or wide datasets and generally has a faster runtime. If you’re looking to quickly generate lots of data, Gretel Amplify produces results in as little as 1/10 of the time of the other models (check out the fast train and generate times!). Gretel GPT-X generates high-quality synthetic data for natural language datasets. Depending on your specific goals or constraints, you may find particular Gretel models best suited for your use case. You can reference the Benchmark report below to guide how you evaluate Gretel models, or of course, try Benchmark yourself!

Industry	Input data	Model	DataType	Rows	Cols	SQS	Train time (sec)	Generate time (sec)	Total time (sec)
Ads, Finance, Marketing	bank_marketing_large/data.csv	GretelAmplify	tabular_mixed	41188	21	73	36.07	29.75	65.82
	bank_marketing_large/data.csv	GretelCTGAN	tabular_mixed	41188	21	85	1300.75	33.24	1333.99
	bank_marketing_large/data.csv	GretelLSTM	tabular_mixed	41188	21	84	317.79	401.04	718.83
	bank_marketing_small/data.csv	GretelAmplify	tabular_mixed	4521	17	80	24.41	23.81	48.22
	bank_marketing_small/data.csv	GretelCTGAN	tabular_mixed	4521	17	84	169.32	175.63	344.95
	bank_marketing_small/data.csv	GretelLSTM	tabular_mixed	4521	17	84	326.26	96.73	422.99
	dow_jones_index/data.csv	GretelAmplify	time_series	750	16	76	81.5	23.32	104.82
	dow_jones_index/data.csv	GretelCTGAN	time_series	750	16	70	221.58	129.15	350.73
	dow_jones_index/data.csv	GretelLSTM	time_series	750	16	83	424.2	64.66	488.86
	banking77/data.csv	GretelAmplify	natural_language	10016	1	100	84.42	58.36	142.78
	banking77/data.csv	GretelLSTM	natural_language	10016	1	100	318.19	96.17	414.36
	banking77/data.csv	GretelGPTX	natural_language	10016	1	100	487.56	3675	487.56
E-commerce	bike_sales/data.csv	GretelAmplify	tabular_numeric	16519	24	79	119.94	30	149.94
	bike_sales/data.csv	GretelLSTM	tabular_numeric	16519	24	88	911.59	249.68	1161.27
	car_evaluation/data.csv	GretelAmplify	tabular_numeric	1728	7	85	24.06	23.7	47.76
	car_evaluation/data.csv	GretelCTGAN	tabular_numeric	1728	7	77	201.19	44.5	245.69
	car_evaluation/data.csv	GretelLSTM	tabular_numeric	1728	7	87	357.66	54.02	411.68
	credit_card_payments/data.csv	GretelAmplify	tabular_mixed	30000	25	74	107.13	29.97	137.1
	credit_card_payments/data.csv	GretelCTGAN	tabular_mixed	30000	25	83	1229	33.4	1262.4
	credit_card_payments/data.csv	GretelLSTM	tabular_mixed	30000	25	81	1468.11	579.92	2048.03
	olist_order_payments/data.csv	GretelAmplify	tabular_numeric	103886	5	69	529.06	40.41	569.47
	olist_order_payments/data.csv	GretelLSTM	tabular_numeric	103886	5	93	4201.89	897.22	5099.11
Employment	data_science_job_candidates/data.csv	GretelAmplify	tabular_mixed	19158	14	88	107.66	23.26	130.92
	data_science_job_candidates/data.csv	GretelCTGAN	tabular_mixed	19158	14	90	609.02	128.11	737.13
	data_science_job_candidates/data.csv	GretelLSTM	tabular_mixed	19158	14	93	358.21	276.29	634.5
	ibm_employee_attrition/data.csv	GretelAmplify	tabular_mixed	1470	37	88	24.09	20.5	44.59
	ibm_employee_attrition/data.csv	GretelCTGAN	tabular_mixed	1470	37	80	368.13	33.93	402.06
	ibm_employee_attrition/data.csv	GretelLSTM	tabular_mixed	1470	37	93	365.18	127.79	492.97
Energy, Telecom	energydata_complete/data.csv	GretelAmplify	time_series	19735	29	74	103.68	33.09	136.77
	energydata_complete/data.csv	GretelLSTM	time_series	19735	29	93	1531.88	400.29	1932.17
	telco_customer_churn/data.csv	GretelAmplify	tabular_mixed	7043	33	82	40.84	30.17	71.01
	telco_customer_churn/data.csv	GretelCTGAN	tabular_mixed	7043	33	79	5911.34	55.36	5966.7
	telco_customer_churn/data.csv	GretelLSTM	tabular_mixed	7043	33	76	787.7	155.55	943.25
Environment, Food	air_quality_uci/data.csv	GretelAmplify	time_series	9357	15	65	96.15	52.47	148.62
	air_quality_uci/data.csv	GretelCTGAN	time_series	9357	15	62	6656.12	54.49	6710.61
	air_quality_uci/data.csv	GretelLSTM	time_series	9357	15	89	398.42	211.66	610.08
	winequality_red/data.csv	GretelAmplify	tabular_numeric	1599	12	82	81.94	23.68	105.62
	winequality_red/data.csv	GretelCTGAN	tabular_numeric	1599	12	61	76.14	43.87	120.01
	winequality_red/data.csv	GretelLSTM	tabular_numeric	1599	12	89	221.92	54.69	276.61
	winequality_white/data.csv	GretelAmplify	tabular_numeric	4898	12	88	24.65	23.27	47.92
	winequality_white/data.csv	GretelCTGAN	tabular_numeric	4898	12	81	139.03	33.15	172.18
	winequality_white/data.csv	GretelLSTM	tabular_numeric	4898	12	91	287.84	76.4	364.24
Government	portuguese_election_data/data.csv	GretelAmplify	tabular_numeric	21643	28	52	31.33	107.04	138.37
	portuguese_election_data/data.csv	GretelCTGAN	tabular_numeric	21643	28	72	928.15	128.73	1056.88
	portuguese_election_data/data.csv	GretelLSTM	tabular_numeric	21643	28	81	455.56	327.19	782.75
	adult/data.csv	GretelAmplify	tabular_mixed	32561	15	85	213.54	58.46	272
	adult/data.csv	GretelCTGAN	tabular_mixed	32561	15	87	965.31	128.03	1093.34
	adult/data.csv	GretelLSTM	tabular_mixed	32561	15	94	667.21	615.08	1282.29
Healthcare	processed_cleveland_heart_disease_uci/data.csv	GretelAmplify	tabular_numeric	303	14	83	35.87	23.01	58.88
	processed_cleveland_heart_disease_uci/data.csv	GretelCTGAN	tabular_numeric	303	14	70	66.97	33.48	100.45
	processed_cleveland_heart_disease_uci/data.csv	GretelLSTM	tabular_numeric	303	14	91	221.56	54.26	275.82
	breast_cancer_wisconsin/data.csv	GretelAmplify	tabular_numeric	699	11	55	23.89	23.55	47.44
	breast_cancer_wisconsin/data.csv	GretelCTGAN	tabular_numeric	699	11	56	67.4	203.11	270.51
	breast_cancer_wisconsin/data.csv	GretelLSTM	tabular_numeric	699	11	83	206.88	64.73	271.61
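
One way to read a report like this is to look, per dataset, at which model scored the highest SQS and how much longer it took than the fastest model. The sketch below does that for nine rows copied from the table above; the variable names and the `slowdown` metric are illustrative choices, not something Benchmark outputs.

```python
from collections import defaultdict

# (dataset, model, SQS, total time in seconds), copied from the report table.
ROWS = [
    ("bank_marketing_large", "GretelAmplify", 73, 65.82),
    ("bank_marketing_large", "GretelCTGAN", 85, 1333.99),
    ("bank_marketing_large", "GretelLSTM", 84, 718.83),
    ("dow_jones_index", "GretelAmplify", 76, 104.82),
    ("dow_jones_index", "GretelCTGAN", 70, 350.73),
    ("dow_jones_index", "GretelLSTM", 83, 488.86),
    ("winequality_white", "GretelAmplify", 88, 47.92),
    ("winequality_white", "GretelCTGAN", 81, 172.18),
    ("winequality_white", "GretelLSTM", 91, 364.24),
]

# Group the results by dataset.
by_dataset = defaultdict(list)
for dataset, model, sqs, total_sec in ROWS:
    by_dataset[dataset].append((model, sqs, total_sec))

summary = {}
for dataset, results in by_dataset.items():
    best_quality = max(results, key=lambda r: r[1])  # highest SQS
    fastest = min(results, key=lambda r: r[2])  # lowest total time
    summary[dataset] = {
        "best_sqs_model": best_quality[0],
        "fastest_model": fastest[0],
        # How many times longer the best-SQS model ran than the fastest one.
        "slowdown": round(best_quality[2] / fastest[2], 1),
    }

for dataset, info in summary.items():
    print(dataset, info)
```

On these rows, Gretel Amplify is the fastest model for every dataset, while the highest SQS comes from Gretel CTGAN or Gretel LSTM at several times the runtime. That is the quality-versus-speed trade-off the report is meant to surface.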

Learn more

You can find out more in the Benchmark documentation. Questions or comments? We’re always available in our Discord community - send us a note! Happy synthesizing!

