Back to all posts

Open Source

Auto-anonymize production datasets for development

In this post, we walk through building a data pipeline that will automatically transform datasets so they can be safely used in development environments.
Read more...
Source: enjoynz, via iStockPhoto

Innovating With FastText and Table Headers

Look at how FastText word embeddings can help to quickly understand new datasets, and build more consistent labels for your own data.
Read more...
Copyright 2021 Gretel.

Instrumenting Kubernetes in AWS with Terraform and FluentBit

In this blog, we will use Fluent Bit to collect logs from AWS EKS cluster applications.
Read more...

Synthetic Data Configuration Templates

Our new configuration templates will help you pick some of the right parameters needed to train your synthetic data models.
Read more...
Source: Kubkoo, via iStockPhoto

Reducing AI bias with Synthetic data

Generate artificial records to balance biased datasets and improve overall model accuracy.
Read more...
Credit: sylv1rob1 via ShutterStock

Create a Location Generator GAN

How to train a FastCUT GAN on public location data from a few cities to predict realistic e-bike locations across the world.
Read more...

README.V2

We founded Gretel based on our beliefs that data shouldn’t be scary.
Read more...

Introducing Gretel Blueprints

We are launching Gretel Blueprints, making it easy to anonymize and balance datasets with just a few clicks.
Read more...
Copyright © 2022 Gretel.ai

Create Synthetic Time-series Data with DoppelGANger and PyTorch

Generate synthetic time series data with Gretel.ai’s open-source PyTorch implementation of DoppelGANger.
Read more...