Back to all posts

Open Source

GLiNER Models for PII Detection through Fine-Tuning on Gretel-Generated Synthetic Documents

Gretel fine-tuned, synthetically-enhanced GLiNER models for better PII & PHI detection—datasets included.
Read more...

Introducing world's largest synthetic open-source Text-to-SQL dataset

Gretel releases largest open source Text-to-SQL dataset to accelerate AI model training
Read more...

Gretel Unlocks PII Detection with Synthetic Financial Document Dataset

Gretel releases a new synthetic financial document dataset to empower AI developers in building customized and highly performant sensitive data detection systems.
Read more...

An Awesome Synthetic Multilingual Prompts Dataset

Gretel's latest open synthetic dataset aims to enhance LLM interactions and contributes to the popular 'awesome-chatGPT-prompts' GitHub repository.
Read more...

Fine-Tuning CodeLlama on Gretel's Synthetic Text-to-SQL Dataset using Amazon SageMaker JumpStart

Fine-tune CodeLlama with Gretel's Synthetic Text-to-SQL on BIRDBench, achieving a 36% relative improvement in EX and 38% in VES.
Read more...

The explosion of small language models (SLMs) and license confusion

Rapid SLM releases highlight the need for clarity on licenses + lineage, which are crucial for enterprises navigating open-weight models and synthetic data ownership
Read more...

Addressing Concerns of Model Collapse from Synthetic Data in AI

How thoughtful, high-quality synthetic data generation, rather than 'indiscriminate' use, can prevent model collapse.
Read more...
Copyright © 2022 Gretel.ai

Conditional Text Generation by Fine Tuning Gretel GPT

Augment machine learning datasets with synthetically generated text and labels using an open-source implementation of GPT-3.
Read more...

How to use Weights & Biases with Gretel.ai

How to use Weights & Biases’ ML hyperparameter sweeps tool to optimize the accuracy of your synthetic data.
Read more...
Copyright © 2022 Gretel.ai

Measure the Quality of any Synthetic Dataset with Gretel Evaluate

Assessing the efficacy and quality of synthetic data with Gretel Evaluate API.
Read more...