| import gradio as gr | |
| with gr.Blocks() as app: | |
| gr.Markdown( | |
| """ | |
| Synthetic data is artificially generated information that mimics real-world data. It allows overcoming data limitations by expanding or enhancing datasets. | |
| Introducing the Synthetic Data Generator, a user-friendly application that takes a no-code approach to creating custom datasets with Large Language Models (LLMs). The best part: A simple step-by-step process, making dataset creation a non-technical breeze, allowing anyone to create datasets and models in minutes and without any code. | |
| The synthetic data generator takes your custom prompt and returns a dataset for your use case, using a synthetic data pipeline. In the background this is powered by [distilabel](https://distilabel.argilla.io/latest/) and the [free Hugging Face text-generation API](https://huggingface.co/docs/api-inference/en/index) but we don't need to worry about these complexities and we can focus on using the UI. | |
| - Read more in [our announcement blog post](https://huggingface.co/blog/synthetic-data-generator) | |
| - Find the library on [GitHub](https://github.com/argilla-io/synthetic-data-generator) | |
| """ | |
| ) | |