Contents Menu Expand Light mode Dark mode Auto light/dark mode
Light Logo Dark Logo
Logo
v0.46.0
Star

Get Started

  • πŸŽ‰ Installation
  • πŸš€ Quick Tour
    • Synthetic Data Generation
    • Motivation and Design
    • Training an β€œAbstract to Tweet Model” with Fully Synthetic Data
    • Generating Training Data with Attributed Prompts
    • Distilling GPT-4 Capabilities to GPT-3.5
    • Augmenting an Existing Dataset
    • Cleaning an Existing Dataset
    • Bootstrapping Synthetic Few-Shot Examples
    • Instruction-Tuning and Aligning Models
    • Instruction-Tuning a LLM
    • Aligning a LLM with Human Preferences
    • Training a Self-Improving LLM with Self-Rewarding
  • πŸ’‘ Motivation and Design
  • πŸ“– Overview Guide
  • πŸŽ“ Advanced Usage
    • Caching and Saved Outputs
    • Creating a New DataDreamer ...
      • Step
      • LLM
      • Trainer
      • Other
    • Parallelization
      • Running Steps in Parallel
      • Running Models on Multiple GPUs
      • Training Models on Multiple GPUs
    • Quantization
    • Parameter-Efficient Training

References

  • API Reference
    • datasets
    • embedders
    • errors
    • llms
    • retrievers
    • steps
    • task_models
    • trainers
  • Index

About

  • GitHub
  • PyPI
  • License
  • Citation
  • Contact
  • Contributing
Back to top
Copyright Β© 2025, Ajay Patel (ajayp@upenn.edu)
Made with Furo
πŸ‘¨πŸ½β€πŸ’»