Skip to content

Guides

Practical how-to guides for common mlx-audio workflows.

  • Streaming Audio


    Stream TTS output and STT transcription for low-latency applications.

    Streaming Guide

  • Voice Cloning


    Clone voices using reference audio with CSM, Qwen3-TTS, and other models.

    Voice Cloning Guide

  • :material-compress:{ .lg .middle } Quantization


    Reduce model size and speed up inference with 3-bit to 8-bit quantization.

    Quantization Guide

  • Web UI & API Server


    Run the OpenAI-compatible REST API and interactive web interface.

    Web UI & API Guide