Skip to content

Getting Started

Welcome to MLX Audio! This section will help you install the library and run your first text-to-speech generation or speech transcription in minutes.

Requirements

  • Python 3.10+
  • Apple Silicon Mac (M1 / M2 / M3 / M4)
  • ffmpeg (required for MP3, FLAC, OGG, Opus, or Vorbis encoding; WAV works without it)

Where to Go Next

  • Installation


    Install mlx-audio with pip or uv, including optional extras for TTS, STT, and development.

    Installation

  • Quick Start: CLI


    Generate speech and transcribe audio from the command line.

    CLI Quick Start

  • Quick Start: Python


    Use the Python API to generate, transcribe, and process audio programmatically.

    Python Quick Start