Windows · Offline · Single User · Free

Click. Install. Make music.

The full StemTube audio engine — stems, chords, beats, lyrics, mixer — packaged as a single 1.4 MB Windows installer. The engine downloads itself on first launch. After that, everything runs offline on your machine.

v1.0.0 · 1.4 MB installer · Windows 10/11 x64 · ~6 GB after first-launch download

StemTube Desktop Mixer Interface
1.4
MB installer
170
Chord Types
0
Cloud Dependency
MIT
License

What StemTube Desktop does

Local-first audio analysis. Upload your own files, run everything on your own machine. No account, no telemetry, no internet required after install.

AI Stem Separation

Split any song into vocals, bass, drums, guitar, piano and "other" using Demucs htdemucs_6s. NVIDIA GPU acceleration with CUDA, automatic CPU fallback.

Chord Detection

Hybrid BTC Transformer + madmom CRF pipeline detecting 170 chord types. Real-time chord display with guitar diagrams synchronized to playback.

Beat Tracking

madmom CNN + DBN downbeat detection. Detects beats and bar positions (1/2/3/4). Tap-sync to nudge the grid manually if needed.

Lyrics Transcription

faster-whisper with word-level timestamps and karaoke-style highlighting. Dedicated Focus mode for distraction-free practice.

Full Mixer

Per-stem volume, pan, mute and solo. Tempo adjustment with pitch correction (SoundTouch). Waveform with playhead scrubbing.

Local & Private

Single-user, no account, no telemetry. Audio files never leave your machine. After first-launch download, the app works fully offline.

From installer to mixer in three steps

Install (1.4 MB)

Download the installer and run it. On first launch, the app fetches its 2.7 GB engine (Python + PyTorch + CUDA + Demucs + madmom + Whisper) from GitHub. Once. After that, launches are instant.

Upload a track

Drop any MP3, WAV, FLAC, M4A or other audio file. Demucs separates stems, BTC + madmom detect chords and beats, Whisper transcribes lyrics. All in parallel, all local.

Mix and practice

Open the mixer: mute the guitar and play along with chord diagrams in real time, slow down the tempo without changing pitch, switch to karaoke focus to learn the lyrics.

See it in action

Native Windows app, full audio engine, designed for musicians.

Stem extraction interface
Stem Extraction — AI separation into 6 stems with per-stem download
Chord detection grid
Chord Detection — real-time chord grid with guitar fingering diagrams
Focus lyrics view
Lyrics Focus — word-by-word karaoke highlighting with tempo control
Main mixer
Main Mixer — per-stem controls, waveform, transport, and analysis pane
Need more than one user, or mobile access?

There's a self-hosted server edition

Same audio engine, plus: multi-user with per-account libraries, mobile PWA on iOS and Android, real-time jam sessions, shared global library, optional YouTube import. Runs on Linux, Windows or macOS — Python 3.12 setup script included.

  • Mobile PWA (iOS & Android)
  • Multi-user with admin panel
  • Jam sessions with synchronized playback
  • Shared global library across users

A side project that grew up

StemTube started as a vibe-coding experiment — what happens when you let AI-assisted development guide the process without a rigid plan? What was meant to be a weekend hack turned into a full music workstation.

The Server edition (self-hosted, multi-user, mobile PWA, jam sessions) is the original. This Desktop edition packages the same engine for people who just want to double-click an installer and have it work. No Python knowledge required, no network setup, no account.

After 25 years in IT, I wanted to give back to the open-source ecosystem that gave me so much throughout my career. StemTube is that contribution — imperfect, evolving, and open to everyone.

Everything is open source — the Tauri shell, the Python backend, the audio pipeline. No tracking, no account required, no cloud dependency. Your music, your machine.

Ready to try it?

Download the installer, double-click, wait for the engine to download on first launch, and start mixing.

Unsigned installer

The installer is not signed with a Windows code-signing certificate (paid yearly subscription). Windows SmartScreen will display a warning on first launch. Click "More info" then "Run anyway". The installer and the engine zip are both hosted publicly on GitHub Releases for full transparency.