Neural Analog Restoration Models
Learn more about the technology powering audio restoration
MP3 Music Restoration - Apollo
musicUpscale low quality MP3 back to high quality. Trained on pairs of high quality music, and their degraded mp3 versions. Restores missing high frequencies and removes compression artifacts. Output sample rate matches the prepared input rate (44.1kHz or 48kHz). Recommended for: online source imports, songs downloaded from the internet, compressed sound, tracks missing >16Khz. Not effective to regenerate 16Khz+ audio? Try AudioSR instead.
Neural Remix - Stable Audio 3 - Stable Audio 3 Medium
Recreates the track with Stability AI's Stable Audio 3 Medium audio-to-audio editing model. This prompt-guided remix mode uses the source audio as a reference while the text prompt steers style, tone, and instrumentation. Recommended for: creative remixes, inpainting missing musical ideas, alternate takes, and low-end or texture repair that benefits from generative reconstruction. Try a short section first for predictable results.
Neural Remix - ACEStep 1.5 XL - ACE-Step 1.5 XL
2xRecreates the track with the ACE-Step XL 2026 music generation model in remix mode. This is similar to SUNO 'cover' mode. It will use similar sounds as the reference, but use new notes if you lower the reference strength. Recommended for: Bass tracks, inpainting missing notes in stems, low end mudiness. Interesting results when blended with the original audio.
AudioSR Upscaler - AudioSR
musicRegenerates high frequencies above the selected cutoff while keeping lower frequencies from the original audio. Output sample rate: 48kHz. Recommended for: Low quality recordings, missing high frequencies, hissing noise around 12-13Khz (hihats, sibilance, brilliance). Low thresholds (eg: 3Khz) will change audio the most, while high threshold will mostly preserve what's already here. Not recommended for: Bass tracks, muddy low end. Use Neural Remix instead.
UniverSR Upscaler - UniverSR
musicPRO6xUpscale music, voice, and sound effects to 48 kHz. UniverSR is a 2026 model developed by the University of Seoul which performs audio super-resolution directly in the complex STFT domain using flow matching. Very similar to AudioSR, but with more coherent high ends. Output sample rate: 48kHz. Recommended for: Low quality recordings, missing high frequencies, hissing noise around 12-13Khz (hihats, sibilance, brilliance). Low thresholds (eg: 4Khz) will change audio the most, while high threshold will preserve what's already here. Not recommended for: muddy bass or muddy low end. Use Neural Remix instead.
Singing Upscaler - Apollo Voice
voiceSpecialized voice upscaling. Restores missing high frequencies and removes compression artifacts in lower frequencies. Output sample rate: 48kHz.
NovaSR Speech Upscaler - NovaSR
voiceUpscaling model trained on English speech. Best for restoring podcasts, voice-overs, or isolated vocal tracks. Output sample rate: 48kHz.
FlashSR Upscaler - FlashSR
musicFast single-step super-resolution model for bandwidth extension and detail recovery. Output sample rate: 48kHz. Recommended for: Western music, single instrument upscaling, upscaling 44.1Khz stems to 48 kHz (before Atmos Export). Not recommended for: Bass.
Neural Reconstruction - DACVAE
Leverage the DACVAE neural codec to regenerate all frequencies. Helps with removing out of distribution frequencies. Output sample rate: 48kHz. Recommended for: weird hihats sounds in electronic music.
Remove Clipping - Declipping restoration
Fixes harsh crackling when volume is too high. Algorithms find optimal settings to remove all crackling, while still maintaining loudness.
Remove Room Echo - Dialogue isolation
voiceRemove short reverb from voice recordings recorded a room with a laptop microphone. Good for podcasts, voice-overs, and dialogue that have a subtle room echo. Less effective on music or singing.
Remove Noise - Denoising separator
Reduces hiss, hum, and background noise while keeping the main vocals/instruments. Uses the same model family as stem splitting, but returns a single cleaned track only.
Denoise and debleed - Denoise/debleed separator
Reduces background noise and source bleed while keeping the main instrumental content. Uses the same model family as stem splitting, but returns a single cleaned track only.
Remove Long Reverb - Dereverb separator
Removes long reverb tails, delay, and echo to make the sound drier. Great for singing, music, and live recording with hall echo. Not too good with subtle echo in clean recording. Uses the same model family as stem splitting, but returns a single dry track only.
Remove Crowd Noise - Crowd-noise separator
Removes audience noise from live recordings while preserving the performance. Uses the same model family as stem splitting, but returns a single cleaned track only.
LavaSR Speech Upscaler - LavaSR v2
voiceFast and high-quality speech upscaling. Evolution of NovaSR that uses Vocos architecture for efficiency. Output sample rate: 48kHz.
AERO Upscaler - AERO
musicSpectral super-resolution model with selectable voice and music variants. Output sample rate depends on selected variant: Music (MUSDB)=44.1kHz, Voice 4-16=16kHz, Voice 8-16=16kHz, Voice 8-24=24kHz, Voice 12-48=48kHz. Not recommended for: Bass.