Neural Analog Restoration Models

Learn more about the technology powering audio restoration

MP3 Music Restoration - Apollo

Upscale low quality MP3 back to high quality. Trained on pairs of high quality music, and their degraded mp3 versions. Restores missing high frequencies and removes compression artifacts. Output sample rate matches the prepared input rate (44.1kHz or 48kHz). Recommended for: online source imports, songs downloaded from the internet, compressed sound, tracks missing >16Khz. Not effective to regenerate 16Khz+ audio? Try AudioSR instead.

Neural Remix - Stable Audio 3 - Stable Audio 3 Medium

ai_remix

Recreates the track with Stability AI's Stable Audio 3 Medium audio-to-audio editing model. This prompt-guided remix mode uses the source audio as a reference while the text prompt steers style, tone, and instrumentation. Recommended for: creative remixes, inpainting missing musical ideas, alternate takes, and low-end or texture repair that benefits from generative reconstruction. Try a short section first for predictable results.

Neural Remix - ACEStep 1.5 XL - ACE-Step 1.5 XL

ai_remix2x

Recreates the track with the ACE-Step XL 2026 music generation model in remix mode. This is similar to SUNO 'cover' mode. It will use similar sounds as the reference, but use new notes if you lower the reference strength. Recommended for: Bass tracks, inpainting missing notes in stems, low end mudiness. Interesting results when blended with the original audio.

AudioSR Upscaler - AudioSR

music

Regenerates high frequencies above the selected cutoff while keeping lower frequencies from the original audio. Recommended for: Low quality recordings, missing high frequencies, hissing noise around 10Khz (hihats, voice, suno hiss). Low thresholds (eg: 3Khz) will change audio the most, while high threshold will mostly preserve what's already here. Not recommended for: Bass tracks, muddy low end. Use Neural Remix instead.

UniverSR Upscaler - UniverSR

musicPRO6x

Upscale music, voice, and sound effects. UniverSR is a 2026 model developed by the University of Seoul which performs audio super-resolution directly in the complex STFT domain using flow matching. Very similar to AudioSR, but with more coherent high ends. Recommended for: Low quality recordings, missing high frequencies, hissing noise around 10Khz (hihats, voice, suno hiss). Low thresholds (eg: 4Khz) will change audio the most, while high threshold will preserve what's already here. Not recommended for: muddy bass or muddy low end. Use Neural Remix instead.

NovaSR Speech Upscaler - NovaSR

voice

Upscaling model trained on English speech. Best for restoring podcasts, voice-overs, or isolated vocal tracks.

FlashSR Upscaler - FlashSR

music

Fast single-step super-resolution model for bandwidth extension and detail recovery. Recommended for: Western music, single instrument upscaling, preparing 44.1Khz stems before Atmos Export. Not recommended for: Bass.

Neural Reconstruction - DACVAE

Leverage the DACVAE neural codec to regenerate all frequencies. Helps with removing out of distribution frequencies. Recommended for: weird hihats sounds in electronic music.

Remove Clipping - Declipping restoration

Fixes harsh crackling when volume is too high. Algorithms find optimal settings to remove all crackling, while still maintaining loudness.

Remove Room Echo - Dialogue isolation

voice

Remove short reverb from voice recordings recorded a room with a laptop microphone. Good for podcasts, voice-overs, and dialogue that have a subtle room echo. Less effective on music or singing.

Remove Noise - Denoising separator

Reduces hiss, hum, background noise, and optional source bleed while keeping the main vocals/instruments. Uses the same model family as stem splitting, but returns a single cleaned track only.

Remove Long Reverb - Dereverb separator

Removes long reverb tails, delay, and echo to make the sound drier. Great for singing, music, and live recording with hall echo. Not too good with subtle echo in clean recording. Uses the same model family as stem splitting, but returns a single dry track only.

Remove Crowd Noise - Crowd-noise separator

Removes audience noise from live recordings while preserving the performance. Uses the same model family as stem splitting, but returns a single cleaned track only.

Keep Only Center Mono - MDX23C CenterWide

Extracts the "phantom center", the content that should be mono in a track. Use this for: bass, kick drums, podcast voice. Removes phaser, chorus, or flanger from instrument stems. Good for mixing.

RE-USE Speech Enhancer - RE-USE

voicePRO6x

Improve clarity, upscale, remove reverb, remove noise and audio glitches in multilingual speech. Recommended for: getting dry vocals, clearer podcasts, noisy voice notes, etc. Not recommended for: full music mixes or instrumental upscaling. Use UniverSR, AudioSR, or a music restoration model instead.

Singing Upscaler - Apollo Voice

voice

Specialized voice upscaling. Restores missing high frequencies and removes compression artifacts in lower frequencies. Output sample rate is 44.1kHz for lower-rate inputs and 48kHz for 48kHz+ inputs.