Crowd Extraction / Removal MelBandRoformer (by @aufr33, @viperx) OpenVINO Models

This repo stores OpenVINO(TM) models in IR format that are used to perform Crowd Extraction & Removal.

The OpenVINO IRs (.xml, .bin files) stored here have been converted from the pytorch model checkpoints / configs from here: https://github.com/ZFTurbo/Music-Source-Separation-Training/releases/tag/v.1.0.4

The OpenVINO IRs are intended to be used with the set of OpenVINO-based AI plugins for Audacity(R), here: https://github.com/intel/openvino-plugins-ai-audacity

To better support a range of OpenVINO-supported devices, the MelBandRoformer model has been sliced / converted to 3 separate OpenVINO IRs:

mel_band_pre.xml/.bin -> Pre-processing operations (such as STFT) which convert input audio waveforms to frequency domain.
mel_band_fwd.xml / .bin -> The majority of the layers / ops in the original model.
mel_band_post.xml / .bin -> Post-processing operations (such as iSTFT) which convert frequency domain outputs from mel_band_fwd to output waveforms.

Intel’s Human Rights Disclaimer:

Intel is committed to respecting human rights and avoiding complicity in human rights abuses. See Intel's Global Human Rights Principles. Intel's products and software are intended only to be used in applications that do not cause or contribute to a violation of an internationally recognized human right.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support