Crowd Extraction / Removal MelBandRoformer (by @aufr33, @viperx) OpenVINO Models
This repo stores OpenVINO(TM) models in IR format that are used to perform Crowd Extraction & Removal.
The OpenVINO IRs (.xml, .bin files) stored here have been converted from the PyTorch model checkpoints / configs published in this release: https://github.com/ZFTurbo/Music-Source-Separation-Training/releases/tag/v.1.0.4
The OpenVINO IRs are intended to be used with the set of OpenVINO-based AI plugins for Audacity(R), here: https://github.com/intel/openvino-plugins-ai-audacity
To better support a range of OpenVINO-supported devices, the MelBandRoformer model has been split and converted into three separate OpenVINO IRs:
- mel_band_pre.xml / .bin -> Pre-processing operations (such as STFT) which convert input audio waveforms to the frequency domain.
- mel_band_fwd.xml / .bin -> The majority of the layers / ops in the original model.
- mel_band_post.xml / .bin -> Post-processing operations (such as iSTFT) which convert frequency-domain outputs from mel_band_fwd to output waveforms.
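
For anyone loading these IRs outside of the Audacity plugins, the three-stage split above could be handled roughly as follows. This is a minimal sketch, not the plugins' actual loading code: the helper names (`ir_paths`, `missing_files`, `compile_stages`), the `model_dir` layout with all six files in one folder, and the default "CPU" device are assumptions made for illustration. It relies on `openvino.Core().compile_model()` from the OpenVINO Python package, which must be installed separately.

```python
from pathlib import Path

# Stage names taken from the list above, in data-flow order.
STAGES = ("mel_band_pre", "mel_band_fwd", "mel_band_post")


def ir_paths(model_dir):
    """Map each stage name to its (.xml, .bin) pair inside model_dir.

    Assumption: all six files sit directly in one folder.
    """
    root = Path(model_dir)
    return {s: (root / f"{s}.xml", root / f"{s}.bin") for s in STAGES}


def missing_files(model_dir):
    """Return the expected IR files that are not present on disk."""
    return [p for pair in ir_paths(model_dir).values() for p in pair if not p.exists()]


def compile_stages(model_dir, devices=None):
    """Compile each IR, optionally on a different device per stage.

    `devices` is a hypothetical mapping such as {"mel_band_fwd": "GPU"};
    stages that are not listed fall back to "CPU".
    """
    # Imported lazily so the path helpers above work without OpenVINO installed.
    import openvino as ov

    missing = missing_files(model_dir)
    if missing:
        raise FileNotFoundError(f"Missing IR files: {[str(p) for p in missing]}")

    devices = devices or {}
    core = ov.Core()
    # OpenVINO picks up the .bin weights that sit next to each .xml file.
    return {
        stage: core.compile_model(str(xml), devices.get(stage, "CPU"))
        for stage, (xml, _weights) in ir_paths(model_dir).items()
    }
```

Because each stage is compiled separately, the heavy mel_band_fwd stage could be placed on an accelerator while the STFT / iSTFT stages stay on the CPU, for example `compile_stages("models", {"mel_band_fwd": "GPU"})`. The input and output tensor names and shapes of each IR are not documented here, so inspect `.inputs` and `.outputs` on each compiled model before chaining the stages.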
Intel’s Human Rights Disclaimer:
Intel is committed to respecting human rights and avoiding complicity in human rights abuses. See Intel's Global Human Rights Principles. Intel's products and software are intended only to be used in applications that do not cause or contribute to a violation of an internationally recognized human right.