What's new
LiteRECORDS

Register a free account today to become a member! Once signed in, you'll be able to participate on this site by adding your own topics and posts, as well as connect with other members through your own private inbox!

  • Guest, before your account can be reviewed you must click the activation link sent to your email account. Please ensure you check your junk folders.
    If you do not see the link after 24 hours please open a support ticket.

Powerful New Vocal Remover AI - Instructions

[MENTION=10167]NewAgeRipper[/MENTION]

Regarding the GoogleColab, I've been pretty hands off of it. [MENTION=26971]djtayz[/MENTION] - Although I've confirmed the GUI will never work with GoogleColab, are there any code changes you'd like me to implement that might make the process easier? Let me know and edit the code accordingly.

I've just been sticking with Google Colab until a final is released or the conversion process is updated in Google Colab. [MENTION=26971]djtayz[/MENTION] may leave instructions for a "fileless method in Google Colab. Waiting for that one as well.
 
I've just been sticking with Google Colab until a final is released or the conversion process is updated in Google Colab. [MENTION=26971]djtayz[/MENTION] may leave instructions for a "fileless method in Google Colab. Waiting for that one as well.

I got it working to the point until you process a track. Working with this thing is tedious, I will try to make it happen. But I don't want to give promises, since I'm not really a coder.
 
I got it working to the point until you process a track. Working with this thing is tedious, I will try to make it happen. But I don't want to give promises, since I'm not really a coder.
No biggie. It was just the thought of it which would be awesome if it works.
 
***UPDATE***

Here's the beta release for the Ultimate Vocal Remover GUI v4 with 3 brand new models!

Install Instructions:

1. Install Python via the following link and make sure to check the box that says "Add Python 3.6 to PATH" - https://www.python.org/ftp/python/3.6.8/python-3.6.8-amd64.exe
2. Open the cmd prompt and run the following -

pip install Pillow
pip install tqdm==4.30.0
pip install librosa==0.6.3
pip install opencv-python
pip install numba==0.48.0
pip install SoundFile
pip install soundstretch
pip install torch==1.6.0+cu101 torchvision==0.7.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html

3. Download Ultimate Vocal Remover GUI v4 Beta here - http://www.mediafire.com/file/nybrcjk6m9gq6ss/V4GUI-BETA.zip/file
4. Extract the V4GUI-BETA folder to your documents from the zip file
5. Now go into the V4GUI-BETA folder and double click the VocalRemover_v4.py to verify it works
6. Create and place a shortcut for the VocalRemover_v4.py file to your desktop for easy access
7. *****READ EVERYTHING BELOW FOR PROPER CONVERSIONS!*****

Here's a list of the models (THE SR & HOP LENGTH VALUES MUST BE IN LINE WITH THE MODEL IN ORDER FOR YOUR CONVERSIONS TO COME OUT RIGHT!):

- MGM-32000-512.pth - Set the SR to 32000 and the HOP LENGTH to 512 before doing conversions with this model!
- MGM-44100-512.pth - Set the SR to 44100 and the HOP LENGTH to 512 before doing conversions with this model!
- MGM-44100-1024.pth - Set the SR to 44100 and the HOP LENGTH to 1024 before doing conversions with this model!

A few notes & added features:

- Remembers "Save to" and last folders accessed.
- The "Add New Model(s)" button automatically opens the models directory. When you add new models to the appropriate folders, the application will automatically detect them so you don't have to restart it.
- I only have instrumental models for v4 at this time. I still need to train a few stacked models and a vocal model so ignore the options for those... for now.
- Keep in mind this is the beta so please feel free to report any bugs to me here!
- More enhamcements will be made as well

***TROUBLESHOOTING***

- If the VocalRemover_v4.py file won't open under any circumstances, please do the following

1. Open the cmd prompt from the V4GUI-BETA directory
2. Run the following - python VocalRemover_v4.py
3. Copy and paste the error in the cmd prompt here for further assistance

Link:

http://www.mediafire.com/file/nybrcjk6m9gq6ss/V4GUI-BETA.zip/file
 
Last edited:
I think the biggest issue will be remembering to go back and point or set ffmpeg for Vocal Remover that we had to do to start with.
 
:

- MGM-32000-512.pth - Set the SR to 32000 and the HOP LENGTH to 512 before doing conversions with this model!
- MGM-44100-512.pth - Set the SR to 44100 and the HOP LENGTH to 512 before doing conversions with this model!
- MGM-44100-1024.pth - Set the SR to 44100 and the HOP LENGTH to 1024 before doing conversions with this model!


So which of these is suppose to be the best?
 
***UPDATE***

Here's the beta release for the Ultimate Vocal Remover GUI v4 with 3 brand new models!

OK, so I've been playing with the new beta release just posted and I'm finding that the vocal removal isn't quite on par with the current Google Colab results. I tried all 3 models and it appears that for the song I already have done using the Google Colab method, I don't really hear any improvement differences. That's not saying it's useless. It's saying I will have to mess with another track to hear for sure that Google Colab can't currently do yet. It will probably have to be when the final release is finished.
 
OK, so I've been playing with the new beta release just posted and I'm finding that the vocal removal isn't quite on par with the current Google Colab results. I tried all 3 models and it appears that for the song I already have done using the Google Colab method, I don't really hear any improvement differences. That's not saying it's useless. It's saying I will have to mess with another track to hear for sure that Google Colab can't currently do yet. It will probably have to be when the final release is finished.

Sorry, it's been hard squeezing in time to update this thread. Have you been setting the sr and hop length accordingly based on the model you're using?
 
Sorry, it's been hard squeezing in time to update this thread. Have you been setting the sr and hop length accordingly based on the model you're using?

LOL yes. I already had Stan Bush - Dare done using Google Colab. Seemed like I heard more vocal residue with the new beta release when I ran the same song again.
 
- MGM-32000-512.pth - This model is very good at capturing lower end frequencies. So, on tracks that convert poorly on all of the other models should come out well on this one.
- MGM-44100-512.pth - My tests of this model have shown this one to actually be the best one I've done to date.

If you used the GoogleColab version, what command did you use to do the inference?
 
They are great sounding models, though I have not had a ton of time to test them yet. Life stuff is finally settling down now, so I just gotta wait for my headphones to get back from the shop so I can listen to the conversion results as best as possible.

Curiously though, (at least through my speakers), I felt like 32000-512 did the best job of the three on Rush - The Pass, though it was somewhat close with 44100-512 and 44100-1024 did the worst by a real margin. Geddy is a very high singer, so I found this interesting. I'm looking forward to trying some low range stuff though once my headphones are fixed :)
 
- MGM-32000-512.pth - This model is very good at capturing lower end frequencies. So, on tracks that convert poorly on all of the other models should come out well on this one.
- MGM-44100-512.pth - My tests of this model have shown this one to actually be the best one I've done to date.

If you used the GoogleColab version, what command did you use to do the inference?

On Google Colab I usually use the multi model and the NP model for a comparison for most tracks. on the GUI I do run all 3 models to see which sounds best and usually Google Colab always comes out better for some reason. Or Google and GUI will come out the same in some cases. I'm not complaining as I'm sure due to single mixed tracks it'll never be 100% like the studio. But for the most part Google Colab has helped me in my needs the best. For anyone wanting cleaner stems run a track through the vocal remover first, then run the instrumental through demucs. I find the stems much cleaner doing that.
 
Last edited:
[MENTION=10167]NewAgeRipper[/MENTION] - I can't respond to your message. Says you exceeded your storage.
 
***GUI BETA UPDATE***

===========================================================================

Bug Fixes -

~The application no longer cuts name off of some filenames after conversions.
~ Application now accepts all file types compatible with ffmpeg
~ (please install ffmpeg prior to running anything other than a wav file)

Changes -

~ The application will now read model parameters from filename (if present)
~ For example, a model with the filename "MGM-LOWEND_sr32000_hl512_w512_nf2048" will automatically fill the SR, HOP LENGTH, WINDOW SIZE, & N_FFT values
~ If the filename was "MGM-LOWEND", the SR, HOP LENGTH, WINDOW SIZE, & N_FFT values will auto-populate with the defaults.
~ The application reads these values from the following portion of the file "_sr32000_hl512_w512_nf2048"

~ A new option called "Model Test Mode" has been added.
~ This option is meant to make it easier for users to test the results of different models without having to manually create new folders and/or change the filenames.
~ When it's selected, the application will automatically generate a new folder with the name of the selected model in the "Save to" path you have chosen.
~ The completed files will have the selected model name appended to it and be saved to the auto-generated folder.

Here's a list of the models (PLEASE DO NOT CHANGE THE NAME OF THE FIRST 2 MODELS LISTED AS THE PARAMETERS ARE SPECIFIED IN THE FILENAMES!):

- MGM-LOWEND_sr32000_hl512_w512_nf2048.pth - This model is good at capturing vocals on the low end of the spectrogram.
- MGM-44100-512_sr44100_hl512_w512_nf2048.pth - This is a multi-genre model that was trained with a hop length size of 512. It's debatably the best model of this group.
- MGM-44100-1024.pth - This is a multi-genre model trained with basic parameters.

Link: http://www.mediafire.com/file/q5xefq7mg905tro/V4GUI-BETAv1102.zip/file

===========================================================================

Here's the beta release for the Ultimate Vocal Remover GUI v4 with 3 brand new models!

Install Instructions:

1. Install Python via the following link and make sure to check the box that says "Add Python 3.6 to PATH" - https://www.python.org/ftp/python/3.6.8/python-3.6.8-amd64.exe
2. Open the cmd prompt and run the following -

pip install Pillow
pip install tqdm==4.30.0
pip install librosa==0.6.3
pip install opencv-python
pip install numba==0.48.0
pip install SoundFile
pip install soundstretch
pip install torch==1.6.0+cu101 torchvision==0.7.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html

3. Download Ultimate Vocal Remover GUI v4 Beta here - http://www.mediafire.com/file/nybrcjk6m9gq6ss/V4GUI-BETA.zip/file
4. Extract the V4GUI-BETA folder to your documents from the zip file
5. Now go into the V4GUI-BETA folder and double click the VocalRemover_v4.py to verify it works
6. Create and place a shortcut for the VocalRemover_v4.py file to your desktop for easy access

***TROUBLESHOOTING***

- If the VocalRemover_v4.py file won't open under any circumstances, please do the following

1. Open the cmd prompt from the V4GUI-BETA directory
2. Run the following - python VocalRemover_v4.py
3. Copy and paste the error in the cmd prompt to the technical channel for further assistance
 
Last edited:
***GUI BETA UPDATE***

===========================================================================

Bug Fixes -

~The application no longer cuts name off of some filenames after conversions.
~ Application now accepts all file types compatible with ffmpeg
~ (please install ffmpeg prior to running anything other than a wav file)

Changes -

~ The application will now read model parameters from filename (if present)
~ For example, a model with the filename "MGM-LOWEND_sr32000_hl512_w512_nf2048" will automatically fill the SR, HOP LENGTH, WINDOW SIZE, & N_FFT values
~ If the filename was "MGM-LOWEND", the SR, HOP LENGTH, WINDOW SIZE, & N_FFT values will auto-populate with the defaults.
~ The application reads these values from the following portion of the file "_sr32000_hl512_w512_nf2048"

~ A new option called "Model Test Mode" has been added.
~ This option is meant to make it easier for users to test the results of different models without having to manually create new folders and/or change the filenames.
~ When it's selected, the application will automatically generate a new folder with the name of the selected model in the "Save to" path you have chosen.
~ The completed files will have the selected model name appended to it and be saved to the auto-generated folder.

Here's a list of the models (PLEASE DO NOT CHANGE THE NAME OF THE FIRST 2 MODELS LISTED AS THE PARAMETERS ARE SPECIFIED IN THE FILENAMES!):

- MGM-LOWEND_sr32000_hl512_w512_nf2048.pth - This model is good at capturing vocals on the low end of the spectrogram.
- MGM-44100-512_sr44100_hl512_w512_nf2048.pth - This is a multi-genre model that was trained with a hop length size of 512. It's debatably the best model of this group.
- MGM-44100-1024.pth - This is a multi-genre model trained with basic parameters.

Link: http://www.mediafire.com/file/q5xefq7mg905tro/V4GUI-BETAv1102.zip/file

===========================================================================

Here's the beta release for the Ultimate Vocal Remover GUI v4 with 3 brand new models!

Install Instructions:

1. Install Python via the following link and make sure to check the box that says "Add Python 3.6 to PATH" - https://www.python.org/ftp/python/3.6.8/python-3.6.8-amd64.exe
2. Open the cmd prompt and run the following -

pip install Pillow
pip install tqdm==4.30.0
pip install librosa==0.6.3
pip install opencv-python
pip install numba==0.48.0
pip install SoundFile
pip install soundstretch
pip install torch==1.6.0+cu101 torchvision==0.7.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html

3. Download Ultimate Vocal Remover GUI v4 Beta here - http://www.mediafire.com/file/nybrcjk6m9gq6ss/V4GUI-BETA.zip/file
4. Extract the V4GUI-BETA folder to your documents from the zip file
5. Now go into the V4GUI-BETA folder and double click the VocalRemover_v4.py to verify it works
6. Create and place a shortcut for the VocalRemover_v4.py file to your desktop for easy access

***TROUBLESHOOTING***

- If the VocalRemover_v4.py file won't open under any circumstances, please do the following

1. Open the cmd prompt from the V4GUI-BETA directory
2. Run the following - python VocalRemover_v4.py
3. Copy and paste the error in the cmd prompt to the technical channel for further assistance

To my ears the new models sound like they give the same results as the last ones. The 44100-1024 doesn't auto set to that in the SR and HOP. It sets to 33075-384 but you can change it manually still. The other 2 auto set as intended.
 
will you update it on Colab Google soon?

We're still trying to determine if there is really much difference in improvement to warrant updating Google Colab. I need [MENTION=39673]Anjok[/MENTION] to tell me the songs he's tested that make the 44100-512 model better than previously to compare between the 2. Also no one else has been giving any feedback on this really.
 
I'm still waiting for my headphones. Can't really test anything until I get them back. My speakers are pretty good, but it's hard to hear the small differences on tracks that convert well. Need the headphones for that.