Reverse MelScale, Griffin-Lim transformation
See original GitHub issue🚀 Feature
I have implemented PyTorch versions of librosa
’s mel_to_stft
and griffinlim
, which I believe would be useful for any generation task using mel-spectrograms. You can see it as a part of my Melnet implementation here: https://github.com/jaeyeun97/MelNet/blob/master/audio.py. While it currently does not conform to the standards of this repo with a little touch here and there I think it would be a good addition.
Motivation
-
librosa
hasmel_to_stft
, which solves for a spectrogram of linear frequency scale from a Mel scale spectrogram. Having a pure PyTorch implementation would remove the need of alibrosa
dependency and add support for GPU processing (which speeds up the processes significantly) -
Similarly, Griffin-Lim algorithm provides an approximation for phase in order to provide a better spectrogram inversion, and since this project provides an implementation for
istft
I thought I would use it to port thegriffinlim
function fromlibrosa
.
Pitch
I would like to add the code for Griffin-Lim and mel-to-linear scaling I referenced above to this repository.
I don’t know why this hasn’t been done before–was there any license issue, etc. regarding porting the code? Otherwise, I think this would be a relatively a painless merge into the repository.
Issue Analytics
- State:
- Created 4 years ago
- Reactions:2
- Comments:6 (5 by maintainers)
Top GitHub Comments
Thanks again for working on this @jaeyeun97! #448 merged the second request made in this issue.
I usually separate, but please do what’s convenient for you for this 😃