Mel-spectrum of 29.1s segment
See original GitHub issueHi, thanks for the effort.
I try to use the Mel-spectrum downloaded from gdrive to run the baseline but found that the downloaded files are full song. As a result, I try to run scripts/melspectrograms.py to get Mel-Spectrogram of 29.1s segment. However, I kept getting the error below:
RuntimeError: Error while configuring MelBands: Parameter normalize = "unit_tri" is not within specified range: {unit_sum,unit_max}
May I ask what did I miss? Thanks for the help.
Issue Analytics
- State:
- Created 4 years ago
- Comments:5 (1 by maintainers)
Top Results From Across the Web
Minz Won - Tesis Doctorals en Xarxa
step, a 29.1s audio segment is converted to a 96 × 1366 mel spectrogram. It is then used as an input and is...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found

@annahung31 We have updated or PyPi wheels with the newest version of Essentia. Install or upgrade to the latest Essentia from pip and you should be able to run the spectrogram extraction code without a problem.
To avoid recomputing all spectrograms, I made a small change to the dataset, so that every file is now cropped to the desired shape of (96, 1366). This is probably not the best way to do it, but it works. The getitem method looks like:
There is another change needed in the model, because now the batches have shape
(batch, channels, width, height), so no need to unsqueeze.