Feature: AudioCutting feature extractor
See original GitHub issueIs it possible right now to use CutSet to cut all recordings into separate wav files and dump these files on disk? If not, would it be a considerable feature?
It could be used in a similar way to feature extraction.
recording_set = RecordingSet.from_yaml('audio.yml')
audio_cutter = AudioCutter()
with AudioWriter() as storage:
builder = FeatureSetBuilder(feature_extractor=audio_cutter, storage=storage)
feature_set = builder.process_and_store_recordings(
recordings=recording_set,
num_jobs=8
)
or the cutset would simply have a function CutSet.cut_audio(path)
or similar.
The audio cuts could be stored in subfolders of path based eg on recording name. Along with the wav cuts, there would also be a manifest file for the Cuts. Let me know if this would be of interest or if something similar is already possible. Thanks
Issue Analytics
- State:
- Created 3 years ago
- Comments:6 (2 by maintainers)
Top Results From Across the Web
Streamline audio feature extraction - MATLAB - MathWorks
audioFeatureExtractor encapsulates multiple audio feature extractors into a streamlined and modular implementation.
Read more >Audio Feature Extraction - Devopedia
It deals with the processing or manipulation of audio signals. It removes unwanted noise and balances the time-frequency ranges by converting ...
Read more >Sound Feature Extraction - - Mael Fabien
Sound features can be used to detect speakers, detect the gender, the age, diseases and much more through the voice. To extract features,...
Read more >Audio Feature Extraction | Thecleverprogrammer
Feature Extraction is the process of reducing the number of features in the data by creating new features using the existing ones. The...
Read more >wxjiao/Multimodal-Feature-Extraction - GitHub
A detailed description on how to extract and align text, audio, and video features at word-level. 1. Text-Audio Alignment. - Extract Audio Track...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Agree, sorry for late response. I was quite busy before Christmas. I can give it a try, but probably in a few weeks from now.
I think we can mimic the approach we took in saving the feature matrices to files, i.e. use the first three symbols from the Cut ID as a name for the sub-directory, and store the new recording (with name == cut ID) in it.
Maybe we can save it as a FLAC rather than WAV to save some space - I think it’s supported by soundfile out-of-the-box.
So e.g. for cuts with IDs:
we’d create the following structure:
my_output_dir |— abc | |— abc12345.flac | |— abc34576.flac |— abd |— abd23452.flac
You can find code that does a similar thing in the
LilcomFilesWriter
class.