Detecting and cutting out voice in a (telephone) audio file: which Python library to use?
The task is to cut the voice in a telephone conversation into separate short WAV files. I have already separated the caller and receiver channels; now the question is how to cut the voice in each channel into segments. Is there a Python library for this?
Thanks
I did it without a library. You can compute the energy of the signal in a window: if the energy exceeds a certain threshold, the window contains voice. Telephone audio is often noisy, so it makes sense to first center the signal in the window, i.e. subtract the window's mean from each sample.
More or less like this:
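# signal is assumed to be a NumPy array of samples; i is the window start index
window = signal[i:(i+win_len)]
# center the window, then sum the squares to get its energy
energy = ((window - window.mean()) ** 2).sum()
voice = energy > threshold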
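To get from a per-window decision to actual segments, the same check can be run over the whole channel and consecutive voiced windows merged. Below is a minimal sketch of that idea, not the exact code I used: the function name find_voice_segments, the non-overlapping windows, and the values of win_len and threshold are all assumptions that you would need to adjust for your recordings.

```python
import numpy as np

def find_voice_segments(signal, win_len, threshold):
    """Return (start, end) sample indices of runs of windows whose
    centered energy exceeds threshold. Hypothetical helper: windows
    do not overlap, and a trailing partial window is ignored."""
    signal = np.asarray(signal, dtype=np.float64)
    n_windows = len(signal) // win_len
    segments = []
    start = None  # start sample of the segment currently being built
    for k in range(n_windows):
        window = signal[k * win_len:(k + 1) * win_len]
        # Center the window, then sum the squares to get its energy
        energy = ((window - window.mean()) ** 2).sum()
        voiced = energy > threshold
        if voiced and start is None:
            start = k * win_len                      # a segment begins
        elif not voiced and start is not None:
            segments.append((start, k * win_len))    # a segment ends
            start = None
    if start is not None:
        # The signal ended while a segment was still open
        segments.append((start, n_windows * win_len))
    return segments
```

Each (start, end) pair can then be used to slice the array, signal[start:end], and the slice written to its own WAV file, for example with scipy.io.wavfile.write. In practice you will probably also want to merge segments separated by very short pauses.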