Answer the question
In order to leave comments, you need to log in
How to implement auto-detection of phrase intervals in an audio book?
How to automatically detect phrase intervals (start-end seconds) in an audiobook (no background sounds, only speech and silence). I did not find such a function in the existing software for creating subtitles. In English, for queries: "audio segmentation" , "speach activity" and the like, I also did not find anything particularly useful.
If you have to implement this algorithm, in what environment is it better to do it? I need access to the volume of the sound and the presence of a sound signal at any given time, I think. And then, by setting the parameters of the pause between words (the level of silence and its duration), it will be possible to automatically determine the timing of each individual phrase. What environment is more convenient to implement this?
Answer the question
In order to leave comments, you need to log in
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question