How can I recognize speech and separate the speakers in single-channel audio?
The task: I have single-channel WAV recordings of conversations between two people. I need to convert the speech to text while attributing each piece of text to the person who said it. I currently use Yandex SpeechKit, but it transcribes all the voices into one continuous text. How can I split the text by speaker automatically? The programming language does not matter.
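To make the desired result concrete, here is a minimal sketch of the labeling step only. It assumes two inputs I do not have yet: per-word timestamps from the recognizer and a list of speaker turns from some separate speaker-separation (diarization) step. The `Turn` and `Word` types and the function names are hypothetical and are not part of any particular library or of Yandex SpeechKit.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """A stretch of audio attributed to one speaker (hypothetical input)."""
    speaker: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class Word:
    """A recognized word with timestamps (hypothetical input)."""
    text: str
    start: float  # seconds
    end: float    # seconds


def overlap(word, turn):
    """Length in seconds of the time shared by a word and a turn."""
    return max(0.0, min(word.end, turn.end) - max(word.start, turn.start))


def gap(word, turn):
    """Silence in seconds between a word and a turn (0 if they touch)."""
    return max(0.0, turn.start - word.end, word.start - turn.end)


def label_words(words, turns):
    """Pair each word with the speaker whose turn overlaps it most.

    A word that overlaps no turn goes to the nearest turn in time.
    """
    labeled = []
    for word in words:
        best = max(turns, key=lambda t: (overlap(word, t), -gap(word, t)))
        labeled.append((best.speaker, word.text))
    return labeled


def to_transcript(labeled):
    """Merge consecutive words of the same speaker into one line each."""
    lines = []
    for speaker, text in labeled:
        if lines and lines[-1][0] == speaker:
            lines[-1][1].append(text)
        else:
            lines.append((speaker, [text]))
    return [f"{speaker}: {' '.join(texts)}" for speaker, texts in lines]
```

Given such inputs, `to_transcript(label_words(words, turns))` would return one line per speaker turn, which is the output format I am after. The open part of the question is how to obtain the speaker turns automatically.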