How to recognize speech and separate the speakers in single-channel audio?

Speech recognition

tatarrr95, 2019-08-13 04:20:19

The task: I have single-channel WAV recordings of a conversation between two people. I need to transcribe the speech to text, with each part of the text attributed to the person who said it. I currently use Yandex SpeechKit, but it transcribes all the voices into one continuous text. How can I split the text by speaker automatically? The programming language does not matter.

1 answer(s)

tatarrr95, 2019-08-14

Google Cloud Speech-to-Text can do this; the feature is called speaker diarization. I'm attaching a link to an example, maybe it will help someone.
https://cloud.google.com/speech-to-text/docs/multi...
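
For anyone who lands here later, below is a rough sketch of what a diarization request might look like with the google-cloud-speech Python client. It is not taken from the linked page. The assumptions: the client library is installed and credentials are configured, the file is a short mono LINEAR16 WAV (synchronous recognition only accepts roughly a minute of audio), and the language_code and sample_rate_hertz defaults are placeholders to replace with your own values. Diarization is not available for every language, so check the supported-language list first. The function names group_words_by_speaker and transcribe_with_diarization are made up for this example.

```python
from itertools import groupby


def group_words_by_speaker(tagged_words):
    """Merge consecutive (word, speaker_tag) pairs into (speaker_tag, text) turns."""
    turns = []
    for tag, run in groupby(tagged_words, key=lambda pair: pair[1]):
        turns.append((tag, " ".join(word for word, _ in run)))
    return turns


def transcribe_with_diarization(wav_path, language_code="en-US", sample_rate_hertz=8000):
    """Transcribe a short mono WAV file and return the text split into speaker turns.

    language_code and sample_rate_hertz are placeholder defaults (assumptions);
    set them to match the actual recording.
    """
    # Imported here so the grouping helper above works without the client library.
    from google.cloud import speech

    client = speech.SpeechClient()
    with open(wav_path, "rb") as f:
        audio = speech.RecognitionAudio(content=f.read())

    config = speech.RecognitionConfig(
        encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
        sample_rate_hertz=sample_rate_hertz,
        language_code=language_code,
        diarization_config=speech.SpeakerDiarizationConfig(
            enable_speaker_diarization=True,
            min_speaker_count=2,  # the recordings contain two people
            max_speaker_count=2,
        ),
    )
    response = client.recognize(config=config, audio=audio)

    # With diarization enabled, the last result lists every word with its speaker tag.
    words = response.results[-1].alternatives[0].words
    return group_words_by_speaker([(w.word, w.speaker_tag) for w in words])
```

The grouping step can be tried without any API call: group_words_by_speaker([("hello", 1), ("there", 1), ("hi", 2)]) returns [(1, "hello there"), (2, "hi")]. The speaker tags are just numbers, so deciding which number corresponds to which person is still up to you.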