Speech recognition
tatarrr95, 2019-08-13 04:20:19

How to recognize speech by separating voices in single channel audio?

The task: I have single-channel WAV recordings of a conversation between two people. I need to convert the speech to text while attributing each part to its speaker. I currently use Yandex SpeechKit, but it transcribes all the voices into one continuous text. How can I split the text by speaker automatically? The programming language doesn't matter.


1 answer(s)
tatarrr95, 2019-08-14
@tatarrr95

Google Cloud Speech-to-Text can do this; the feature is called diarization. I'm attaching a link to an example in case it helps someone.
https://cloud.google.com/speech-to-text/docs/multi...
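For anyone who wants a starting point, here is a rough sketch of how diarization could be requested through the `google-cloud-speech` Python client and how the tagged words might be grouped into speaker turns. Treat it as an illustration under assumptions, not a tested recipe: the function names `group_by_speaker` and `transcribe_with_diarization` are my own, the `language_code` and `sample_rate_hertz` defaults are placeholders you would need to match to your recordings, and diarization support varies by language. It also assumes credentials are already configured and that the clip is short enough for the synchronous `recognize` call.

```python
from itertools import groupby


def group_by_speaker(tagged_words):
    """Collapse (word, speaker_tag) pairs into consecutive speaker turns.

    Hypothetical helper: returns a list of (speaker_tag, text) tuples,
    starting a new turn each time the speaker tag changes.
    """
    turns = []
    for tag, run in groupby(tagged_words, key=lambda pair: pair[1]):
        turns.append((tag, " ".join(word for word, _ in run)))
    return turns


def transcribe_with_diarization(wav_path, language_code="en-US",
                                sample_rate_hertz=8000):
    """Sketch: transcribe a mono WAV file and split the text by speaker.

    Assumes 16-bit PCM audio; language and sample rate are placeholders.
    """
    # Imported here so the grouping helper works without the client library.
    from google.cloud import speech

    client = speech.SpeechClient()
    with open(wav_path, "rb") as f:
        audio = speech.RecognitionAudio(content=f.read())

    config = speech.RecognitionConfig(
        encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
        sample_rate_hertz=sample_rate_hertz,
        language_code=language_code,
        diarization_config=speech.SpeakerDiarizationConfig(
            enable_speaker_diarization=True,
            min_speaker_count=2,  # two people in the conversation
            max_speaker_count=2,
        ),
    )

    response = client.recognize(config=config, audio=audio)
    # With diarization enabled, the last result carries every word
    # together with its speaker tag.
    words = response.results[-1].alternatives[0].words
    return group_by_speaker((w.word, w.speaker_tag) for w in words)
```

Calling `transcribe_with_diarization("call.wav")` (a made-up file name) would return a list of turns such as `(1, "...")`, `(2, "...")`, which you can print one per line to get a dialogue-style transcript.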
