overlapping speech #434

Answered by jongwook
Majdoddin asked this question in Q&A
Oct 29, 2022 · 2 comments · 2 replies
This is a limitation of the model: the training data often had a transcription for only one speaker and treated other voices as background noise. You might be able to nudge the model toward overlapping transcription by prompting it with what each speaker would have been saying 30 seconds earlier, potentially with speaker labels, like --prefix "[Bob] So I was saying that [Alice] But there was always this". Please note that [ and ] are suppressed from the output by default, and you'll need to edit this line to re-enable them:

symbols = list("\"#()*+/:;<=>@[\\]^_`{|}~「」『』")
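As a rough illustration of that edit, the sketch below starts from the list quoted above and drops the two square brackets so they are no longer in the suppressed set. The name `unsuppressed` is hypothetical and not part of Whisper; in practice you would make the equivalent change directly on the quoted line in your local copy of the source, and whether the model then actually emits speaker labels is not guaranteed.

```python
# Default list of suppressed symbols, copied from the line quoted above.
symbols = list("\"#()*+/:;<=>@[\\]^_`{|}~「」『』")

# Hypothetical helper set: the characters we want the model to be allowed to emit,
# so that speaker labels such as "[Bob]" are not filtered out.
unsuppressed = {"[", "]"}

# Keep every other symbol suppressed; only the square brackets are removed.
symbols = [s for s in symbols if s not in unsuppressed]

print("".join(symbols))
```

Filtering the list, rather than retyping the string by hand, makes it harder to drop or mis-escape one of the other symbols by accident.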

If you have a dataset of overlapping speech (or create one yourself b…

Answer selected by jongwook
5 participants