Abstract
A method for improving a transcription may include identifying, in the transcription, reliable channel tokens of an utterance of a reliable channel and an unreliable channel token of an utterance of an unreliable channel, and generating, using a machine learning model, a vector embedding for the unreliable channel token and vector embeddings for the reliable channel tokens. The method may further include calculating vector distances between the vector embedding and the vector embeddings, and generating, for the unreliable channel token and using the vector distances, a score corresponding to a reliable channel token. The method may further include determining that the score is within a threshold score, and in response to determining that the score is within the threshold score, replacing, in the transcription, the unreliable channel token with the reliable channel token.
| Original language | English |
|---|---|
| Patent number | US11170765 |
| IPC | H04M 3/ 50 A I |
| Priority date | 24/01/20 |
| State | Published - 29 Jul 2021 |
Fingerprint
Dive into the research topics of 'CONTEXTUAL MULTI-CHANNEL SPEECH TO TEXT'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver