2:00 PM - 2:20 PM
[4N3-GS-7-01] Error Correction for Japanese Speech Recognition by Combining N-best Hypotheses and Large Language Models
Keywords: ASR, Error Correction, LLM
Our company provides services built on speech recognition, and the accuracy of that recognition is essential to deploying them successfully. Among the many methods for correcting speech recognition output, this study focuses on using large language models (LLMs) to score the N-best hypotheses generated by a speech recognition model and thereby correct Japanese speech recognition results. Scoring the hypotheses in this way improved both WER (word error rate) and CER (character error rate).
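The abstract does not say how the LLM scores are combined with the recognizer's own scores, so the following is only a minimal sketch of one common form of N-best rescoring, not the authors' method. It assumes each hypothesis carries an ASR log-score and that the two scores are interpolated with a weight. The names `rescore_nbest`, `toy_llm_score`, and `weight`, as well as all scores and sentences, are hypothetical. In practice the scorer would be an LLM log-likelihood, and WER for Japanese would need a word segmenter, which is assumed here to have been applied already.

```python
from typing import Callable, List, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: character edits divided by reference length."""
    return edit_distance(list(ref), list(hyp)) / len(ref)


def wer(ref_words: Sequence[str], hyp_words: Sequence[str]) -> float:
    """Word error rate over pre-segmented word lists."""
    return edit_distance(ref_words, hyp_words) / len(ref_words)


def rescore_nbest(nbest: List[Tuple[str, float]],
                  llm_score: Callable[[str], float],
                  weight: float = 0.5) -> str:
    """Return the hypothesis with the best interpolated score.

    nbest holds (text, asr_log_score) pairs; llm_score returns a
    log-likelihood-style score where higher means more plausible.
    The linear interpolation is an assumption made for illustration.
    """
    best_text, _ = max(
        nbest,
        key=lambda item: (1.0 - weight) * item[1] + weight * llm_score(item[0]),
    )
    return best_text


# Hypothetical 2-best list: the recognizer slightly prefers a homophone error.
reference = "今日は天気がいい"
nbest = [("今日は転機がいい", -1.0), ("今日は天気がいい", -1.2)]

# Stand-in for a real LLM scorer (made-up values, for illustration only).
_toy_scores = {"今日は転機がいい": -9.0, "今日は天気がいい": -4.0}


def toy_llm_score(text: str) -> float:
    return _toy_scores[text]


corrected = rescore_nbest(nbest, toy_llm_score)
print("1-best CER:  ", cer(reference, nbest[0][0]))
print("rescored CER:", cer(reference, corrected))
```

In this toy case the recognizer's top hypothesis contains a homophone error, and the interpolated score selects the second hypothesis instead. Setting `weight=0.0` falls back to the recognizer's own ranking.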