Poor quality transcripts with V3 for Dutch #1843
Replies: 2 comments 1 reply
-
got any updates regarding improvements on Dutch? |
Beta Was this translation helpful? Give feedback.
-
Can you share a sample audio/video file we can test with? |
Beta Was this translation helpful? Give feedback.
-
Hi!
I have currently switched back to V2 as the output for V3 is very poor for my use case. I mainly use whisper to transcribe interviews. V2's performance was pretty good in most cases for Teams calls recordings, but the output for V3 is pretty much useless..
V3 seems to get stuck in loops a lot more often:
It also introduces new hallucinations in multiple languages, even though the language has been specified to be Dutch. I have never encountered this issue with V2:
This has been happening with every Teams recording I feed to whisper.
The benchmarks suggest that performance for Dutch should be a lot better with V3.
I am currently not tweaking any hyperparameters and run whisper from python like so:
whisper.transcribe(model, audio, verbose=True, fp16=False, language="dutch")
Anyone else experiencing these issues?
Beta Was this translation helpful? Give feedback.
All reactions