Yehor/w2v-xls-r-uk

automatic speech recognitiontransformersuktransformerssafetensorswav2vec2automatic-speech-recognitionukdataset:mozilla-foundation/common_voice_10_0apache-2.0
358.2K

🚨🚨🚨 ATTENTION! 🚨🚨🚨

Use an updated model: https://huggingface.co/Yehor/w2v-bert-uk-v2.1


Community

See other Ukrainian models: https://github.com/egorsmkv/speech-recognition-uk

Evaluation results

Metrics (float16) using evaluate library with batch_size=1:

  • WER: 0.2024 metric, 20.24%
  • CER: 0.0364 metric, 3.64%
  • Accuracy on words: 79.76%
  • Accuracy on chars: 96.36%
  • Inference time: 63.4848 seconds
  • Audio duration: 16665.5212 seconds
  • RTF: 0.0038

Cite this work

@misc {smoliakov_2025,
	author       = { {Smoliakov} },
	title        = { w2v-xls-r-uk (Revision 55b6dc0) },
	year         = 2025,
	url          = { https://huggingface.co/Yehor/w2v-xls-r-uk },
	doi          = { 10.57967/hf/4556 },
	publisher    = { Hugging Face }
}
DEPLOY IN 60 SECONDS

Run w2v-xls-r-uk on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.