Speech Recognition for Uyghur using deep learning
Go to file
2024-10-20 03:36:48 +08:00
assets ref: re-organize project structure 2024-10-20 03:36:48 +08:00
test ref: re-organize project structure 2024-10-20 03:36:48 +08:00
.gitignore ref: re-organize project structure 2024-10-20 03:36:48 +08:00
data.py ref: re-organize project structure 2024-10-20 03:36:48 +08:00
README.md ref: re-organize project structure 2024-10-20 03:36:48 +08:00
requirements.txt ref: re-organize project structure 2024-10-20 03:36:48 +08:00
thuyg20_test.csv Add files via upload 2021-06-14 16:56:51 +09:00
thuyg20_train.csv Add files via upload 2021-06-14 16:56:51 +09:00
tonu.py ref: re-organize project structure 2024-10-20 03:36:48 +08:00
train.py ref: re-organize project structure 2024-10-20 03:36:48 +08:00
UModel.py ref: re-organize project structure 2024-10-20 03:36:48 +08:00
uyghur.py Add files via upload 2021-06-14 16:56:51 +09:00

Agnlash

A ASR(Automatic Speech Recognition) model for Uyghur language.

This project is forked from uyghur-asr-ctc.

The Anglash is fine-tuned on the CommonVoice dataset which contains 313 hours of data.

The original project uses A free Uyghur speech database Released by CSLT@Tsinghua University & Xinjiang University. This dataset contains 22.45 hours of data.