sparkastML/dataset/public/README.md
2024-09-16 17:29:12 +08:00

14 lines
427 B
Markdown

# sparkastML Datasets
Here are the datasets published by sparkastML project.
## Translation ZH-EN
High-quality, fresh synthetic data containing over 100,000 sentences of Chinese-English parallel corpora.
Version: 1
Last Update: 2024/09/16
[Google Drive](https://drive.google.com/drive/folders/1_ADblZcB5p9BUvawkYDmp1qIUDZgkkoe?usp=sharing)
[IPFS](https://ipfs.a2x.pub/ipfs/QmYz4ew4nSzPc6TZvoWk6jXpGN82qt3J46nwfb75N2YKc4/)