# sparkastML Datasets
This repository contains datasets published by the sparkastML project.
## Translation ZH-EN
This dataset is a high-quality, recently generated synthetic Chinese-English parallel corpus of over 100,000 sentences.
### Details
- **Source Language:** Chinese (Simplified)
- **Target Language:** English
- **Version:** 1
- **Last Update:** 2024/09/16
- **License:** [CC-BY 4.0](https://creativecommons.org/licenses/by/4.0/)
### Download
- **Google Drive:** [Download from Google Drive](https://drive.google.com/drive/folders/1_ADblZcB5p9BUvawkYDmp1qIUDZgkkoe)
- **IPFS:** [Download from IPFS](https://ipfs.a2x.pub/ipfs/QmYz4ew4nSzPc6TZvoWk6jXpGN82qt3J46nwfb75N2YKc4/)
  - CID: `QmYz4ew4nSzPc6TZvoWk6jXpGN82qt3J46nwfb75N2YKc4`
- **GitHub Release:** [Go to Release Page](https://github.com/alikia2x/sparkastML/releases/tag/v1-dataset)
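
IPFS content is addressed by its CID, so the same folder should in principle be reachable through any gateway that serves the path form `/ipfs/<CID>/`. The following is a minimal Python sketch of building such URLs. The hosts `ipfs.io` and `dweb.link` are public gateways assumed here for illustration and are not mentioned by this repository, so availability through them is not guaranteed. The function names are hypothetical.

```python
# CID of the dataset folder, as listed in the Download section.
CID = "QmYz4ew4nSzPc6TZvoWk6jXpGN82qt3J46nwfb75N2YKc4"

# The first host is the gateway linked in this README.
# The other two are assumptions: public gateways that may or may not
# have the content available at any given time.
GATEWAYS = ["ipfs.a2x.pub", "ipfs.io", "dweb.link"]


def gateway_url(cid: str, host: str) -> str:
    """Build a path-style gateway URL for a CID that points to a folder."""
    return f"https://{host}/ipfs/{cid}/"


def gateway_urls(cid: str, hosts=GATEWAYS) -> list:
    """Return one candidate URL per gateway host, in the order given."""
    return [gateway_url(cid, host) for host in hosts]


if __name__ == "__main__":
    for url in gateway_urls(CID):
        print(url)
```

If one gateway is slow or unreachable, the other URLs can be tried in turn. A local IPFS node, if one is running, can also fetch the folder directly by CID.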