.. | ||
README.md |
sparkastML Datasets
This repository contains datasets published by the sparkastML project.
Translation ZH-EN
This dataset features high-quality, fresh synthetic data comprising over 100,000 sentences of Chinese-English parallel corpora.
Details
- Source Language: Chinese (Simplified)
- Target Language: English
- Version: 1
- Last Update: 2024/09/16
- LICENSE: CC-BY 4.0
Download
- Google Drive: Download from Google Drive
- IPFS: Download from IPFS
- CID:
QmYz4ew4nSzPc6TZvoWk6jXpGN82qt3J46nwfb75N2YKc4
- CID:
- GitHub Release: Go to Release Page