# sparkastML Datasets This repository contains datasets published by the sparkastML project. ## Translation ZH-EN This dataset features high-quality, fresh synthetic data comprising over 100,000 sentences of Chinese-English parallel corpora. ### Details - **Version:** 1 - **Last Update:** 2024/09/16 ### Download Links - **Google Drive:** [Download from Google Drive](https://drive.google.com/drive/folders/1_ADblZcB5p9BUvawkYDmp1qIUDZgkkoe?usp=sharing) - **IPFS:** [Download from IPFS](https://ipfs.a2x.pub/ipfs/QmYz4ew4nSzPc6TZvoWk6jXpGN82qt3J46nwfb75N2YKc4/)