sparkastML/dataset/public/README.md
2024-09-17 20:20:47 +08:00

865 B

sparkastML Datasets

This repository contains datasets published by the sparkastML project.

Translation ZH-EN

This dataset features high-quality, fresh synthetic data comprising over 100,000 sentences of Chinese-English parallel corpora.

Details

  • Source Language: Chinese (Simplified)
  • Target Language: English
  • Version: 1
  • Last Update: 2024/09/16
  • LICENSE: CC-BY 4.0

Download