Commit Graph

33 Commits

Author SHA1 Message Date
fdff155673
update: idk 2024-11-16 21:51:19 +08:00
c4ca9c7d4f
ref: clean forced-alignment 2024-11-07 03:14:47 +08:00
65123d1b39
add: full example of forced alignment for music 2024-11-03 01:11:45 +08:00
aeab34f84b
add: forced alignment example 2024-11-02 15:46:12 +08:00
37d2507f10
update: latest synthetic data script 2024-10-07 23:15:25 +08:00
33754146c8
add: text-difficulty/grammar 2024-10-02 21:11:23 +08:00
ae6f10a6f0
add: open set validation 2024-09-28 21:53:55 +08:00
bf2c9a393a
update: add metadata export of intention classify 2024-09-26 22:57:27 +08:00
853d158c41
update: README for translate 2024-09-23 21:30:44 +08:00
9f071ee0a0
ref: the intention-classification model 2024-09-22 03:58:56 +08:00
66cf093177
add: dataset quality check 2024-09-20 00:53:51 +08:00
237d2f5c96
ref: remove unnecessary file 2024-09-19 22:05:05 +08:00
01597c298d
update: evaluation 2024-09-19 22:03:54 +08:00
435faa4b92
update: README 2024-09-17 20:44:55 +08:00
580753bb6f
update: README 2024-09-17 20:20:47 +08:00
6500e378be
add: translation evaluation 2024-09-17 20:07:47 +08:00
3bb222bda1
update: README 2024-09-16 17:40:11 +08:00
3ebeaf4655
update: readme 2024-09-16 17:34:13 +08:00
932cbd4336
add: dataset 2024-09-16 17:29:12 +08:00
a9a7430a58
update: fetching with cooldown
fix: post-process unmatch
improve: LLM-translate now request with temprature
2024-09-16 04:08:33 +08:00
6f25183654
update: fetcher and post-process
move the max threads and fetch limit in fetcher into env
update the postprocess flow to remove duplicates
2024-09-16 00:59:58 +08:00
9eeb3de828
update: fetcher, translator
increase threshold of split in fetcher
improve prompt for LLM-translator
2024-09-16 00:48:07 +08:00
7021687e10
add: postprocess 2024-09-15 23:54:37 +08:00
4c9f411f67
add: content fetcher for translate 2024-09-15 23:43:01 +08:00
ebd1113a6e
update: llm translate 2024-09-10 21:35:25 +08:00
dcf53ca002
add: spider 2024-09-10 21:35:00 +08:00
1acc1ce703
add: LLM-based batch translation
used for improve translation dataset quality
2024-09-10 00:52:55 +08:00
dc1722ca3d
ref: use argos-translate instead 2024-09-07 23:02:50 +08:00
bb0aa5b79b
update: translate
improve speed
2024-09-07 23:00:15 +08:00
12b9b910f4
add: translation 2024-09-07 15:53:21 +08:00
86394c7f87
update: readme 2024-09-01 22:57:52 +08:00
2c88faf9c0
add: readme 2024-09-01 22:31:52 +08:00
f28f83b48e
init 2024-09-01 22:17:04 +08:00