Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,403 workflow runs
1,403 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

add token and char count to histogram stats
Secret Leaks #44: Commit 359d9fa pushed by guipenedo
July 15, 2024 15:32 20s hist_token_counts
July 15, 2024 15:32 20s
do not kill everything when a single task fails
Secret Leaks #43: Commit 84dd126 pushed by guipenedo
July 12, 2024 00:13 15s slurm_nodes
July 12, 2024 00:13 15s
fix merge stats
Secret Leaks #42: Commit 0814338 pushed by guipenedo
July 10, 2024 22:37 16s slurm_nodes
July 10, 2024 22:37 16s
bugfixes
Secret Leaks #41: Commit bc19878 pushed by guipenedo
July 10, 2024 22:31 17s slurm_nodes
July 10, 2024 22:31 17s
push slurm nodes executor
Secret Leaks #40: Commit 6142073 pushed by guipenedo
July 10, 2024 16:37 16s slurm_nodes
July 10, 2024 16:37 16s
Add withdirs to extra_options only when not using glob_pattern (#244)
Test & Check Code Quality #189: Commit c279f26 pushed by guipenedo
July 9, 2024 10:18 3m 8s main
July 9, 2024 10:18 3m 8s
Add withdirs to extra_options only when not using glob_pattern (#244)
Secret Leaks #39: Commit c279f26 pushed by guipenedo
July 9, 2024 10:18 21s main
July 9, 2024 10:18 21s
fix shard check
Secret Leaks #38: Commit aa43e3f pushed by guipenedo
July 8, 2024 16:14 18s main
July 8, 2024 16:14 18s
fix shard check
Test & Check Code Quality #187: Commit aa43e3f pushed by guipenedo
July 8, 2024 16:14 3m 23s main
July 8, 2024 16:14 3m 23s
fix linter
Secret Leaks #37: Commit 8391d12 pushed by guipenedo
July 8, 2024 11:18 22s main
July 8, 2024 11:18 22s
fix linter
Test & Check Code Quality #186: Commit 8391d12 pushed by guipenedo
July 8, 2024 11:18 2m 58s main
July 8, 2024 11:18 2m 58s
fix split_dataset_by_node on HuggingFaceReader
Secret Leaks #36: Commit e01bd0a pushed by guipenedo
July 8, 2024 10:25 18s main
July 8, 2024 10:25 18s
fix split_dataset_by_node on HuggingFaceReader
Test & Check Code Quality #185: Commit e01bd0a pushed by guipenedo
July 8, 2024 10:25 3m 20s main
July 8, 2024 10:25 3m 20s
add dependencies lid.py, io.py #239 (#241)
Test & Check Code Quality #184: Commit 55a9072 pushed by guipenedo
July 8, 2024 10:09 3m 19s main
July 8, 2024 10:09 3m 19s
add dependencies lid.py, io.py #239 (#241)
Secret Leaks #35: Commit 55a9072 pushed by guipenedo
July 8, 2024 10:09 23s main
July 8, 2024 10:09 23s
index file read fix (#229)
Secret Leaks #34: Commit af63762 pushed by guipenedo
July 8, 2024 09:42 22s main
July 8, 2024 09:42 22s
index file read fix (#229)
Test & Check Code Quality #182: Commit af63762 pushed by guipenedo
July 8, 2024 09:42 4m 11s main
July 8, 2024 09:42 4m 11s
add dependencies lid.py, io.py #239
Test & Check Code Quality #181: Pull request #241 opened by aiqwe
July 8, 2024 06:02 3m 50s aiqwe:add_lid_dependencies
July 8, 2024 06:02 3m 50s
option to keep more language scores
Test & Check Code Quality #180: Commit 061d4db pushed by guipenedo
July 5, 2024 15:40 3m 46s main
July 5, 2024 15:40 3m 46s
option to keep more language scores
Secret Leaks #33: Commit 061d4db pushed by guipenedo
July 5, 2024 15:40 17s main
July 5, 2024 15:40 17s
add batching to filters
Test & Check Code Quality #179: Commit 7ba873f pushed by guipenedo
July 5, 2024 10:33 4m 6s main
July 5, 2024 10:33 4m 6s
add batching to filters
Secret Leaks #32: Commit 7ba873f pushed by guipenedo
July 5, 2024 10:33 18s main
July 5, 2024 10:33 18s
nit
Test & Check Code Quality #178: Commit 898efc0 pushed by guipenedo
July 3, 2024 23:25 3m 18s main
July 3, 2024 23:25 3m 18s
nit
Secret Leaks #31: Commit 898efc0 pushed by guipenedo
July 3, 2024 23:25 21s main
July 3, 2024 23:25 21s