Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,403 workflow runs
1,403 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Better multilingual support
Test & Check Code Quality #253: Pull request #285 synchronize by guipenedo
October 3, 2024 15:36 2m 57s multilingual
October 3, 2024 15:36 2m 57s
changed indic langs tokenizers to indicnlp
Secret Leaks #86: Commit cea9102 pushed by guipenedo
October 3, 2024 15:36 19s multilingual
October 3, 2024 15:36 19s
Fix languages listify bug
Test & Check Code Quality #252: Pull request #294 opened by BramVanroy
September 29, 2024 16:38 2m 18s BramVanroy:patch-2
September 29, 2024 16:38 2m 18s
Add several open-source text extraction libraries
Test & Check Code Quality #251: Pull request #293 synchronize by garrethlee
September 27, 2024 02:55 2m 15s feat/text-extraction
September 27, 2024 02:55 2m 15s
fix: move postprocessor to init
Secret Leaks #85: Commit 891850e pushed by garrethlee
September 27, 2024 02:55 17s feat/text-extraction
September 27, 2024 02:55 17s
Add several open-source text extraction libraries
Test & Check Code Quality #250: Pull request #293 opened by garrethlee
September 27, 2024 01:01 2m 22s feat/text-extraction
September 27, 2024 01:01 2m 22s
feat: changed configs & pyproject
Secret Leaks #84: Commit ea3a915 pushed by garrethlee
September 27, 2024 01:00 17s feat/text-extraction
September 27, 2024 01:00 17s
Better multilingual support
Test & Check Code Quality #249: Pull request #285 synchronize by guipenedo
September 21, 2024 17:33 2m 57s multilingual
September 21, 2024 17:33 2m 57s
add todo
Secret Leaks #83: Commit 9ad0747 pushed by guipenedo
September 21, 2024 17:33 17s multilingual
September 21, 2024 17:33 17s
Update huggingface.py
Secret Leaks #82: Commit c7f6f51 pushed by guipenedo
September 11, 2024 13:39 16s main
September 11, 2024 13:39 16s
Update huggingface.py
Test & Check Code Quality #248: Commit c7f6f51 pushed by guipenedo
September 11, 2024 13:39 2m 55s main
September 11, 2024 13:39 2m 55s
Fixed a bug that in the reader pipline, the document count is always …
Secret Leaks #81: Commit 9142e3e pushed by guipenedo
September 11, 2024 11:35 24s main
September 11, 2024 11:35 24s
Fixed a bug that in the reader pipline, the document count is always …
Test & Check Code Quality #247: Commit 9142e3e pushed by guipenedo
September 11, 2024 11:35 2m 51s main
September 11, 2024 11:35 2m 51s
Better multilingual support
Test & Check Code Quality #246: Pull request #285 synchronize by guipenedo
September 11, 2024 10:05 3m 8s multilingual
September 11, 2024 10:05 3m 8s
fix tokenizer issues
Secret Leaks #80: Commit a147fd5 pushed by guipenedo
September 11, 2024 10:05 19s multilingual
September 11, 2024 10:05 19s
Better multilingual support
Test & Check Code Quality #243: Pull request #285 opened by guipenedo
September 5, 2024 17:48 3m 11s multilingual
September 5, 2024 17:48 3m 11s
add all available tokenizers and all iso-639-1 languages
Secret Leaks #79: Commit 25a5919 pushed by guipenedo
September 5, 2024 17:48 20s multilingual
September 5, 2024 17:48 20s
September 4, 2024 17:16 21s
Secret Leaks
Secret Leaks #77: by guipenedo
September 4, 2024 11:51 22s multilingual
September 4, 2024 11:51 22s
Add job_id_position Parameter to launch_slurm_job Method
Test & Check Code Quality #240: Pull request #282 opened by StephenRebel
September 3, 2024 22:32 3m 30s StephenRebel:slurm_submition_changes
September 3, 2024 22:32 3m 30s
Merge pull request #280 from huggingface/readme_formatting_issues
Test & Check Code Quality #239: Commit c2fc902 pushed by hynky1999
September 2, 2024 12:20 2m 50s main
September 2, 2024 12:20 2m 50s
Merge pull request #280 from huggingface/readme_formatting_issues
Secret Leaks #76: Commit c2fc902 pushed by hynky1999
September 2, 2024 12:20 21s main
September 2, 2024 12:20 21s
Readme nits
Test & Check Code Quality #238: Pull request #280 synchronize by hynky1999
September 2, 2024 12:18 3m 5s readme_formatting_issues
September 2, 2024 12:18 3m 5s