Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing HF dataset: HausaNLP/afrisenti-lid-data #1777

Open
isaac-chung opened this issue Jan 12, 2025 · 4 comments
Open

Missing HF dataset: HausaNLP/afrisenti-lid-data #1777

isaac-chung opened this issue Jan 12, 2025 · 4 comments

Comments

@isaac-chung
Copy link
Collaborator

isaac-chung commented Jan 12, 2025

https://github.com/embeddings-benchmark/mteb/actions/runs/12734219217/job/35491879099?pr=1775

=========================== short test summary info ============================
FAILED tests/test_tasks/test_all_abstasks.py::test_dataset_availability - AssertionError: Datasets not available on Hugging Face:
  HausaNLP/afrisenti-lid-data - revision f17cb5f3ec522ac604601fd09db9fd644ac66ca5
assert False

[update]: Broke main as well.

@Samoed
Copy link
Collaborator

Samoed commented Jan 12, 2025

Organization HF https://huggingface.co/HausaNLP, github https://github.com/hausanlp

@isaac-chung
Copy link
Collaborator Author

Organization HF https://huggingface.co/HausaNLP, github https://github.com/hausanlp

What is your suggestion here?

@Samoed
Copy link
Collaborator

Samoed commented Jan 13, 2025

@hausanlp Could you provide information on why this dataset was removed?

@KennethEnevoldsen
Copy link
Contributor

I suppose if we do not get a response here we will have to remove it. Might be worth rehosting at least the dataset used for benchmarks on MTEB in case they are removed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants