Add AudioCaps scenario #3137

asillycat · 2024-11-07T00:30:21Z

scenario_state.json
Add AudioCaps scenario.

src/helm/benchmark/scenarios/audio_language/audiocaps_senario.py

ImKeTT · 2024-11-07T01:58:09Z

src/helm/benchmark/static/schema_speech.yaml

+    taxonomy:
+      task: audio captioning
+      what: audio clips in the wild
+      who: youtube


I think it's real speakers from youtube videos, right? How about just change this to real speakers?

ImKeTT

Thanks Zijun, just left some comments

ImKeTT · 2024-11-07T04:27:00Z

src/helm/benchmark/scenarios/audio_language/audiocaps_senario.py

-
-        for row in tqdm(load_dataset("Olivia714/audiocaps", cache_dir=output_path, split=TEST_SPLIT)):
+        ensure_file_downloaded(source_url=AudioCapsScenario.DOWNLOADING_URL, target_path=data_dir, unpack=True)
+        assert os.path.exists(data_dir), f"Download the wav_files from {AudioCapsScenario.DOWNLOADING_URL}"


Could you delete this line? This is redundant since we're using ensure_file_downloaded

ImKeTT · 2024-11-07T04:28:25Z

src/helm/benchmark/static/schema_speech.yaml

@@ -297,6 +297,6 @@ run_groups:
    taxonomy:
      task: audio captioning
      what: audio clips in the wild
-      who: youtube
+      who: sound and human voices from real-life scenes


how about change to real speakers?

ImKeTT

Thanks Zijun, I think this PR is ready

add 1 audio-language scenarios

4d65c9c

ImKeTT self-requested a review November 7, 2024 01:28

ImKeTT reviewed Nov 7, 2024

View reviewed changes

src/helm/benchmark/scenarios/audio_language/audiocaps_senario.py Outdated Show resolved Hide resolved

ImKeTT reviewed Nov 7, 2024

View reviewed changes

src/helm/benchmark/scenarios/audio_language/audiocaps_senario.py Outdated Show resolved Hide resolved

ImKeTT reviewed Nov 7, 2024

View reviewed changes

ImKeTT requested changes Nov 7, 2024

View reviewed changes

[fi bugs in dataset loading

d6daade

ImKeTT reviewed Nov 7, 2024

View reviewed changes

asillycat added 2 commits November 7, 2024 04:39

[fix] change schema type

fc0f30c

[fix] remove redundancy

2b4814d

ImKeTT approved these changes Nov 7, 2024

View reviewed changes

ImKeTT merged commit a40c760 into stanford-crfm:main Nov 7, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add AudioCaps scenario #3137

Add AudioCaps scenario #3137

asillycat commented Nov 7, 2024

ImKeTT Nov 7, 2024

ImKeTT left a comment

ImKeTT Nov 7, 2024

ImKeTT Nov 7, 2024

ImKeTT left a comment

Add AudioCaps scenario #3137

Add AudioCaps scenario #3137

Conversation

asillycat commented Nov 7, 2024

ImKeTT Nov 7, 2024

Choose a reason for hiding this comment

ImKeTT left a comment

Choose a reason for hiding this comment

ImKeTT Nov 7, 2024

Choose a reason for hiding this comment

ImKeTT Nov 7, 2024

Choose a reason for hiding this comment

ImKeTT left a comment

Choose a reason for hiding this comment