
To launch the benchmarking #1

Open
Gpoxolcku opened this issue Apr 1, 2024 · 6 comments

Comments

@Gpoxolcku

Hi! Awesome work and dataset collection! Is there a way (or a plan to release such a script) to launch a model's benchmark evaluation on the full set of data and obtain a comprehensive report on all the metrics?

@danielz02
Member

Thanks a lot for your interest! We are working on it :) The end goal would be to support common Hugging Face models. Do you have any model in mind?

@Gpoxolcku
Author

Thank you for the quick answer! Do you know an approximate release date? I'm just developing yet another model and am interested in metrics to track the progress :)

@danielz02
Member

danielz02 commented Apr 1, 2024

I'm thinking of some time around ICLR, which is early May, but I can definitely adjust the priority if there is a need for evaluating new models. What interface does your model use? Is it a Hugging Face pipeline or a Llava-like interface?
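For context on why the interface question matters: a harness can run any model through a uniform evaluation loop once the model is adapted to a single callable. The sketch below is purely illustrative and not this repository's actual API; the `predict` signature and the `(image, question, answer)` sample format are assumptions.

```python
from typing import Callable, Iterable, Tuple

def evaluate(predict: Callable[[str, str], str],
             samples: Iterable[Tuple[str, str, str]]) -> float:
    """Run a model over (image_path, question, answer) samples; return accuracy.

    `predict` is whatever callable the model exposes -- a Hugging Face
    pipeline or a Llava-style generate function -- wrapped so it maps
    (image_path, question) to an answer string. Hypothetical interface,
    not the repo's real API.
    """
    correct = total = 0
    for image_path, question, answer in samples:
        prediction = predict(image_path, question)
        # Simple exact-match scoring after normalization; real benchmarks
        # would use per-task metrics instead.
        correct += int(prediction.strip().lower() == answer.strip().lower())
        total += 1
    return correct / total if total else 0.0
```

With an adapter like this, supporting a new model only requires writing one small `predict` wrapper, which is why the maintainer asks which interface the model uses.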

@Gpoxolcku
Author

That would be very nice of you, thank you! I use a Llava-like interface on a local machine.

@Gpoxolcku
Author

Hi, is there any progress on finalizing the eval scripts? Such a benchmark would be very helpful for my projects, thanks :)

@alievrusik

Hello! I would also greatly appreciate such a script; right now it's not very convenient to compare different EO VLMs on this benchmark. Do you still plan to release it?
