Spaces:

opencompass
/

open_vlm_leaderboard

Running on CPU Upgrade

App Files Files Community

Gemma results weirdness

#18

by louisglobal - opened 10 days ago

Discussion

louisglobal

10 days ago

Hi,
There are incoherent results between Gemma 3 paper and this eval toolkit. On the paper https://arxiv.org/pdf/2503.19786, they claim a 68.8 score on ChartQA versus 33.7 on the leaderboard ? To be honest ,I was not able to reproduce either since inference simply does not work on gemma with any dataset with code from the VLMEval git.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment