You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am using Llama-3.2-1B-Instruct model with hugging face and can't reproduce MMLU score of 49.3. My 5 shot eval ended up with 46.65 instead. I wonder what is wrong with my setup, any examples of how exactly to compose a prompt?
Hi,
Thanks for the model release.
I am using Llama-3.2-1B-Instruct model with hugging face and can't reproduce MMLU score of 49.3. My 5 shot eval ended up with 46.65 instead. I wonder what is wrong with my setup, any examples of how exactly to compose a prompt?
I've used this as a starting point https://github.com/QwenLM/Qwen/blob/main/eval/evaluate_mmlu.py and modified model init to
any pointers on what can be wrong with my setup or examples of exact prompt etc I should use?
Thank you.
The text was updated successfully, but these errors were encountered: