Hello, I'm trying to use AIMET_TORCH to quantize an LLM, e.g. Llama v2. Where can I find a Jupyter notebook example that shows quantization simulation for an LLM?
I tried to quantize Llama 1B, but the quantized model outputs random characters...
I'm also wondering about the difference between AIMET Pro and the public AIMET. The Qualcomm team quantized Llama 3B quite well with AIMET Pro, but there is no released example showing how they did it.
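For anyone debugging the "random characters" symptom: quantization simulation boils down to inserting quantize-dequantize ("fake quant") operations, and garbage output usually means the chosen scale clips or crushes the real values. Below is a minimal, framework-free sketch of per-tensor symmetric fake quantization. It is only an illustration of the mechanism, not the AIMET API; `fake_quant` and its parameters are hypothetical names for this example.

```python
def fake_quant(values, bitwidth=8):
    """Simulate quantization: map floats to a signed integer grid and back.

    Per-tensor symmetric scheme: scale is derived from the largest
    absolute value, so a single outlier inflates the scale and
    destroys the resolution of all smaller values -- one common
    reason quantized LLMs emit garbage.
    """
    qmax = 2 ** (bitwidth - 1) - 1  # e.g. 127 for int8
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Quantize: scale, round, and clamp to the integer range.
    quantized = [max(-qmax - 1, min(qmax, round(v / scale))) for v in values]
    # Dequantize: back to floats, now carrying the rounding error.
    return [q * scale for q in quantized]
```

With a well-behaved tensor the round-trip error stays within half a quantization step; adding one large outlier to the input makes that step much coarser, which is why per-channel scales or outlier handling matter for LLM weights and activations.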