I set gradient_accumulation_steps to 1, batch size to 1, and used LoRA to reduce the number of trainable parameters to 3,276,800. However, even with two V100 GPUs (32 GB each), I still can't run this experiment because of a CUDA out-of-memory error.
What other methods can reduce the GPU memory requirement?
Hi, I use 8 A100 GPUs with 80GB of memory each to fine-tune the model. For your case, I suggest using FP16 training and reducing the number of LoRA trainable parameters to conduct the experiments.
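Below is a minimal sketch of what that suggestion could look like, assuming the fine-tuning script is built on HuggingFace Transformers with PEFT; the module names and hyperparameter values are illustrative assumptions, not this repo's actual configuration.

```python
# Hypothetical sketch: enable FP16 training and shrink the LoRA adapter.
# Assumes a Transformers + PEFT setup; target_modules names are illustrative.
from transformers import TrainingArguments
from peft import LoraConfig

# Lower rank and fewer targeted modules -> fewer trainable LoRA parameters.
lora_config = LoraConfig(
    r=4,                                  # smaller rank than a typical 8/16
    lora_alpha=8,
    target_modules=["q_proj", "v_proj"],  # hypothetical module names
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="./output",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=1,
    fp16=True,  # half-precision training to roughly halve activation memory
)
```

Reducing the LoRA rank (`r`) and the number of targeted modules shrinks the adapter and its optimizer state, while `fp16=True` stores activations and gradients in half precision, which together can make the run fit on smaller GPUs.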