GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Via @animaanandkumar.
For the first time, we show that the Llama 7B LLM can be trained on a single consumer-grade GPU (an RTX 4090) with only 24 GB of memory.