Hugging Face: GaLore enables 7B-parameter LLM training on consumer GPUs with 82.5% optimizer-memory reduction | SignalBreak | SignalBreak