Will it be possible to fine-tune a 65B parameter model with 30GB of GPU memory (average) by the end of 2023?
14
Ṁ290Ṁ299resolved Mar 10
Resolved
N/A1H
6H
1D
1W
1M
ALL
QLoRA reduced the avg memory requirements from 750+ GB to < 48 GB of GPU memory (average) for a 65B model.
They checked by training 1000 models across several different instruction sets + architectures + parameter ranges [80M, 65B].
Will it be possible to reduce it further? Not just on 1 model but reliably, need consistent and compelling evidence.
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
People are also trading
Will GigaChat release an open-weights model with ≥100B parameters by the end of 2026?
45% chance
1GW AI training run before 2027?
81% chance
Before 2028, will a GPU the same or smaller die size as b100 achieve 2x or better max throughput on GPT-oss-120b?
50% chance
Will Aidan McLau's claim that very large models are "refusing instruction tuning" be validated by 2030?
42% chance
Will a GPT-4 quality model be trained for under $10.000 by 2030?
90% chance
How fast will you be able to train a GPT-2-level AI on a consumer GPU in 2030?
546
Open "Nano Banana Pro"‑Level Model on a Gaming GPU by 2028?
68% chance
Will a major cosmological simulation be AI-accelerated by the end of 2027?
53% chance
100GW AI training run before 2031?
32% chance
AI model training time decreases fourfold by mid-2027?
65% chance
Sort by:
This market needs clarification regarding time to finetune, and finetuned model performance, required for something to count as "finetuning".
Otherwise, I can trivially finetune even a 1T parameter model with zero gpus, because finetuning a model is a computational operation and regular non-gpu computers are Turing-complete.
People are also trading
Related questions
Will GigaChat release an open-weights model with ≥100B parameters by the end of 2026?
45% chance
1GW AI training run before 2027?
81% chance
Before 2028, will a GPU the same or smaller die size as b100 achieve 2x or better max throughput on GPT-oss-120b?
50% chance
Will Aidan McLau's claim that very large models are "refusing instruction tuning" be validated by 2030?
42% chance
Will a GPT-4 quality model be trained for under $10.000 by 2030?
90% chance
How fast will you be able to train a GPT-2-level AI on a consumer GPU in 2030?
546
Open "Nano Banana Pro"‑Level Model on a Gaming GPU by 2028?
68% chance
Will a major cosmological simulation be AI-accelerated by the end of 2027?
53% chance
100GW AI training run before 2031?
32% chance
AI model training time decreases fourfold by mid-2027?
65% chance