How Much VRAM ?
#7
by
						
Ni3SinghR
	
							
						- opened
							
					
How much VRAM Required to run this model?
- You can run it with 750gb of RAM(not VRAM).
I don't think that's necessary.. You can run it on 400GB RAM when quantized. You'll need about 96GB VRAM to be able to run it decently fast.
Rule of thumb , for fp8 you will need around  the same amount of parameters for vram,
for example :
70b -> ~  70 gb vram
30b ->~   30gn vram 
For fb16 it will be twice
70b ->~  140gb
30b->~  60gb

 
						 
						 
						
