Spaces: AC2513/gemma-demo

AC2513 committed on Jun 5

Commit 6acfb9b (1 parent: e6d4031)

finished final choice

Files changed (1)

Documentation/Technical_Report.md

@@ -62,7 +62,7 @@ Since I am deploying on HuggingFace Spaces, I will be utilizing their **ZeroGPU*
 ### Final Choice
-For my final choice, in order to fully demonstrate the capabilities of the Gemma 3 family, I have selected the Gemma 3-27B model with Key-Value caching enabled. This setup leverages the full compute and memory bandwidth of the NVIDIA H200 (70GB VRAM) provided by HuggingFace's ZeroGPU environment. Overall, this configuration strikes a strong balance between maximum model capability and inference efficiency, ensuring that the demo remains smooth, accurate, and production-ready — even when scaling to complex or multimodal inputs.
+For my final choice, in order to fully demonstrate the capabilities of the Gemma 3 family, I have selected the Gemma 3-27B model with Key-Value caching enabled. This setup leverages the full compute and memory bandwidth of the NVIDIA H200 (70GB VRAM) provided by HuggingFace's ZeroGPU environment. Overall, this configuration strikes a strong balance between maximum model capability and inference efficiency, ensuring that the demo remains smooth, accurate, and production-ready—even when scaling to complex or multimodal inputs.
 ## User Research