Update README.md
README.md CHANGED

@@ -1,10 +1,10 @@
 ---
-title:
+title: ZeroGPU-LLM-Inference
 emoji: 🧠
 colorFrom: pink
 colorTo: purple
 sdk: gradio
-sdk_version: 5.
+sdk_version: 5.29.0
 app_file: app.py
 pinned: false
 license: apache-2.0

@@ -77,4 +77,4 @@ Use the dropdown to select any of these:
 3. After up to *Search Timeout* seconds, snippets merge into the system prompt.
 4. The selected model pipeline is loaded (bf16→f16→f32 fallback) on ZeroGPU.
 5. Prompt is formatted—any `<think>…</think>` blocks will be streamed as separate “💭 Thought.”
-6. Tokens stream to the Chatbot UI. Press **Cancel** to stop mid-generation.
+6. Tokens stream to the Chatbot UI. Press **Cancel** to stop mid-generation.
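The bf16→f16→f32 fallback mentioned in step 4 of the README can be sketched like this (a minimal illustration, not the Space's actual code; the function name and the injected `load_fn` are assumptions — in practice `load_fn` would wrap something like `transformers.pipeline(..., torch_dtype=dtype)`):

```python
def load_with_dtype_fallback(load_fn, dtypes=("bf16", "f16", "f32")):
    """Try each dtype in order of decreasing preference.

    Calls load_fn(dtype) for each entry in `dtypes` and returns the
    first successful result; if every dtype fails, re-raise the last
    error so the caller sees why loading was impossible.
    """
    last_err = None
    for dtype in dtypes:
        try:
            return load_fn(dtype)
        except Exception as err:
            last_err = err  # remember the failure, fall back to the next dtype
    raise RuntimeError("model could not be loaded in any dtype") from last_err
```

Keeping the dtype order widest-savings-first (bf16, then f16, then f32) means the model lands in the smallest memory footprint the hardware supports.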
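The `<think>…</think>` handling in step 5 — streaming thought blocks separately from the answer — could be implemented roughly as below (a sketch under assumptions; the regex, function name, and tuple format are hypothetical, not taken from the Space's source):

```python
import re

# Non-greedy match so multiple <think> blocks are split individually.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

def split_thoughts(text):
    """Split generated text into ("thought", ...) and ("answer", ...) segments,
    preserving their original order."""
    parts = []
    last = 0
    for m in THINK_RE.finditer(text):
        if m.start() > last:
            parts.append(("answer", text[last:m.start()]))
        parts.append(("thought", m.group(1)))
        last = m.end()
    if last < len(text):
        parts.append(("answer", text[last:]))
    return parts
```

The UI layer would then render `"thought"` segments under a collapsible “💭 Thought” label while streaming `"answer"` segments into the chat bubble as usual.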