MagmaCubes1133's picture
Update README.md
8c88a99 verified

A newer version of the Gradio SDK is available: 6.1.0

Upgrade
metadata
title: 3B Thinking (vLLM + Controller)
emoji: πŸ†
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: true
license: apache-2.0

This Space wraps meta-llama/Llama-3.2-3B-Instruct with a simple controller: brainstorm (high T) β†’ critic (low T) β†’ finalize (low T).

Setup

  • Attach a GPU (T4 small is fine).
  • Add a Space Secret HF_TOKEN so the app can pull gated weights.

Notes

  • Uses the tokenizer's chat template for correct formatting.
  • Private reasoning stays inside <THINK>…</THINK>; only <FINAL>…</FINAL> is shown to the user.