view reply people commenting about how they can't run this locally are saying more about themselves than they are about anything else wheres the spirit of innovation? get out a PC from 1997, try to get it running LM Studio, never know if you dont try!
view reply I'll give this a try on our office GPU rig. VRAM limited to 96gb but DRAM is 1024gb attached to a 64 core threadripper. I doubt I'll be able to pull 40 TPS on this rig, but hey, local is local! Thx to all the Unsloth guys as usual!
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper β’ 2511.21689 β’ Published Nov 26, 2025 β’ 121
view article Article Building the Open Agent Ecosystem Together: Introducing OpenEnv +8 Oct 23, 2025 β’ 148
view article Article DeepMath: A lightweight math reasoning Agent with smolagents +1 Dec 4, 2025 β’ 37
view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator Dec 17, 2025 β’ 46
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents Paper β’ 2509.06501 β’ Published Sep 8, 2025 β’ 80
[models] CPU-Offload &/|| A6000x2 Collection TPS can be as low as 1.0, seriously. its SLOW. β’ 6 items β’ Updated about 21 hours ago β’ 1
[papers] Gameplay Optimization Collection Research papers that may contribute to a broader approach to teaching machines how to play complex strategy games beyond just Chess. β’ 16 items β’ Updated about 21 hours ago β’ 1
World Craft: Agentic Framework to Create Visualizable Worlds via Text Paper β’ 2601.09150 β’ Published 17 days ago β’ 19