Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 8 days ago • 58
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF Text Generation • 18B • Updated about 1 month ago • 58.6k • 449
nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • 8B • Updated Apr 17, 2025 • 409 • 121
bartowski/DeepSeek-R1-Distill-Llama-70B-GGUF Text Generation • 71B • Updated Jan 22, 2025 • 2.45k • 35