view article Article DeepMath: A lightweight math reasoning Agent with SmolAgents +1 5 days ago • 11
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques Mar 24 • 20
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model +6 Oct 29, 2024 • 59
view article Article CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG +4 Mar 15, 2024 • 13
view article Article CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG +4 Mar 15, 2024 • 13