Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jonathan Mamou's picture
11 8

Jonathan Mamou

jmamou
moshew's profile picture 21world's profile picture Fishtiks's profile picture
·
  • jmamou

AI & ML interests

None yet

Organizations

Intel's profile picture Need4Speed's profile picture il-eai-nlp's profile picture

upvoted an article 9 months ago
view article
Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

Mar 24
•
20
upvoted a paper 10 months ago

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

Paper • 2502.09390 • Published Feb 13 • 16
upvoted a paper about 1 year ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 11
upvoted an article about 1 year ago
view article
Article

Faster Assisted Generation with Dynamic Speculation

  • +5
Oct 8, 2024
•
49
upvoted 3 papers over 1 year ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 39

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23, 2024 • 18
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs