Urro's picture

In a Training Loop 🔄

Urro

urroxyz

·

https://urro.xyz/

urroxyz

AI & ML interests

None yet

Recent Activity

updated a collection 39 minutes ago

WTF GENIUS PAPERS

upvoted a paper 39 minutes ago

Context Parametrization with Compositional Adapters

upvoted a paper about 12 hours ago

Scaling Embeddings Outperforms Scaling Experts in Language Models

View all activity

Organizations

upvoted a paper 39 minutes ago

Context Parametrization with Compositional Adapters

Paper • 2509.22158 • Published Sep 26, 2025 • 1

upvoted 8 papers about 12 hours ago

Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published 2 days ago • 77

FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale

Paper • 2601.22146 • Published 1 day ago • 4

Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning

Paper • 2601.20829 • Published 2 days ago • 5

VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning

Paper • 2601.20055 • Published 3 days ago • 6

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 3 days ago • 20

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published 2 days ago • 21

EvolVE: Evolutionary Search for LLM-based Verilog Generation and Optimization

Paper • 2601.18067 • Published 5 days ago • 4

Self-Distillation Enables Continual Learning

Paper • 2601.19897 • Published 3 days ago • 18

upvoted a paper 2 days ago

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

Paper • 2411.07279 • Published Nov 11, 2024 • 4

upvoted 3 papers 3 days ago

Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family

Paper • 2504.18225 • Published Apr 25, 2025 • 15

Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 20

Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models

Paper • 2512.06266 • Published Dec 6, 2025 • 5

upvoted 5 papers 4 days ago

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published 9 days ago • 176

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 4 days ago • 38

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10, 2024 • 39

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

Paper • 2312.12456 • Published Dec 16, 2023 • 45

CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation

Paper • 2409.02098 • Published Sep 3, 2024 • 3

upvoted 2 papers 6 days ago

FQuAD: French Question Answering Dataset

Paper • 2002.06071 • Published Feb 14, 2020 • 1

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Paper • 2510.07743 • Published Oct 9, 2025 • 10