euclaise

https://euclaise.xyz

AI & ML interests

None yet

Recent Activity

liked a model about 14 hours ago

xTimeCrystal/MiniModel-200M-Base

Text Generation • Updated about 17 hours ago • 9 • 30

liked a dataset about 14 hours ago

ronantakizawa/github-top-code

Viewer • Updated 1 day ago • 1.12M • 394 • 56

liked a model 1 day ago

jdopensource/JoyAI-LLM-Flash

Text Generation • 49B • Updated 6 days ago • 1.17k • 150

upvoted an article 1 day ago

Differential Transformer V2

Article • Jan 20

upvoted a paper 3 days ago

2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

Paper • 2602.17363 • Published 5 days ago • 7

liked a model 3 days ago

trillionlabs/Tri-21B

Text Generation • 21B • Updated 5 days ago • 6.97k • 45

liked a model 4 days ago

aloobun/teeny-s

Text Generation • Updated 3 days ago • 1

upvoted 3 papers 4 days ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published 11 days ago • 28

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

Paper • 2602.15322 • Published 7 days ago • 9

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 11 days ago • 42

liked 3 models 9 days ago

upvoted 4 papers 11 days ago

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

Paper • 2510.00526 • Published Oct 1, 2025 • 10

Dynamic Long Context Reasoning over Compressed Memory via End-to-End Reinforcement Learning

Paper • 2602.08382 • Published 15 days ago • 10

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 13 days ago • 28

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published 12 days ago • 95

upvoted 2 papers 13 days ago

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published 15 days ago • 35

iGRPO: Self-Feedback-Driven LLM Reasoning

Paper • 2602.09000 • Published 15 days ago • 15

upvoted a paper 21 days ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published 23 days ago • 41
