Ougrid Dumdang

Ougrid-D

ougrid

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted an article about 1 month ago

Generative AI for Recommendation Systems: A Guide to Tokenizing User Interaction Data

upvoted a paper about 1 month ago

ARGenSeg: Image Segmentation with Autoregressive Image Generation Model

View all activity

Organizations

upvoted an article 4 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

5 days ago

•

374

upvoted an article about 1 month ago

Article

Generative AI for Recommendation Systems: A Guide to Tokenizing User Interaction Data

Mar 26

•

upvoted 2 papers about 1 month ago

ARGenSeg: Image Segmentation with Autoregressive Image Generation Model

Paper • 2510.20803 • Published Oct 23 • 9

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published Oct 22 • 29

upvoted a paper about 2 months ago

RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published Oct 14 • 49

upvoted a paper 3 months ago

LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence

Paper • 2509.12203 • Published Sep 15 • 19

upvoted 4 papers 4 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 242

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

A Survey on Diffusion Language Models

Paper • 2508.10875 • Published Aug 14 • 34

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Paper • 2508.05954 • Published Aug 8 • 6

upvoted an article 4 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

•

509

upvoted 5 articles 5 months ago

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Jul 23

•

Article

Five Big Improvements to Gradio MCP Servers

Jul 17

•

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

722

Article

Asynchronous Robot Inference: Decoupling Action Prediction and Execution

Jul 10

•

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3

•

289

upvoted 4 papers 5 months ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9 • 45

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 259

KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11 • 40

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 51

Ougrid Dumdang

AI & ML interests

Recent Activity

Organizations

Ougrid-D's activity

We Got Claude to Fine-Tune an Open Source LLM

Generative AI for Recommendation Systems: A Guide to Tokenizing User Interaction Data

Welcome GPT OSS, the new open-source model family from OpenAI!

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Five Big Improvements to Gradio MCP Servers

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Asynchronous Robot Inference: Decoupling Action Prediction and Execution

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data