Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Darya Poludova's picture

Darya Poludova

dapoli

AI & ML interests

None yet

Organizations

None yet

Collections 1

interesting
  • Training Language Models to Self-Correct via Reinforcement Learning

    Paper • 2409.12917 • Published Sep 19, 2024 • 140
  • Language Models Learn to Mislead Humans via RLHF

    Paper • 2409.12822 • Published Sep 19, 2024 • 11
interesting
  • Training Language Models to Self-Correct via Reinforcement Learning

    Paper • 2409.12917 • Published Sep 19, 2024 • 140
  • Language Models Learn to Mislead Humans via RLHF

    Paper • 2409.12822 • Published Sep 19, 2024 • 11

spaces 1

Paused

First Agent Template

⚡

May 6, 2025

models 1

dapoli/ppo-LunarLander-v2

Reinforcement Learning • Updated Jun 3, 2024 • 1

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs