Pedro Henrique Luz de Araujo

peluz
https://peluz.github.io/

AI & ML interests

None yet

Recent Activity

updated a Space 14 days ago
peluz/qwen3-0.6b-cat-lingo-grpo
published a Space 15 days ago
peluz/qwen3-0.6b-cat-lingo-grpo
updated a model 15 days ago
peluz/qwen3-0.6b-cat-lingo-grpo

Organizations

NLP lab at the University of Vienna • University of Vienna

Papers 3

arxiv:2512.12775
arxiv:2508.19764
arxiv:2407.02099

Spaces 2

Sleeping
Agents

Qwen3 0.6b Cat Lingo Grpo

👀

🐾 Qwen3-Cat — GRPO Cat-Lingo Demo

14 days ago
Sleeping
Agents

Qwen3 0.6b Cat Lingo Dpo

🐢

A Qwen3-0.6B model fine-tuned to be a cat

29 days ago

Models 7

peluz/qwen3-0.6b-cat-lingo-grpo

Updated 15 days ago

peluz/qwen3-0.6b-cat-lingo-dpo

0.6B • Updated 29 days ago • 58

peluz/q-Taxi-v4

Reinforcement Learning • Updated about 1 month ago

peluz/q-Taxi-v3

Reinforcement Learning • Updated about 1 month ago

peluz/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated about 1 month ago

peluz/ppo-Huggy

Reinforcement Learning • Updated May 10

peluz/ppo-LunarLander-v2

Reinforcement Learning • Updated May 9

Datasets 1

peluz/lener_br

Updated Jan 18, 2024 • 258 • 39