Sergio Paniego PRO

sergiopaniego

https://sergiopaniego.github.io/

Recent Activity

updated a dataset about 6 hours ago

agents-course/final-certificates

updated a dataset about 6 hours ago

agents-course/course-certificates-of-excellence

posted an update about 6 hours ago

Meet OpenEnv 👋, an open ecosystem of environments for intelligent agents. Build, share, and test agents safely and consistently. Ideal for training with TRL (we include examples 🤓), deployment, and community collaboration via the HF Hub.

Blog: https://huggingface.co/blog/openenv
Hub for Environments: https://huggingface.co/openenv
OpenEnv repo: https://github.com/meta-pytorch/OpenEnv
Try it out using TRL: https://huggingface.co/docs/trl/main/en/openenv

sergiopaniego's models (80)

sergiopaniego/Qwen3-8B-SFT-test

Updated 4 days ago

sergiopaniego/Qwen3-VL-4B-Instruct-trl-sft

Updated 4 days ago

sergiopaniego/gemma-3-4b-it

Updated 7 days ago

sergiopaniego/gemma-3n-E2B-it

Updated 7 days ago

sergiopaniego/Qwen3-8B-SFT-merged

Text Generation • 8B • Updated 8 days ago • 49

sergiopaniego/Qwen3-8B-SFT

Updated 8 days ago

sergiopaniego/Qwen2.5-7B-Instruct

Updated 13 days ago

sergiopaniego/Qwen2.5-0.5B-Instruct

Updated 13 days ago

sergiopaniego/Qwen2.5-0.5B-Instruct-merged

Text Generation • 0.5B • Updated 13 days ago • 24

sergiopaniego/Qwen3-1.7B-SFT-merged

Text Generation • 2B • Updated 14 days ago • 8

sergiopaniego/granite-4.0-micro-merged

Updated 14 days ago

sergiopaniego/granite-4.0-micro

Updated 14 days ago

sergiopaniego/Qwen3-4B-SFT

Updated 15 days ago

sergiopaniego/Qwen3-1.7B-SFT

Updated 15 days ago

sergiopaniego/Qwen3-14B-SFT-merged

Text Generation • 0.6B • Updated 15 days ago • 23

sergiopaniego/Qwen3-14B-SFT

Updated 15 days ago

sergiopaniego/Qwen2-0-5B-GRPO-vllm-trl

Updated 16 days ago

sergiopaniego/Qwen2-0.5B-GRPO-vllm-trl

Updated 20 days ago • 1

sergiopaniego/Qwen2-0.5B-GRPO-test

Updated 21 days ago

sergiopaniego/smol-course-smolvlm-instruct-trl-sft-ChartQA

Updated 30 days ago

sergiopaniego/qwen2-7b-instruct-trl-sft-ChartQA

sergiopaniego/smollm3-dpo-aligned

sergiopaniego/Qwen3-0.6B-SFT-20250911081335

Text Generation • 0.6B • Updated Sep 11 • 9

sergiopaniego/Qwen3-0.6B-SFT-20250911070158

Text Generation • 0.6B • Updated Sep 11 • 5

sergiopaniego/Qwen3-0.6B-SFT-20250908105022

Text Generation • 0.6B • Updated Sep 8 • 5

sergiopaniego/Qwen3-0.6B-SFT-20250908104717

sergiopaniego/trainer_output

Text Generation • 0.5B • Updated Aug 26 • 12 • 1

sergiopaniego/Qwen2-0.5B-SFT

Text Generation • 0.5B • Updated Aug 26 • 17 • 1

sergiopaniego/online-dpo-Qwen2.5-VL-3B-Instruct

sergiopaniego/pythia-1b-tldr-xpo