Sergio Paniego
PRO
sergiopaniego
1318 followers · 70 following
https://sergiopaniego.github.io/
sergio-paniego-blanco
AI & ML interests
None yet
Recent Activity
updated a dataset about 6 hours ago
agents-course/final-certificates
updated a dataset about 6 hours ago
agents-course/course-certificates-of-excellence
posted an update about 6 hours ago
Meet OpenEnv, an open ecosystem of environments for intelligent agents. Build, share, and test agents safely and consistently. Ideal for training with TRL (we include examples), deployment, and community collaboration via the HF Hub.
Blog: https://huggingface.co/blog/openenv
Hub for Environments: https://huggingface.co/openenv
OpenEnv repo: https://github.com/meta-pytorch/OpenEnv
Try it out using TRL: https://huggingface.co/docs/trl/main/en/openenv
Organizations
sergiopaniego's models (80)
Sort: Recently updated
sergiopaniego/Qwen3-8B-SFT-test • Updated 4 days ago
sergiopaniego/Qwen3-VL-4B-Instruct-trl-sft • Updated 4 days ago
sergiopaniego/gemma-3-4b-it • Updated 7 days ago
sergiopaniego/gemma-3n-E2B-it • Updated 7 days ago
sergiopaniego/Qwen3-8B-SFT-merged • Text Generation • 8B • Updated 8 days ago • 49
sergiopaniego/Qwen3-8B-SFT • Updated 8 days ago
sergiopaniego/Qwen2.5-7B-Instruct • Updated 13 days ago
sergiopaniego/Qwen2.5-0.5B-Instruct • Updated 13 days ago
sergiopaniego/Qwen2.5-0.5B-Instruct-merged • Text Generation • 0.5B • Updated 13 days ago • 24
sergiopaniego/Qwen3-1.7B-SFT-merged • Text Generation • 2B • Updated 14 days ago • 8
sergiopaniego/granite-4.0-micro-merged • Updated 14 days ago
sergiopaniego/granite-4.0-micro • Updated 14 days ago
sergiopaniego/Qwen3-4B-SFT • Updated 15 days ago
sergiopaniego/Qwen3-1.7B-SFT • Updated 15 days ago
sergiopaniego/Qwen3-14B-SFT-merged • Text Generation • 0.6B • Updated 15 days ago • 23
sergiopaniego/Qwen3-14B-SFT • Updated 15 days ago
sergiopaniego/Qwen2-0-5B-GRPO-vllm-trl • Updated 16 days ago
sergiopaniego/Qwen2-0.5B-GRPO-vllm-trl • Updated 20 days ago • 1
sergiopaniego/Qwen2-0.5B-GRPO-test • Updated 21 days ago
sergiopaniego/smol-course-smolvlm-instruct-trl-sft-ChartQA • Updated 30 days ago
sergiopaniego/qwen2-7b-instruct-trl-sft-ChartQA • Updated Sep 18
sergiopaniego/smollm3-dpo-aligned • Updated Sep 16
sergiopaniego/Qwen3-0.6B-SFT-20250911081335 • Text Generation • 0.6B • Updated Sep 11 • 9
sergiopaniego/Qwen3-0.6B-SFT-20250911070158 • Text Generation • 0.6B • Updated Sep 11 • 5
sergiopaniego/Qwen3-0.6B-SFT-20250908105022 • Text Generation • 0.6B • Updated Sep 8 • 5
sergiopaniego/Qwen3-0.6B-SFT-20250908104717 • Updated Sep 8
sergiopaniego/trainer_output • Text Generation • 0.5B • Updated Aug 26 • 12 • 1
sergiopaniego/Qwen2-0.5B-SFT • Text Generation • 0.5B • Updated Aug 26 • 17 • 1
sergiopaniego/online-dpo-Qwen2.5-VL-3B-Instruct • Updated Aug 14
sergiopaniego/pythia-1b-tldr-xpo • Updated Aug 13