Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
1210
107
99
Quentin Gallouédec
PRO
qgallouedec
Follow
sigridjineth's profile picture
rolandollamas's profile picture
cswiz's profile picture
470 followers
·
284 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a model
about 6 hours ago
qgallouedec/Qwen3-1.7B-parsing
published
a model
about 6 hours ago
qgallouedec/Qwen3-1.7B-parsing
upvoted
a
paper
2 days ago
ARE: Scaling Up Agent Environments and Evaluations
View all activity
Organizations
qgallouedec
's models
773
Sort: Recently updated
qgallouedec/Qwen2-0.5B-Instruct-Capybara
Updated
Oct 2, 2024
qgallouedec/xpo-qwen2
Text Generation
•
0.5B
•
Updated
Sep 26, 2024
•
7
qgallouedec/online-dpo-qwen2-4
Text Generation
•
0.5B
•
Updated
Sep 25, 2024
•
7
qgallouedec/online-dpo-qwen2-2
Text Generation
•
0.5B
•
Updated
Sep 25, 2024
•
5
qgallouedec/online-dpo-qwen2-3
Text Generation
•
0.5B
•
Updated
Sep 25, 2024
•
4
qgallouedec/pythia-1b-tldr-online-dpo-peft
Updated
Sep 15, 2024
qgallouedec/pythia-1b-tldr-online-dpo-no-peft
1B
•
Updated
Sep 13, 2024
•
3
qgallouedec/pythia-1b-tldr-online-dpo
1B
•
Updated
Sep 12, 2024
•
3
qgallouedec/llama-3.1-8b-ultrafeedback-online-dpo
Updated
Sep 9, 2024
qgallouedec/my_hub_model_id
Text Generation
•
0.3B
•
Updated
Sep 9, 2024
•
3
qgallouedec/llama3.1-8b-sft
Updated
Sep 9, 2024
qgallouedec/llama3.1-8b-instruct
Updated
Sep 4, 2024
qgallouedec/online_dpo_uf_1
0.5B
•
Updated
Aug 28, 2024
•
3
qgallouedec/online-dpo-qwen2-0.5B-lr-3e-7
0.5B
•
Updated
Aug 27, 2024
•
3
qgallouedec/online-dpo-qwen2-0.5B-lr-3e-6
0.5B
•
Updated
Aug 25, 2024
•
3
qgallouedec/kto-aligned-model
Text Generation
•
2B
•
Updated
Aug 22, 2024
•
3
qgallouedec/gpt2-imdb-pos-v2
Text Generation
•
0.1B
•
Updated
Aug 22, 2024
•
4
qgallouedec/EleutherAI_pythia-1b
Text Generation
•
1B
•
Updated
Aug 21, 2024
•
36
qgallouedec/reward_modeling_anthropic_hh
0.3B
•
Updated
Aug 18, 2024
•
3
qgallouedec/reward_modeling_anthropic_hh_crc
0.3B
•
Updated
Aug 17, 2024
•
3
qgallouedec/tmp
1B
•
Updated
Aug 17, 2024
•
3
qgallouedec/sft_openassistant-guanaco
Updated
Aug 5, 2024
qgallouedec/sft-llava-1.5-7b-hf
Updated
Jul 24, 2024
•
3
qgallouedec/test
Updated
Jul 23, 2024
qgallouedec/ppo-PushCube-v0
Reinforcement Learning
•
Updated
Jun 20, 2024
•
7
qgallouedec/ppo-ReachCube-v0
Reinforcement Learning
•
Updated
Jun 13, 2024
•
2
qgallouedec/tqc-ReachCube-v0
Reinforcement Learning
•
Updated
Jun 13, 2024
•
2
qgallouedec/ppo-LiftCube-v0
Robotics
•
Updated
Jun 10, 2024
•
2
qgallouedec/tqc-LiftCube-v0
Reinforcement Learning
•
Updated
Jun 9, 2024
•
3
qgallouedec/wildvision-internal-data_formatted
Updated
Jun 2, 2024
Previous
1
...
3
4
5
6
7
...
26
Next