Quentin Gallouédec's picture

Quentin Gallouédec PRO

qgallouedec

·

AI & ML interests

None yet

Recent Activity

updated a model about 6 hours ago

qgallouedec/Qwen3-1.7B-parsing

published a model about 6 hours ago

qgallouedec/Qwen3-1.7B-parsing

upvoted a paper 2 days ago

ARE: Scaling Up Agent Environments and Evaluations

View all activity

Organizations

qgallouedec 's models 773

qgallouedec/Qwen2-0.5B-Instruct-Capybara

Updated Oct 2, 2024

qgallouedec/xpo-qwen2

Text Generation • 0.5B • Updated Sep 26, 2024 • 7

qgallouedec/online-dpo-qwen2-4

Text Generation • 0.5B • Updated Sep 25, 2024 • 7

qgallouedec/online-dpo-qwen2-2

Text Generation • 0.5B • Updated Sep 25, 2024 • 5

qgallouedec/online-dpo-qwen2-3

Text Generation • 0.5B • Updated Sep 25, 2024 • 4

qgallouedec/pythia-1b-tldr-online-dpo-peft

Updated Sep 15, 2024

qgallouedec/pythia-1b-tldr-online-dpo-no-peft

1B • Updated Sep 13, 2024 • 3

qgallouedec/pythia-1b-tldr-online-dpo

1B • Updated Sep 12, 2024 • 3

qgallouedec/llama-3.1-8b-ultrafeedback-online-dpo

Updated Sep 9, 2024

qgallouedec/my_hub_model_id

Text Generation • 0.3B • Updated Sep 9, 2024 • 3

qgallouedec/llama3.1-8b-sft

Updated Sep 9, 2024

qgallouedec/llama3.1-8b-instruct

Updated Sep 4, 2024

qgallouedec/online_dpo_uf_1

0.5B • Updated Aug 28, 2024 • 3

qgallouedec/online-dpo-qwen2-0.5B-lr-3e-7

0.5B • Updated Aug 27, 2024 • 3

qgallouedec/online-dpo-qwen2-0.5B-lr-3e-6

0.5B • Updated Aug 25, 2024 • 3

qgallouedec/kto-aligned-model

Text Generation • 2B • Updated Aug 22, 2024 • 3

qgallouedec/gpt2-imdb-pos-v2

Text Generation • 0.1B • Updated Aug 22, 2024 • 4

qgallouedec/EleutherAI_pythia-1b

Text Generation • 1B • Updated Aug 21, 2024 • 36

qgallouedec/reward_modeling_anthropic_hh

0.3B • Updated Aug 18, 2024 • 3

qgallouedec/reward_modeling_anthropic_hh_crc

0.3B • Updated Aug 17, 2024 • 3

qgallouedec/tmp

1B • Updated Aug 17, 2024 • 3

qgallouedec/sft_openassistant-guanaco

Updated Aug 5, 2024

qgallouedec/sft-llava-1.5-7b-hf

Updated Jul 24, 2024 • 3

qgallouedec/test

Updated Jul 23, 2024

qgallouedec/ppo-PushCube-v0

Reinforcement Learning • Updated Jun 20, 2024 • 7

qgallouedec/ppo-ReachCube-v0

Reinforcement Learning • Updated Jun 13, 2024 • 2

qgallouedec/tqc-ReachCube-v0

Reinforcement Learning • Updated Jun 13, 2024 • 2

qgallouedec/ppo-LiftCube-v0

Robotics • Updated Jun 10, 2024 • 2

qgallouedec/tqc-LiftCube-v0

Reinforcement Learning • Updated Jun 9, 2024 • 3

qgallouedec/wildvision-internal-data_formatted

Updated Jun 2, 2024