In a Training Loop 🔄

Lê Võ Quyết Thắng

thangvip

https://vualidon.icu

AI & ML interests

Adapting LLMs to specific domains

Recent Activity

New activity in thangvip/vwen-0.5 13 days ago

Question about thangvip/vwen-0.5 and its base model

#1 opened about 1 month ago by

updated a model 25 days ago

thangvip/paec-qwen3-1.7b-prefixes

Updated 25 days ago

published a model 25 days ago

thangvip/paec-qwen3-1.7b-prefixes

Updated 25 days ago

updated a model about 1 month ago

thangvip/qwen2.5-1.5b-gspo-sgd-linear

Text Generation • 2B • Updated Mar 2 • 8

published a model about 1 month ago

thangvip/qwen2.5-1.5b-gspo-sgd-linear

Text Generation • 2B • Updated Mar 2 • 8

updated a model about 1 month ago

thangvip/qwen2.5-1.5b-seq-dspo-sgd-linear

Text Generation • 2B • Updated Feb 28 • 64

published 2 models about 2 months ago

thangvip/qwen2.5-1.5b-seq-dspo-sgd-linear

Text Generation • 2B • Updated Feb 28 • 64

thangvip/qwen2.5-1.5b-dspo-sgd-linear-5e

updated a model about 2 months ago

thangvip/qwen2.5-1.5b-grpo-sgd-linear

Text Generation • 2B • Updated Feb 18 • 38

published a model about 2 months ago

thangvip/qwen2.5-1.5b-grpo-sgd-linear

Text Generation • 2B • Updated Feb 18 • 38

updated a model about 2 months ago

thangvip/qwen2.5-1.5b-grpo-no-sft-sgd-linear

Text Generation • 2B • Updated Feb 17 • 8

published a model about 2 months ago

thangvip/qwen2.5-1.5b-grpo-no-sft-sgd-linear

Text Generation • 2B • Updated Feb 17 • 8

updated a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-13000-steps

Text Generation • 2B • Updated Feb 16 • 2

published a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-13000-steps

Text Generation • 2B • Updated Feb 16 • 2

updated a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-12800-steps

Text Generation • 2B • Updated Feb 16 • 1

published a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-12800-steps

Text Generation • 2B • Updated Feb 16 • 1

updated a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-12600-steps

Text Generation • 2B • Updated Feb 16 • 4

published a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-12600-steps

Text Generation • 2B • Updated Feb 16 • 4

updated a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-12400-steps

Text Generation • 2B • Updated Feb 16 • 1

published a model about 2 months ago

thangvip/qwen2.5-1.5b-dspo-no-sft-sgd-linear-steps-12400-steps

Text Generation • 2B • Updated Feb 16 • 1