Jintao Huang's picture

Jintao Huang

study-hjt

·

https://github.com/Jintao-Huang

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

Qwen/Qwen3-235B-A22B-Instruct-2507

new activity about 2 months ago

Tongyi-Zhiwen/QwenLong-L1-32B:provide int4 version pls

new activity 3 months ago

Qwen/Qwen3-235B-A22B:GPTQ/AWQ

View all activity

Organizations

liked a model about 15 hours ago

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated 1 day ago • 1.32k • • 304

New activity in Tongyi-Zhiwen/QwenLong-L1-32B about 2 months ago

provide int4 version pls

#2 opened about 2 months ago by

New activity in Qwen/Qwen3-235B-A22B 3 months ago

GPTQ/AWQ

#3 opened 3 months ago by

New activity in Qwen/Qwen3-30B-A3B 3 months ago

AWQ quantized model support timeline?

#12 opened 3 months ago by

New activity in Qwen/Qwen3-235B-A22B 3 months ago

🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋

#6 opened 3 months ago by

New activity in Qwen/Qwen3-30B-A3B 3 months ago

🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋

#3 opened 3 months ago by

New activity in Qwen/Qwen3-32B 3 months ago

🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋

#7 opened 3 months ago by

New activity in Qwen/Qwen3-8B 3 months ago

🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋

#3 opened 3 months ago by

New activity in Qwen/Qwen2.5-Omni-7B 3 months ago

[Fine-tuning] 🚀SFT/DPO/GRPO support!

#20 opened 4 months ago by

New activity in microsoft/Phi-4-multimodal-instruct 5 months ago

thanks , how to fine tune?

#1 opened 5 months ago by

upvoted a paper 7 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 369

updated a model 11 months ago

study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4

Text Generation • 17B • Updated Aug 14, 2024 • 9 • 2

updated a dataset about 1 year ago

modelscope/self-cognition

Viewer • Updated Jun 8, 2024 • 108 • 126 • 19

liked a dataset about 1 year ago

modelscope/self-cognition

Viewer • Updated Jun 8, 2024 • 108 • 126 • 19

liked 3 models about 1 year ago

study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4

Text Generation • 11B • Updated Apr 23, 2024 • 8 • 6

study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int8

Text Generation • 3B • Updated Apr 23, 2024 • 7 • 2

study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int8

Text Generation • 20B • Updated Apr 23, 2024 • 10 • 2

updated 2 models about 1 year ago

study-hjt/Qwen1.5-110B-Chat-AWQ

Text Generation • 17B • Updated Apr 27, 2024 • 9

study-hjt/Qwen1.5-110B-Chat-GPTQ-Int8

Text Generation • 31B • Updated Apr 27, 2024 • 11

liked a model about 1 year ago

study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4

Text Generation • 17B • Updated Aug 14, 2024 • 9 • 2