Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
17
1
10
Jintao Huang
study-hjt
Follow
kk3dmax's profile picture
21world's profile picture
LighterDarkness's profile picture
9 followers
Β·
2 following
https://github.com/Jintao-Huang
AI & ML interests
None yet
Recent Activity
new
activity
11 days ago
Tongyi-Zhiwen/QwenLong-L1-32B:
provide int4 version pls
new
activity
about 1 month ago
Qwen/Qwen3-235B-A22B:
GPTQ/AWQ
new
activity
about 1 month ago
Qwen/Qwen3-30B-A3B:
AWQ quantized model support timeline?
View all activity
Organizations
study-hjt
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Tongyi-Zhiwen/QwenLong-L1-32B
11 days ago
provide int4 version pls
β
π
2
2
#2 opened 12 days ago by
Josh1026
New activity in
Qwen/Qwen3-235B-A22B
about 1 month ago
GPTQ/AWQ
π
12
4
#3 opened about 1 month ago by
ndurkee
New activity in
Qwen/Qwen3-30B-A3B
about 1 month ago
AWQ quantized model support timeline?
π
7
2
#12 opened about 1 month ago by
hyunw55
New activity in
Qwen/Qwen3-235B-A22B
about 1 month ago
π[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practicesπ
π
6
1
#6 opened about 1 month ago by
study-hjt
New activity in
Qwen/Qwen3-30B-A3B
about 1 month ago
π[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practicesπ
π
4
#3 opened about 1 month ago by
study-hjt
New activity in
Qwen/Qwen3-32B
about 1 month ago
π[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Trainingπ
π₯
π
3
#7 opened about 1 month ago by
study-hjt
New activity in
Qwen/Qwen3-8B
about 1 month ago
π[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Trainingπ
π
π
3
#3 opened about 1 month ago by
study-hjt
New activity in
Qwen/Qwen2.5-Omni-7B
about 1 month ago
[Fine-tuning] πSFT/DPO/GRPO support!
#20 opened 2 months ago by
study-hjt
New activity in
microsoft/Phi-4-multimodal-instruct
3 months ago
thanks , how to fine tune?
19
#1 opened 3 months ago by
NickyNicky
upvoted
a
paper
6 months ago
Qwen2.5 Technical Report
Paper
β’
2412.15115
β’
Published
Dec 19, 2024
β’
368
updated
a model
10 months ago
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
β’
Updated
Aug 14, 2024
β’
20
β’
2
updated
a dataset
about 1 year ago
modelscope/self-cognition
Viewer
β’
Updated
Jun 8, 2024
β’
108
β’
166
β’
19
liked
a dataset
about 1 year ago
modelscope/self-cognition
Viewer
β’
Updated
Jun 8, 2024
β’
108
β’
166
β’
19
liked
3 models
about 1 year ago
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
β’
Updated
Apr 23, 2024
β’
16
β’
6
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int8
Text Generation
β’
Updated
Apr 23, 2024
β’
15
β’
2
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int8
Text Generation
β’
Updated
Apr 23, 2024
β’
22
β’
2
updated
2 models
about 1 year ago
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation
β’
Updated
Apr 27, 2024
β’
19
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int8
Text Generation
β’
Updated
Apr 27, 2024
β’
13
liked
a model
about 1 year ago
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
β’
Updated
Aug 14, 2024
β’
20
β’
2
updated
a model
about 1 year ago
study-hjt/Qwen1.5-32B-Chat-GPTQ-Int8
Text Generation
β’
Updated
Apr 26, 2024
β’
12
β’
1
Load more