I want to use this model to run my code
#33 opened about 4 hours ago
by
yunxi0827

Qwen3-32B-Base?
π
6
2
#32 opened 5 days ago
by
canac84073
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
#31 opened 7 days ago
by
VMoorjani
Add assistant mask support to Qwen3-32B
#30 opened 7 days ago
by
waleko

Setting 'enable_thinking=False' has no effect.
1
#29 opened 17 days ago
by
ktrocks
finetune question
1
#28 opened 20 days ago
by
Saicy
Qwen3ForCausalLM - Architecture issue
1
#26 opened 27 days ago
by
cr-gkn
Request to Release the Base Model for Qwen3-32B
β
π
12
#25 opened about 1 month ago
by
eramax

How to control thinking length?
β
6
2
#24 opened about 1 month ago
by
lidh15
Qwen3 does not deploy on Endpoints
#23 opened about 1 month ago
by
zenfiric

The model's instructions follow too poorly
β
1
3
#22 opened about 1 month ago
by
xldistance
Update README.md
#21 opened about 1 month ago
by
Logical-Transcendence84

please release AWQ version
#20 opened about 1 month ago
by
classdemo
Collections of Bad Cases User Reviews and Comments of Qwen3 32B model
#19 opened about 1 month ago
by
DeepNLP
Potential issue with large context sizes - can someone confirm?
15
#18 opened about 1 month ago
by
Thireus
Qwen 3 presence of tools affect output length?
#17 opened about 1 month ago
by
evetsagg
"/no_think" control is unstable
1
#16 opened about 1 month ago
by
Smorty100
LICENSE files missing
π
1
#14 opened about 1 month ago
by
johndoe2001
After setting /nothinking or enable_thinking=False, can the empty <thinking> tag be omitted from the response?
π
3
2
#13 opened about 1 month ago
by
pteromyini

Feedback: It's a good model, however it hallucinates very badly at local facts (Germany)
π
π
9
2
#12 opened about 1 month ago
by
Dampfinchen
The correct way of fine-tuning on multi-turn trajectories
π
8
1
#11 opened about 1 month ago
by
hr0nix
Providing a GPTQ version
π
3
12
#10 opened about 1 month ago
by
blueteamqq1
how to set, enable_thinking=False, on ollama
π
6
2
#9 opened about 1 month ago
by
TatsuhiroC
π[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Trainingπ
π
π₯
3
#7 opened about 1 month ago
by
study-hjt

Reasoning or Non-reasoning model?
4
#6 opened about 1 month ago
by
dipta007

Local Installation Video and Testing - Step by Step
#5 opened about 1 month ago
by
fahdmirzac

γEvaluationγBest practice for evaluating Qwen3 !!
π
π₯
5
#4 opened about 1 month ago
by
wangxingjun778

Base Model?
π
β
8
12
#3 opened about 1 month ago
by
Downtown-Case
Is this multimodal?
1
#2 opened about 1 month ago
by
pbarker

Add languages tag
π
2
#1 opened about 1 month ago
by
de-francophones
