Why is there a chat template for a base model?
#11 opened 16 days ago
by
Winterjitheshpavan
Add assistant mask support to Qwen3-4B
#9 opened 28 days ago
by
waleko

UnslothVisionDataCollator problem
2
#8 opened about 2 months ago
by
orkungedik

Translation task in low-resource language can be done pretty well
#7 opened about 2 months ago
by
luweigen
Why are the new 4B and 8B models slower than the previous 7B-1M model??
3
#6 opened about 2 months ago
by
stev236
Collections of Qwen3 4B model Bad Cases User Reviews and Comments
😔
1
#5 opened 2 months ago
by
DeepNLP
YaRN: is "performance" referring to quality or speed?
👀
1
#4 opened 2 months ago
by
kmouratidis

Use the more common reverse filter in template
#3 opened 2 months ago
by
tahayassine

【Evaluation】Best practice for evaluating Qwen3 !!
🔥
🚀
2
#2 opened 2 months ago
by
wangxingjun778

Add languages tag
#1 opened 2 months ago
by
de-francophones
