Phil
phil111
AI & ML interests
None yet
Recent Activity
new activity
about 10 hours ago
rednote-hilab/dots.llm1.inst:Only 9.3 on the English SimpleQA despite 143b total parameters
new activity
7 days ago
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B:Tried it, but not good as expected.
new activity
12 days ago
deepseek-ai/DeepSeek-R1-0528:Benchmarks please
Organizations
None yet
phil111's activity
Only 9.3 on the English SimpleQA despite 143b total parameters
5
#2 opened 3 days ago
by
phil111
Tried it, but not good as expected.
3
#11 opened 11 days ago
by
kk3dmax
Benchmarks please
10
#20 opened 13 days ago
by
Blazgo

Pruning doesn't appear to be viable.
๐
3
#2 opened 15 days ago
by
phil111
Thanks for not grossly overfitting this model.
โค๏ธ
๐
7
1
#4 opened 16 days ago
by
phil111
Thanks. Dedicated math and coding models is the only reasonable path forward.
๐ฅ
5
7
#5 opened 20 days ago
by
phil111
Temp 0.7 is too high.
2
#1 opened 29 days ago
by
phil111
Thanks for the effort and honesty.
๐ค
๐
3
#4 opened 29 days ago
by
phil111
Qwen3 is great, but could be better.
๐
7
21
#18 opened about 1 month ago
by
phil111
Add reasoning capabilities for gemma 3
4
#66 opened about 1 month ago
by
devopsML

Qwen is loosing broad knowledge since Qwen2.
๐
๐ฅ
11
14
#16 opened about 1 month ago
by
phil111
Very fast and powerful, but with one glaring weakness.
๐
1
2
#6 opened about 1 month ago
by
phil111
SimpleQA Scores Are WAY off
๐ฅ
5
5
#3 opened about 2 months ago
by
phil111
Less Knowledge Than Llama 3.3 70b?
๐
2
5
#60 opened about 2 months ago
by
phil111
13 B and34 B Pleeease!!! Most people cannot even run this.
๐
โค๏ธ
4
4
#52 opened 2 months ago
by
UniversalLove333
SimpleQA?
๐
8
3
#29 opened 3 months ago
by
phil111
This Mistral Small has FAR less knowledge than the last.
๐ฅ
5
20
#5 opened 4 months ago
by
phil111
This Mistral Small has FAR less knowledge than the last.
๐ฅ
5
20
#5 opened 4 months ago
by
phil111
This Mistral Small has FAR less knowledge than the last.
๐ฅ
5
20
#5 opened 4 months ago
by
phil111