Phil's picture

Phil

phil111

·

AI & ML interests

None yet

Recent Activity

new activity about 10 hours ago

rednote-hilab/dots.llm1.inst:Only 9.3 on the English SimpleQA despite 143b total parameters

new activity 7 days ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B:Tried it, but not good as expected.

new activity 12 days ago

deepseek-ai/DeepSeek-R1-0528:Benchmarks please

View all activity

Organizations

None yet

phil111's activity

New activity in rednote-hilab/dots.llm1.inst about 10 hours ago

Only 9.3 on the English SimpleQA despite 143b total parameters

#2 opened 3 days ago by

New activity in deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 7 days ago

Tried it, but not good as expected.

#11 opened 11 days ago by

New activity in deepseek-ai/DeepSeek-R1-0528 12 days ago

Benchmarks please

#20 opened 13 days ago by

New activity in nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1 15 days ago

Pruning doesn't appear to be viable.

#2 opened 15 days ago by

New activity in tiiuae/Falcon-H1-34B-Instruct 16 days ago

Thanks for not grossly overfitting this model.

#4 opened 16 days ago by

New activity in mistralai/Devstral-Small-2505 18 days ago

Thanks. Dedicated math and coding models is the only reasonable path forward.

#5 opened 20 days ago by

New activity in Qwen/Qwen3-30B-A3B-GGUF 25 days ago

Temp 0.7 is too high.

#1 opened 29 days ago by

New activity in PrimeIntellect/INTELLECT-2 27 days ago

Thanks for the effort and honesty.

#4 opened 29 days ago by

New activity in Qwen/Qwen3-30B-A3B 29 days ago

Qwen3 is great, but could be better.

#18 opened about 1 month ago by

New activity in google/gemma-3-27b-it about 1 month ago

Add reasoning capabilities for gemma 3

#66 opened about 1 month ago by

New activity in Qwen/Qwen3-235B-A22B about 1 month ago

Qwen is loosing broad knowledge since Qwen2.

#16 opened about 1 month ago by

New activity in Qwen/Qwen3-30B-A3B about 1 month ago

Very fast and powerful, but with one glaring weakness.

#6 opened about 1 month ago by

New activity in THUDM/GLM-4-32B-0414 about 2 months ago

SimpleQA Scores Are WAY off

#3 opened about 2 months ago by

New activity in meta-llama/Llama-4-Scout-17B-16E-Instruct about 2 months ago

Less Knowledge Than Llama 3.3 70b?

#60 opened about 2 months ago by

New activity in meta-llama/Llama-4-Scout-17B-16E-Instruct 2 months ago

13 B and34 B Pleeease!!! Most people cannot even run this.

#52 opened 2 months ago by

UniversalLove333

New activity in deepseek-ai/DeepSeek-V3-0324 3 months ago

SimpleQA?

#29 opened 3 months ago by

New activity in mistralai/Mistral-Small-24B-Instruct-2501 4 months ago

This Mistral Small has FAR less knowledge than the last.

#5 opened 4 months ago by

This Mistral Small has FAR less knowledge than the last.

#5 opened 4 months ago by

This Mistral Small has FAR less knowledge than the last.

#5 opened 4 months ago by