Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
31
1
38
Phil
phil111
Follow
nlpguy's profile picture
altomek's profile picture
esselte974's profile picture
9 followers
·
9 following
AI & ML interests
None yet
Recent Activity
new
activity
about 18 hours ago
internlm/internlm3-8b-instruct:
English tests and tasks are absurdly overfit.
new
activity
8 days ago
microsoft/phi-4:
A heavily filtered corpus simply doesn't work.
new
activity
8 days ago
microsoft/phi-4:
I Don't Understand This Model
View all activity
Organizations
None yet
phil111
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
internlm/internlm3-8b-instruct
about 18 hours ago
English tests and tasks are absurdly overfit.
21
#8 opened 3 days ago by
phil111
New activity in
microsoft/phi-4
8 days ago
A heavily filtered corpus simply doesn't work.
4
#19 opened 8 days ago by
phil111
I Don't Understand This Model
16
#9 opened 10 days ago by
phil111
New activity in
matteogeniaccio/phi-4
21 days ago
Notably better than Phi3.5 in many ways, but something is wrong.
8
#5 opened about 1 month ago by
phil111
liked
a model
22 days ago
deepseek-ai/DeepSeek-V3
Updated
19 days ago
•
148k
•
2.01k
New activity in
deepseek-ai/DeepSeek-V3-Base
22 days ago
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
2
#27 opened 22 days ago by
phil111
liked
a model
22 days ago
deepseek-ai/DeepSeek-V3-Base
Updated
19 days ago
•
16.1k
•
1.27k
New activity in
NyxKrage/Microsoft_Phi-4
26 days ago
SimpleQA score
2
#1 opened about 1 month ago by
frappuccino
New activity in
ibm-granite/granite-3.1-8b-instruct
28 days ago
Exceptional creative writer
5
#1 opened 30 days ago by
SubtleOne
liked
2 models
28 days ago
ibm-granite/granite-3.1-8b-instruct
Text Generation
•
Updated
30 days ago
•
47.6k
•
122
QuantFactory/granite-3.1-8b-instruct-GGUF
Text Generation
•
Updated
30 days ago
•
2.32k
•
7
New activity in
tiiuae/Falcon3-7B-Instruct
29 days ago
Very High English MMLU scores, Yet Extremely Low Broad English Knowledge
2
#8 opened 30 days ago by
phil111
New activity in
CohereForAI/c4ai-command-r7b-12-2024
about 1 month ago
How was r7b?
6
#3 opened about 1 month ago by
MRU4913
Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks
12
#1 opened about 1 month ago by
Fizzarolli
New activity in
meta-llama/Llama-3.3-70B-Instruct
about 1 month ago
local Llama + GPU(cuda)
7
#34 opened about 1 month ago by
Luciolla
Base Model?
3
#32 opened about 1 month ago by
User8213
New activity in
open-llm-leaderboard/open_llm_leaderboard
about 1 month ago
Add Hymba-1.5B to the leaderboard
3
#1030 opened about 1 month ago by
pmolchanov
liked
2 models
about 1 month ago
lmstudio-community/Llama-3.3-70B-Instruct-GGUF
Text Generation
•
Updated
Dec 6, 2024
•
72.2k
•
43
meta-llama/Llama-3.3-70B-Instruct
Text Generation
•
Updated
28 days ago
•
473k
•
•
1.67k
liked
a model
about 2 months ago
mistralai/Mistral-Large-Instruct-2411
Updated
Nov 19, 2024
•
2.26M
•
189
Load more