3 10 7

David Newman

darthhexx

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

zai-org/GLM-4.5-Air-FP8

upvoted a paper 4 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

liked a model 4 months ago

OpenGVLab/InternVL3-78B

View all activity

Organizations

liked a model 5 days ago

zai-org/GLM-4.5-Air-FP8

Text Generation • 111B • Updated 7 days ago • 8.17k • • 39

upvoted a paper 4 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 280

liked a model 4 months ago

OpenGVLab/InternVL3-78B

Image-Text-to-Text • 78B • Updated May 29 • 106k • 210

upvoted a collection 4 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 595

upvoted a paper 4 months ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 121

upvoted 2 papers 5 months ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 175

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 105

updated 2 models 6 months ago

darthhexx/Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • 8B • Updated Feb 12 • 6

darthhexx/Qwen2.5-VL-3B-Instruct-FP8-Dynamic

Image-Text-to-Text • 4B • Updated Feb 11 • 9

published a model 6 months ago

darthhexx/Qwen2.5-VL-3B-Instruct-FP8-Dynamic

Image-Text-to-Text • 4B • Updated Feb 11 • 9

updated a model 6 months ago

darthhexx/Meta-Llama-3-8B-Instruct-FP8-Dynamic

Text Generation • 8B • Updated Feb 5 • 3

published a model 6 months ago

darthhexx/Meta-Llama-3-8B-Instruct-FP8-Dynamic

Text Generation • 8B • Updated Feb 5 • 3

updated a model 6 months ago

darthhexx/Qwen2.5-VL-7B-Instruct-FP8-Dynamic

Image-Text-to-Text • 8B • Updated Feb 5 • 20

published a model 6 months ago

darthhexx/Qwen2.5-VL-7B-Instruct-FP8-Dynamic

Image-Text-to-Text • 8B • Updated Feb 5 • 20

liked a model 10 months ago

Qwen/Qwen2.5-72B-Instruct-AWQ

Text Generation • 12B • Updated Oct 9, 2024 • 52.1k • 72

updated 2 models about 1 year ago

darthhexx/Phi-3-medium-128k-instruct-fp8

Text Generation • 14B • Updated Jul 22, 2024 • 3

darthhexx/Meta-Llama-3-8B-Instruct-FP8

Text Generation • 8B • Updated Jul 22, 2024 • 3

New activity in darthhexx/Phi-3-medium-128k-instruct-fp8 about 1 year ago

Delete pytorch_model.bin

#2 opened about 1 year ago by

darthhexx

updated a model about 1 year ago

darthhexx/Phi-3-medium-128k-instruct-awq

Text Generation • 2B • Updated Jul 10, 2024 • 2

New activity in darthhexx/Phi-3-medium-128k-instruct-fp8 about 1 year ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

darthhexx

David Newman

AI & ML interests

Recent Activity

Organizations

darthhexx's activity

Delete pytorch_model.bin

Adding `safetensors` variant of this model