Bjorn Melin
BjornMelin
AI & ML interests
Large Language Models, AI Agents, Multi-Agent Orchestrations, Deep Learning, NLP, Local LLM Optimization.
Recent Activity
updated a collection about 20 hours ago: Smol Models
liked a model about 20 hours ago: nvidia/NVIDIA-Nemotron-Nano-9B-v2
liked a model about 20 hours ago: deepseek-ai/DeepSeek-V3.1
Organizations
None yet
Datasets
Fine Tuning
Legendary VL Models
Smol Models
My favorite smaller models under 10B parameters.
- unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
  Text Generation • 8B • Updated • 320k • 292
- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
  Text Generation • 8B • Updated • 218k • 199
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
  Text Generation • 8B • Updated • 813k • 788
- Qwen/Qwen2.5-Coder-7B-Instruct
  Text Generation • 8B • Updated • 359k • 530
Llama
- MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
  Text Generation • 3B • Updated • 153k • 14
- meta-llama/Llama-3.2-3B-Instruct
  Text Generation • 3B • Updated • 1.9M • 1.67k
- meta-llama/Llama-3.1-8B-Instruct
  Text Generation • 8B • Updated • 14.3M • 4.5k
- MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
  Text Generation • 8B • Updated • 157k • 25
LLMs
- deepseek-ai/DeepSeek-V3
  Text Generation • 685B • Updated • 457k • 3.96k
- sentence-transformers/static-retrieval-mrl-en-v1
  Sentence Similarity • Updated • 45
- internlm/internlm3-8b-instruct
  Text Generation • 9B • Updated • 83.6k • 227
- NovaSky-AI/Sky-T1-32B-Preview
  Text Generation • 33B • Updated • 1.48k • 549
Embedding Models
Single 4090 Laptop GPU
- nvidia/OpenReasoning-Nemotron-32B
  Text Generation • 33B • Updated • 3.39k • 109
- Qwen/Qwen3-32B-AWQ
  Text Generation • 6B • Updated • 473k • 100
- all-hands/openhands-lm-32b-v0.1
  Text Generation • 33B • Updated • 2.73k • 388
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
  Text Generation • 15B • Updated • 218k • 552
Leaderboards
- smolagents LLM leaderboard
  Running • 137
  🏆 A leaderboard for LLMs powering smolagents
- LLM Performance Leaderboard
  Running • 370
  🐨 View LLM performance rankings
- Low-bit Quantized Open LLM Leaderboard
  Running • 180
  🏆 Track, rank and evaluate open LLMs and chatbots
- UGI Leaderboard
  Running • 1.03k
  📢 Uncensored General Intelligence Leaderboard
Coding Models
Google
- google/gemma-3-27b-it-qat-q4_0-gguf
  Image-Text-to-Text • 27B • Updated • 7.24k • 326
- unsloth/gemma-3-27b-it-GGUF
  Image-Text-to-Text • 27B • Updated • 54.2k • 150
- google/gemma-3-27b-it
  Image-Text-to-Text • 27B • Updated • 523k • 1.57k
- google/gemma-3n-E4B-it
  Image-Text-to-Text • 8B • Updated • 105k • 734
Qwen
Rerankers
- Qwen/Qwen3-Reranker-0.6B
  Text Ranking • 0.6B • Updated • 286k • 209
- jinaai/jina-reranker-m0
  Text Classification • 2B • Updated • 23k • 105
- jinaai/jina-reranker-v2-base-multilingual
  Text Classification • 0.3B • Updated • 1.29M • 298
- jinaai/jina-embeddings-v2-base-en
  Feature Extraction • 0.1B • Updated • 144k • 726