Jaro
JustJaro
AI & ML interests
GNNs, transformers, multimodal models, model explainability, graphs of knowledge, advanced prompting (graph of prompts, experts, etc.)
Recent Activity
updated a model 8 days ago: JustJaro/Arcee-Blitz_gptq_g32_4bit
updated a model 11 days ago: ConfidentialMind/Rombos-LLM-V2.6-Qwen-14b-GPTQ-G32-W4A16-KVFP8
published a model 11 days ago: ConfidentialMind/Rombos-LLM-V2.6-Qwen-14b-GPTQ-G32-W4A16-KVFP8
JustJaro's activity
Any plan for 8bit version?
2
#1 opened 22 days ago by jm4n21
the weights?
1
#1 opened 3 months ago by MaziyarPanahi

no weights?
8
#1 opened 3 months ago by KnutJaegersberg

Performance on MTEB
2
#4 opened 3 months ago by JustJaro

Apologies for no concrete evals - those will be coming in later quants as we set up the automations.
#1 opened 4 months ago by JustJaro

Un-censoring methods and effects on performance
1
#2 opened 4 months ago by JustJaro

Could you publish results compared to Sonnet 3.5?
3
#3 opened 8 months ago by JustJaro

Safetensor format
2
#14 opened 4 months ago by JustJaro

How to inference the model?
2
#3 opened 5 months ago by frankgu3528

support vllm
3
#10 opened 6 months ago by CarrotAI

Multi-token Prediction Models for Large Language Models: Code and Discussion
2
#2 opened 9 months ago by ashishpatel26
