Chris Scott

getfit

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

rednote-hilab/dots.llm1.inst:Yarn context

liked a model 10 days ago

ResembleAI/chatterbox

updated a model 12 days ago

getfit/orpheus-3b-0.1-ft-FP8-Dynamic

Organizations

None yet

getfit's activity

New activity in rednote-hilab/dots.llm1.inst 1 day ago

Yarn context

#3 opened 1 day ago by

New activity in Ithanil/Llama-3_1-Nemotron-Ultra-253B-v1-W8A8-Dynamic 17 days ago

Quantization question

#1 opened 26 days ago by

New activity in Qwen/Qwen3-235B-A22B-GPTQ-Int4 23 days ago

I get errors trying to deploy this in vllm or sglang.

#1 opened 27 days ago by

New activity in cognitivecomputations/Qwen3-235B-A22B-AWQ about 1 month ago

VLLM, SGLANG

#1 opened about 1 month ago by

New activity in fastllm/Qwen3-235B-A22B-INT4MIX about 1 month ago

How was this made? Quant configuration? Have you deployed this with SGLANG or vllm ?

#1 opened about 1 month ago by

New activity in justinjja/Qwen3-235B-A22B-INT4-W4A16 about 1 month ago

Slow inference on vLLM

#1 opened about 1 month ago by

New activity in mlx-community/DeepSeek-V3-0324-4bit about 2 months ago

Larger version?

#2 opened 2 months ago by

New activity in meta-llama/Llama-4-Scout-17B-16E-Instruct 2 months ago

FP8 weights

#41 opened 2 months ago by

Thank you!, Is it possible to run this with vLLM or sglang ?

#18 opened 2 months ago by

No one with a consumer grade GPU (< 32 vram) can run the lower L4 model... 😓

#20 opened 2 months ago by UniversalLove333

New activity in Qwen/QwQ-32B 3 months ago

missing opening <think>

#4 opened 3 months ago by

New activity in qihoo360/TinyR1-32B-Preview 3 months ago

Output repeating

#1 opened 3 months ago by

New activity in Qwen/Qwen2.5-Coder-7B-Instruct 9 months ago

32b Coder

#5 opened 9 months ago by

New activity in Qwen/Qwen2.5-72B-Instruct 9 months ago

There's a HUGE drop in popular knowledge from v2 to v2.5.

#1 opened 9 months ago by