51 1

Big Deeper

BigDeeper

AI & ML interests

Differentiable hashing, orthonormal polynomial language modeling, image compression into language representations.

Recent Activity

new activity about 2 months ago

black-forest-labs/FLUX.1-Kontext-dev:What are my options to run it on multiple GPUs?

new activity 3 months ago

unsloth/medgemma-27b-text-it:Says image-text to text

new activity 3 months ago

nvidia/parakeet-tdt-0.6b-v2:Does this model identifies speaker?

View all activity

Organizations

None yet

New activity in black-forest-labs/FLUX.1-Kontext-dev about 2 months ago

What are my options to run it on multiple GPUs?

#32 opened about 2 months ago by

BigDeeper

New activity in unsloth/medgemma-27b-text-it 3 months ago

Says image-text to text

#2 opened 3 months ago by

BigDeeper

New activity in nvidia/parakeet-tdt-0.6b-v2 3 months ago

Does this model identifies speaker?

👀 1

#16 opened 3 months ago by

SouravAhmed

Is the model capable of splitting different speakers?

👀 1

#29 opened 3 months ago by

BigDeeper

liked a model 5 months ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27 • 478k • • 3.95k

New activity in ByteDance/LatentSync 6 months ago

Very large RAM foot print.

#1 opened 7 months ago by

BigDeeper

New activity in brittlewis12/s1-32B-GGUF 6 months ago

THE q8_0 version appears to go on and on indefinitely.

#1 opened 6 months ago by

BigDeeper

New activity in ndkhanh95/LatentSync 7 months ago

Having a problem. Unable to find a suitable output format for 'video_out.mp4

#1 opened 7 months ago by

BigDeeper

New activity in chunyu-li/LatentSync 7 months ago

Any ideas how to mitigate this problem?

#3 opened 7 months ago by

BigDeeper

New activity in Lightricks/LTX-Video 8 months ago

Longer video?

#25 opened 9 months ago by

BigDeeper

New activity in Lightricks/LTX-Video 9 months ago

What minimal VRAM does it require?

#18 opened 9 months ago by

DrNicefellow

New activity in Qwen/Qwen2.5-Coder-32B-Instruct 9 months ago

VSCODE + Cline + Ollama + Qwen2.5-Coder-32B-Instruct.Q8_0

#20 opened 9 months ago by

BigDeeper

New activity in black-forest-labs/FLUX.1-dev about 1 year ago

comfyui does not recognize model files in sft format

👍 👀 4

#18 opened about 1 year ago by

peidong

New activity in bigscience/bloomz-3b about 1 year ago

Are there advantages or disadvantages in changing the format for translation?

#10 opened about 1 year ago by

BigDeeper

New activity in QuantFactory/Meta-Llama-3-120B-Instruct-GGUF over 1 year ago

What does 120B really mean?

#1 opened over 1 year ago by

BigDeeper

New activity in meta-llama/Meta-Llama-3-70B over 1 year ago

Does anyone know which specific Python library contains the tokenizer that was used to train Llama-3-70b?

👍 1

#11 opened over 1 year ago by

BigDeeper

15 TeraTokens = 190 Million books

#4 opened over 1 year ago by

Languido

New activity in meta-llama/Meta-Llama-3-8B over 1 year ago

I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'

#117 opened over 1 year ago by

aniiikket11

New activity in dphn/dolphin-2.9-llama3-8b-gguf over 1 year ago

Has anyone tried this gguf with agentic framework?

#6 opened over 1 year ago by

BigDeeper

New activity in microsoft/Phi-3-mini-128k-instruct over 1 year ago

gguf

#24 opened over 1 year ago by

LaferriereJC

Big Deeper

AI & ML interests

Recent Activity

Organizations

BigDeeper's activity

What are my options to run it on multiple GPUs?

Says image-text to text

Does this model identifies speaker?

Is the model capable of splitting different speakers?

Very large RAM foot print.

THE q8_0 version appears to go on and on indefinitely.

Having a problem. Unable to find a suitable output format for 'video_out.mp4

Any ideas how to mitigate this problem?

Longer video?

What minimal VRAM does it require?

VSCODE + Cline + Ollama + Qwen2.5-Coder-32B-Instruct.Q8_0

comfyui does not recognize model files in sft format

Are there advantages or disadvantages in changing the format for translation?

What does 120B really mean?

Does anyone know which specific Python library contains the tokenizer that was used to train Llama-3-70b?

15 TeraTokens = 190 Million books

I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'

Has anyone tried this gguf with agentic framework?

gguf