Big Deeper
BigDeeper
1 follower · 0 following
AI & ML interests
Differentiable hashing, orthonormal polynomial language modeling, image compression into language representations.
Recent Activity
new
activity
12 days ago
unsloth/medgemma-27b-text-it:
Says image-text to text
new
activity
24 days ago
nvidia/parakeet-tdt-0.6b-v2:
Does this model identifies speaker?
new
activity
24 days ago
nvidia/parakeet-tdt-0.6b-v2:
Is the model capable of splitting different speakers?
Organizations
None yet
BigDeeper's activity
New activity in
unsloth/medgemma-27b-text-it
12 days ago
Says image-text to text
#2 opened 12 days ago by
BigDeeper
New activity in
nvidia/parakeet-tdt-0.6b-v2
24 days ago
Does this model identifies speaker?
1
8
#16 opened about 1 month ago by
SouravAhmed
Is the model capable of splitting different speakers?
1
1
#29 opened 24 days ago by
BigDeeper
liked
a model
2 months ago
deepseek-ai/DeepSeek-V3
Text Generation • Updated Mar 27 • 2.42M • 3.87k
New activity in
ByteDance/LatentSync
3 months ago
Very large RAM foot print.
4
#1 opened 5 months ago by
BigDeeper
New activity in
brittlewis12/s1-32B-GGUF
4 months ago
THE q8_0 version appears to go on and on indefinitely.
6
#1 opened 4 months ago by
BigDeeper
New activity in
ndkhanh95/LatentSync
5 months ago
Having a problem. Unable to find a suitable output format for 'video_out.mp4
#1 opened 5 months ago by
BigDeeper
New activity in
chunyu-li/LatentSync
5 months ago
Any ideas how to mitigate this problem?
#3 opened 5 months ago by
BigDeeper
New activity in
Lightricks/LTX-Video
6 months ago
Longer video?
6
#25 opened 6 months ago by
BigDeeper
What minimal VRAM does it require?
12
#18 opened 7 months ago by
DrNicefellow
New activity in
Qwen/Qwen2.5-Coder-32B-Instruct
7 months ago
VSCODE + Cline + Ollama + Qwen2.5-Coder-32B-Instruct.Q8_0
3
#20 opened 7 months ago by
BigDeeper
New activity in
black-forest-labs/FLUX.1-dev
10 months ago
comfyui does not recognize model files in sft format
4
5
#18 opened 10 months ago by
peidong
New activity in
bigscience/bloomz-3b
11 months ago
Are there advantages or disadvantages in changing the format for translation?
3
#10 opened 11 months ago by
BigDeeper
New activity in
QuantFactory/Meta-Llama-3-120B-Instruct-GGUF
about 1 year ago
What does 120B really mean?
3
#1 opened about 1 year ago by
BigDeeper
New activity in
meta-llama/Meta-Llama-3-70B
about 1 year ago
Does anyone know which specific Python library contains the tokenizer that was used to train Llama-3-70b?
1
2
#11 opened about 1 year ago by
BigDeeper
15 TeraTokens = 190 Million books
2
#4 opened about 1 year ago by
Languido
New activity in
meta-llama/Meta-Llama-3-8B
about 1 year ago
I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'
4
#117 opened about 1 year ago by
aniiikket11
New activity in
cognitivecomputations/dolphin-2.9-llama3-8b-gguf
about 1 year ago
Has anyone tried this gguf with agentic framework?
3
#6 opened about 1 year ago by
BigDeeper
New activity in
microsoft/Phi-3-mini-128k-instruct
about 1 year ago
gguf
30
#24 opened about 1 year ago by
LaferriereJC
New activity in
pjh64/Phi-3-mini-128K-Instruct.gguf
about 1 year ago
How did you manage to produce gguf files, when llama.cpp/convert.py gives an error about the ROPE encoding?
4
#1 opened about 1 year ago by
BigDeeper