Zeze Nene
Neman
AI & ML interests
LLM, evolutionary programming, AI
Recent Activity
liked a model 2 days ago
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
liked a model 5 days ago
google/medgemma-4b-it
liked a model 9 days ago
ngxson/MiMo-VL-7B-RL-GGUF
Organizations
None yet
Neman's activity
Distill
🔥
❤️
2
5
#17 opened 11 days ago by Neman

Problem with demo code using pipeline
1
#2 opened 4 months ago by Neman

unknown pre-tokenizer type: 'deepseek-r1-qwen'
🔥
1
7
#1 opened 5 months ago by Neman

unknown pre-tokenizer type: 'deepseek-r1-qwen'
4
2
#1 opened 5 months ago by Neman

safetensors size
4
#1 opened 5 months ago by Neman

What ViT?
2
#2 opened about 1 year ago by Neman

4-bit quant?
2
#3 opened about 1 year ago by Neman

Base or Chat?
2
#1 opened about 1 year ago by Neman

NameError: name 'flash_attn_func' is not defined
2
#4 opened over 1 year ago by Neman

'QWenTokenizer' object has no attribute 'IMAGE_ST'
4
#1 opened over 1 year ago by Neman

Will it come?
21
#2 opened over 1 year ago by Neman

ImportError: cannot import name 'SeamlessM4TModel' from 'transformers'
3
#13 opened over 1 year ago by Neman

Question What are the results for image captioning for fuyu-8b in comparison to other models?
1
1
#8 opened over 1 year ago by Said2k
What are the memory requirements for running the model?
9
#6 opened over 1 year ago by joanfihu
gguf variant?
1
#1 opened over 1 year ago by scrawnyether