Ramesh
rameshch
·
AI & ML interests
None yet
Recent Activity
liked
a model
21 days ago
Qwen/Qwen2.5-VL-32B-Instruct
new activity
21 days ago
google/gemma-3-27b-it:Tokens generated per second
Organizations
rameshch's activity
Tokens generated per second
1
3
#39 opened 22 days ago
by
rameshch
Thank You for Open-Sourcing Your Model & Feedback
1
#4 opened 22 days ago
by
rameshch
How do we use it with Transformers? can you give some sample code ?
9
#22 opened 28 days ago
by
rameshch
Error(s) in loading state_dict for PeftModelForCausalLM:
2
#23 opened 6 months ago
by
rameshch
Is it possible to merge MiniCPM-Llama3-V-2-5 with a Llama-3-1 based model using MOE
10
#68 opened 8 months ago
by
rameshch
llava-Onevision-projector for LLama-3.1-8B Model
1
#4 opened 8 months ago
by
rameshch
RuntimeError: only Tensors of floating point dtype can require gradients
1
#69 opened 8 months ago
by
rameshch
Is it possible to merge MiniCPM-Llama3-V-2-5 with a Llama-3-1 based model using MOE
10
#68 opened 8 months ago
by
rameshch
Is it possible to merge MiniCPM-Llama3-V-2-5 with a Llama-3-1 based model using MOE
10
#68 opened 8 months ago
by
rameshch