Ramesh

rameshch

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

ContactDoctor/Bio-Medical-Llama-3-3-8B

new activity 3 months ago

ContactDoctor/Bio-Medical-ContactDoctorVLLM-14B-V1-102025:Gguf request

liked a model 3 months ago

ContactDoctor/Bio-Medical-ContactDoctorVLLM-8B-V1-102025

Organizations

New activity in ContactDoctor/Bio-Medical-ContactDoctorVLLM-14B-V1-102025 3 months ago

Gguf request

#1 opened 3 months ago by

New activity in XiaomiMiMo/MiMo-VL-7B-RL 7 months ago

Is there a way to disable or turn off the thinking process? Additionally, when asked about itself, it responds by saying, "I am ChatGPT from OpenAI."

#5 opened 7 months ago by

New activity in Qwen/Qwen2.5-VL-32B-Instruct-AWQ 7 months ago

RuntimeError: expected mat1 and mat2 to have the same dtype, but got: struct c10::Half != struct c10::BFloat16

#9 opened 9 months ago by

New activity in Qwen/Qwen2.5-Omni-3B 8 months ago

EOS_TOKEN_ID ?

#6 opened 8 months ago by

New activity in google/gemma-3-27b-it 10 months ago

Tokens generated per second

#39 opened 10 months ago by

New activity in Qwen/Qwen2.5-VL-32B-Instruct 10 months ago

Thank You for Open-Sourcing Your Model & Feedback

#4 opened 10 months ago by

New activity in mistralai/Mistral-Small-3.1-24B-Instruct-2503 10 months ago

How do we use it with Transformers? can you give some sample code ?

#22 opened 10 months ago by

New activity in meta-llama/Llama-3.2-1B-Instruct about 1 year ago

Error(s) in loading state_dict for PeftModelForCausalLM:

#23 opened about 1 year ago by

New activity in openbmb/MiniCPM-Llama3-V-2_5 over 1 year ago

Is it possible to merge MiniCPM-Llama3-V-2-5 with a Llama-3-1 based model using MOE

#68 opened over 1 year ago by

New activity in lmms-lab/llava-onevision-projectors over 1 year ago

llava-Onevision-projector for LLama-3.1-8B Model

#4 opened over 1 year ago by

New activity in openbmb/MiniCPM-Llama3-V-2_5 over 1 year ago

RuntimeError: only Tensors of floating point dtype can require gradients

#69 opened over 1 year ago by
