Praful Mohanan
Praful932
AI & ML interests
smol and fast llms + open source + low level design
Recent Activity
upvoted
an
article
about 1 month ago
Mixture of Experts Explained
liked
a model
about 1 month ago
deepseek-ai/DeepSeek-R1
Organizations
Praful932's activity
update pad token & eos token
#5 opened 12 months ago
by
Praful932

update model file base name
#1 opened over 1 year ago
by
Praful932

Zero Training Loss while finetuning the model for summarization
1
#14 opened over 1 year ago
by
Praful932

Source data not found
6
#5 opened almost 2 years ago
by
keremturgutlu