Fine-tune of the Qwen-2.5-7B model on a dump of DTF posts and comments.
Nikita Sushko
chameleon-lizard
AI & ML interests
NLP, Multilingual Models, Multiagent Systems
Recent Activity
upvoted a paper 4 days ago
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
upvoted an article 5 days ago
SmolLM3: smol, multilingual, long-context reasoner