Srihari Thyagarajan

Haleshot

AI & ML interests

AI, ML, DL, CV, RecSys

Recent Activity

liked a Space 2 days ago
nanotron/ultrascale-playbook
updated a Space 3 days ago
Haleshot/sample2
updated a Space 3 days ago
Haleshot/sample1
View all activity

Organizations

Hugging Face Discord Community's profile picture Nerdy Face's profile picture

Haleshot's activity

reacted to merve's post with β€οΈπŸ‘ 6 days ago
view post
Post
4575
Your weekly recap of open AI is here, and it's packed with models! merve/feb-14-releases-67af876b404cc27c6d837767

πŸ‘€ Multimodal
> OpenGVLab released InternVideo 2.5 Chat models, new video LMs with long context
> AIDC released Ovis2 model family along with Ovis dataset, new vision LMs in different sizes (1B, 2B, 4B, 8B, 16B, 34B), with video and OCR support
> ColQwenStella-2b is a multilingual visual retrieval model that is sota in it's size
> Hoags-2B-Exp is a new multilingual vision LM with contextual reasoning, long context video understanding

πŸ’¬ LLMs
A lot of math models!
> Open-R1 team released OpenR1-Math-220k large scale math reasoning dataset, along with Qwen2.5-220K-Math fine-tuned on the dataset, OpenR1-Qwen-7B
> Nomic AI released new Nomic Embed multilingual retrieval model, a MoE with 500 params with 305M active params, outperforming other models
> DeepScaleR-1.5B-Preview is a new DeepSeek-R1-Distill fine-tune using distributed RL on math
> LIMO is a new fine-tune of Qwen2.5-32B-Instruct on Math

πŸ—£οΈ Audio
> Zonos-v0.1 is a new family of speech recognition models, which contains the model itself and embeddings

πŸ–ΌοΈ Vision and Image Generation
> We have ported DepthPro of Apple to transformers for your convenience!
> illustrious-xl-v1.0 is a new illustration generation model
Β·
upvoted an article 17 days ago
upvoted an article 24 days ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

β€’ 769
upvoted an article 25 days ago
view article
Article

Welcome to Inference Providers on the Hub πŸ”₯

β€’ 381