ICCV2023

community

AI & ML interests

None defined yet.

Recent Activity

ICCV2023's activity

AdinaY 
posted an update about 7 hours ago
AdinaY 
posted an update 4 days ago
AdinaY 
posted an update 5 days ago
AdinaY 
posted an update 6 days ago
AdinaY 
posted an update 6 days ago
Skywork-R1V 🚀 A 38B open multimodal reasoning model with advanced visual CoT capabilities, released by Skywork.

Skywork/Skywork-R1V-38B

✨ Visual Reasoning: Breaks down complex images step by step.
✨ Math & Science: Solves visual problems with high precision.
✨ Combines text & images for deeper understanding.

AdinaY 
posted an update 7 days ago
AtAndDev 
posted an update 9 days ago
There seem to be multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff, but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please don't do this, or try finetuning the models you use...
Sorry for filling y'all's feed with this bs but yk...
AdinaY 
posted an update 11 days ago
AdinaY 
posted an update 12 days ago
AtAndDev 
posted an update 12 days ago
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
AdinaY 
posted an update 13 days ago
AdinaY 
posted an update 13 days ago
AdinaY 
posted an update 19 days ago
Babel 🗼 A multilingual LLM supporting 25 languages, released by the Alibaba DAMO team.

Model: Tower-Babel/babel-67c172157372d4d6c4b4c6d5
Paper: Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers (2503.00865)

✨ 9B/83B chat & base
✨ Supports 25 languages: English, Chinese, Hindi, Spanish, Arabic, French, Bengali, Portuguese, Russian, Urdu, Indonesian, German, Japanese, Swahili, Filipino, Tamil, Vietnamese, Turkish, Italian, Javanese, Korean, Hausa, Persian, Thai, and Burmese
AdinaY 
posted an update 21 days ago
Qilin 🔥 A large-scale multimodal dataset for search, recommendation, and RAG research, released by Xiaohongshu & Tsinghua University.

Dataset: THUIR/Qilin
Paper: Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions (2503.00501)

✨ Multiple content modalities (text, images, video thumbnails)
✨ Rich user interaction data (from Xiaohongshu’s 300M+ MAUs, 70%+ search penetration)
✨ Comprehensive evaluation metrics
✨ Support for RAG system development