小明

xiaoming · xiaominghero

AI & ML interests

nlp

Recent Activity

liked a model 13 days ago

ds4sd/SmolDocling-256M-preview

Organizations

None yet

xiaoming's activity

upvoted a collection 13 days ago

Document AI

16 items • Updated 14 days ago • 3

upvoted a paper 13 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published 18 days ago • 79

upvoted a paper about 1 month ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 54

upvoted a paper 3 months ago

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

Paper • 1906.03327 • Published Jun 7, 2019 • 1