Tasos Stamoulakatos

stamtron

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

upvoted an article 14 days ago

Open-source DeepResearch – Freeing our search agents

upvoted an article 3 months ago

SmolVLM2: Bringing Video Understanding to Every Device

View all activity

Organizations

upvoted a paper 3 days ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published 8 days ago • 29

upvoted an article 14 days ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.27k

upvoted 2 articles 3 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

and 6 others •

Feb 20

• 282

Article

SmolVLM - small yet mighty Vision Language Model

and 4 others •

Nov 26, 2024

• 329

liked a model 3 months ago

meta-llama/Llama-3.2-1B

Text Generation • 1B • Updated Oct 24, 2024 • 3.53M • • 1.98k

upvoted 3 articles 3 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 273

Article

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

and 1 other •

Mar 21

• 34

Article

Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm

•

Mar 19

• 6

upvoted an article 4 months ago

Article

The Large Language Model Course

•

Jan 16

• 195

liked 2 models 4 months ago

meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.37M • • 1.58k

meta-llama/Meta-Llama-3-8B

Text Generation • 8B • Updated Sep 27, 2024 • 366k • • 6.24k

liked a dataset 6 months ago

mlabonne/llmtwin

Viewer • Updated Aug 27, 2024 • 3.34k • 207 • 15

liked a Space over 1 year ago

Mask Rcnn

💻

Tasos Stamoulakatos

AI & ML interests

Recent Activity

Organizations

stamtron's activity

Open-source DeepResearch – Freeing our search agents

SmolVLM2: Bringing Video Understanding to Every Device

SmolVLM - small yet mighty Vision Language Model

ColPali: Efficient Document Retrieval with Vision Language Models 👀

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm

The Large Language Model Course

Mask Rcnn