Mohammadmostafa Rostamkhani

Mohammadmostafa

AI & ML interests

NLP, Vision, Multimodal

Recent Activity

upvoted a collection about 2 months ago

Vision Language Models

140 items • Updated Jul 22 • 6

upvoted 6 collections 3 months ago

VisionLM

1414 items • Updated 1 day ago • 101

Meta's Llama 3.2 multimodal models

5 items • Updated Dec 13, 2024 • 44

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 609

One-RL-to-See-Them-All

One RL to See Them All: Visual Triple Unified Reinforcement Learning. GitHub: https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated Jun 10 • 27

Any-to-Any Models, Datasets, Spaces

18 items • Updated Jun 20 • 23

MiMo-VL

6 items • Updated 1 day ago • 36

updated 8 datasets 5 months ago

VQA-Illusion/FashionMnist_train

Viewer • Updated Apr 2 • 6.3k • 116

VQA-Illusion/MNIST_train

Viewer • Updated Mar 27 • 7.56k • 98

VQA-Illusion/FashionMnist_test

Viewer • Updated Mar 27 • 5.76k • 377

VQA-Illusion/IllusionAnimals_test

Viewer • Updated Mar 27 • 5k • 726 • 2

VQA-Illusion/MNIST_test

Viewer • Updated Mar 27 • 5.55k • 80

VQA-Illusion/IllusionChar_test

Viewer • Updated Mar 27 • 4.9k • 1.1k

VQA-Illusion/IllusionChar_train

Viewer • Updated Mar 27 • 27.4k • 942 • 1

VQA-Illusion/IllusionAnimals_train

Viewer • Updated Mar 27 • 6.3k • 545 • 1

upvoted a paper 7 months ago

Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions

Paper • 2412.08169 • Published Dec 11, 2024 • 2

authored a paper 8 months ago

Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions

Paper • 2412.08169 • Published Dec 11, 2024 • 2