Tran Quang Long's picture

Tran Quang Long

tqlong84

AI & ML interests

None yet

Recent Activity

replied to merve's post about 14 hours ago

stop using VLMs blindly ✋🏻 compare different VLM outputs on a huge variety of inputs (from reasoning to OCR!) 🔥 https://huggingface.co/spaces/visionLMsftw/comparevlms > has support for multiple VLMs: https://huggingface.co/google/gemma-3-27b-it, https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct, https://huggingface.co/Qwen/Qwen2.5-VL-32B-Instruct, https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct, https://huggingface.co/HuggingFaceTB/SmolVLM2-2.2B-Instruct > recommend us new models or inputs, we'll add 🫡 so far I figured out > for fact-checks, you need a relatively bigger size (7B is ok!) > Gemma 3 gets downgrade without pan and scan (especially for 📑) > Qwen2.5VL-32B is very talkative, great for reasoning but not good for simple tasks 🗣️

View all activity

Organizations

None yet

tqlong84's activity

replied to merve's post about 14 hours ago

Hello Merve,

Can you send me the link to the test shown in the video?