Dũng Võ PRO

tuandunghcmut

tuandung222

AI & ML interests

Pretraining on Human-centric Tasks, Human-centric Visual Perception, Person Re-identification, Pedestrian Attribute Recognition, Text-based Person Retrieval, Text-based Human Parsing

Recent Activity

updated a model about 20 hours ago

tuandunghcmut/Qwen3-FT-Customer-Dataset

liked a model 2 days ago

Qwen/Qwen2.5-VL-32B-Instruct

liked a Space 2 days ago

hf-accelerate/model-memory-usage

View all activity

Organizations

None yet

tuandunghcmut's activity

upvoted a collection 10 days ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 4 days ago • 130

upvoted a paper 10 days ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published 11 days ago • 82

upvoted a collection 10 days ago

Chat Instruction datasets

Collection

12 items • Updated Nov 25, 2024 • 2

upvoted 2 collections 11 days ago

HELM Datasets

Collection

These are datasets used for Vietnamese LLM evaluation • 18 items • Updated Apr 24 • 1

SEA-HELM Evaluation Datasets

Collection

13 items • Updated Dec 19, 2024 • 1

upvoted a collection 9 months ago

GIT

Collection

GIT (Generative Image-to-text Transformer) is a model useful for vision-language tasks such as image/video captioning and question answering. • 18 items • Updated 24 days ago • 13