xhl's picture

26 5 5

xhl PRO

Xianhang

·

https://xhl-video.github.io/xianhangli/

xhl-video

AI & ML interests

Computer Vision

Recent Activity

liked a Space 2 months ago

facebook/physical_reasoning_leaderboard

new activity 3 months ago

UCSC-VLAA/openvision-vit-base-patch8-384:Add model card

new activity 3 months ago

UCSC-VLAA/openvision-vit-large-patch14-84:Add model card

View all activity

Organizations

authored 2 papers about 1 year ago

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 31

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12, 2024 • 42

authored 2 papers about 2 years ago

CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a \$10,000 Budget; An Extra \$4,000 Unlocks 81.8% Accuracy

Paper • 2306.15658 • Published Jun 27, 2023 • 12

An Inverse Scaling Law for CLIP Training

Paper • 2305.07017 • Published May 11, 2023 • 3