Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xhl's picture
26 5 5

xhl PRO

Xianhang
·
https://xhl-video.github.io/xianhangli/
  • xhl-video

AI & ML interests

Computer Vision

Recent Activity

liked a Space 2 months ago
facebook/physical_reasoning_leaderboard
new activity 3 months ago
UCSC-VLAA/openvision-vit-base-patch8-384:Add model card
new activity 3 months ago
UCSC-VLAA/openvision-vit-large-patch14-84:Add model card
View all activity

Organizations

UCSC-VLAA's profile picture

authored 2 papers about 1 year ago

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 31

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12, 2024 • 42
authored 2 papers about 2 years ago

CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a \$10,000 Budget; An Extra \$4,000 Unlocks 81.8% Accuracy

Paper • 2306.15658 • Published Jun 27, 2023 • 12

An Inverse Scaling Law for CLIP Training

Paper • 2305.07017 • Published May 11, 2023 • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs