1 8 1

Tyler Zhu

tyleryzhu

https://tylerzhu.com

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning

upvoted a paper 4 days ago

Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images

upvoted a paper about 1 month ago

Qwen2.5-Omni Technical Report

View all activity

Organizations

None yet

tyleryzhu's activity

upvoted a paper 2 days ago

COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning

Paper • 2504.21850 • Published 3 days ago • 24

upvoted a paper 4 days ago

Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images

Paper • 2504.09621 • Published 21 days ago • 11

upvoted 2 papers about 1 month ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 149

Attention IoU: Examining Biases in CelebA using Attention Maps

Paper • 2503.19846 • Published Mar 25 • 7

authored a paper about 1 month ago

Attention IoU: Examining Biases in CelebA using Attention Maps

Paper • 2503.19846 • Published Mar 25 • 7

upvoted a paper about 2 months ago

Video Action Differencing

Paper • 2503.07860 • Published Mar 10 • 33

updated a dataset 2 months ago

tyleryzhu/perception_test_val-event_recall

Viewer • Updated Feb 25 • 157 • 28

published a dataset 2 months ago

tyleryzhu/perception_test_val-event_recall

Viewer • Updated Feb 25 • 157 • 28

updated a model 4 months ago

tyleryzhu/merv

Updated Jan 5

upvoted a paper 4 months ago

Unifying Specialized Visual Encoders for Video Language Models

Paper • 2501.01426 • Published Jan 2 • 21

commented a paper 4 months ago

Unifying Specialized Visual Encoders for Video Language Models

Paper • 2501.01426 • Published Jan 2 • 21 •

authored 2 papers 4 months ago

xT: Nested Tokenization for Larger Context in Large Images

Paper • 2403.01915 • Published Mar 4, 2024

Unifying Specialized Visual Encoders for Video Language Models

Paper • 2501.01426 • Published Jan 2 • 21

upvoted 2 papers 7 months ago

Erasing Conceptual Knowledge from Language Models

Paper • 2410.02760 • Published Oct 3, 2024 • 14

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Paper • 2410.03051 • Published Oct 4, 2024 • 6

liked a model about 1 year ago

lmsys/vicuna-7b-v1.5

Text Generation • Updated Mar 13, 2024 • 238k • 336