4 25 39

Haiwen Diao

Paranioar

https://Paranioar.github.io/

AI & ML interests

Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model

Recent Activity

upvoted a paper 6 days ago

Uniform Discrete Diffusion with Metric Path for Video Generation

updated a model 14 days ago

Paranioar/NEO1_0-2B-PT

updated a model 14 days ago

Paranioar/NEO1_0-2B-MT

View all activity

Organizations

authored 2 papers 15 days ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published 19 days ago • 65

GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning

Paper • 2410.15266 • Published Oct 20, 2024

authored a paper about 1 month ago

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published Sep 29 • 35

authored a paper 5 months ago

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

Paper • 2502.06788 • Published Feb 10 • 13

authored a paper 6 months ago

End-to-End Vision Tokenizer Tuning

Paper • 2505.10562 • Published May 15 • 22

authored a paper 11 months ago

Autoregressive Video Generation without Vector Quantization

Paper • 2412.14169 • Published Dec 18, 2024 • 14

authored 4 papers over 1 year ago

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Paper • 2407.08303 • Published Jul 11, 2024 • 19

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Paper • 2407.07523 • Published Jul 10, 2024 • 6

Unveiling Encoder-Free Vision-Language Models

Paper • 2406.11832 • Published Jun 17, 2024 • 54

Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching

Paper • 2404.18114 • Published Apr 28, 2024

authored 3 papers almost 2 years ago

UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory

Paper • 2308.14316 • Published Aug 28, 2023

Similarity Reasoning and Filtration for Image-Text Matching

Paper • 2101.01368 • Published Jan 5, 2021

Plug-and-Play Regulators for Image-Text Matching

Paper • 2303.13371 • Published Mar 23, 2023

Haiwen Diao

AI & ML interests

Recent Activity

Organizations

Paranioar's activity