Shufan Li's picture

Shufan Li

jacklishufan

·

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago

jacklishufan/sparse-lavida

published a model 9 days ago

jacklishufan/sparse-lavida

authored a paper about 1 month ago

SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation

View all activity

Organizations

authored a paper about 1 month ago

SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation

Paper • 2603.15150 • Published Mar 16

submitted a paper to Daily Papers about 1 month ago

SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation

Paper • 2603.15150 • Published Mar 16

submitted a paper to Daily Papers 2 months ago

LaViDa-R1: Advancing Reasoning for Unified Multimodal Diffusion Language Models

Paper • 2602.14147 • Published Feb 15 • 6

submitted 2 papers to Daily Papers 4 months ago

MobileWorldBench: Towards Semantic World Modeling For Mobile Agents

Paper • 2512.14014 • Published Dec 16, 2025 • 3

Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Paper • 2512.14008 • Published Dec 16, 2025 • 10

authored a paper 7 months ago

Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation

Paper • 2509.19244 • Published Sep 23, 2025 • 12

authored a paper 11 months ago

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Paper • 2505.16839 • Published May 22, 2025 • 13

authored a paper about 1 year ago

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

Paper • 2503.12271 • Published Mar 15, 2025 • 9

authored 8 papers over 1 year ago

OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows

Paper • 2412.01169 • Published Dec 2, 2024 • 13

InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following

Paper • 2312.06738 • Published Dec 11, 2023

Hierarchical Open-vocabulary Universal Image Segmentation

Paper • 2307.00764 • Published Jul 3, 2023

Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data

Paper • 2402.05892 • Published Feb 8, 2024

xT: Nested Tokenization for Larger Context in Large Images

Paper • 2403.01915 • Published Mar 4, 2024 • 1

Aligning Diffusion Models by Optimizing Human Utility

Paper • 2404.04465 • Published Apr 6, 2024 • 15

Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation Learning

Paper • 2212.14532 • Published Dec 30, 2022 • 1

SegLLM: Multi-round Reasoning Segmentation

Paper • 2410.18923 • Published Oct 24, 2024