Charles Cai

charlescai2016

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Lightricks/LTX-2

liked a model 3 days ago

tencent/HY-Motion-1.0

upvoted a paper 7 days ago

mHC: Manifold-Constrained Hyper-Connections

View all activity

Organizations

upvoted a paper 7 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 8 days ago • 227

upvoted an article 2 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

•

upvoted a paper 2 months ago

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Paper • 2510.04290 • Published Oct 5, 2025 • 18

upvoted an article 2 months ago

Article

Train your ControlNet with diffusers

Mar 24, 2023

•

upvoted 2 papers 2 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 116

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 84

upvoted a paper 3 months ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 27

upvoted a collection 3 months ago

smol2operator Release

Collection

4 items • Updated Sep 23, 2025 • 24

upvoted an article 3 months ago

Article

Jupyter Agents: training LLMs to reason with notebooks

Sep 10, 2025

•

upvoted a paper 3 months ago

Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Paper • 2509.06917 • Published Sep 8, 2025 • 41

upvoted 6 papers 4 months ago

LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence

Paper • 2509.12203 • Published Sep 15, 2025 • 19

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings

Paper • 2509.04011 • Published Sep 4, 2025 • 28

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27, 2024 • 144

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

upvoted 2 papers 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20, 2025 • 43

upvoted an article 5 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14, 2024

•

100

upvoted a paper 5 months ago

Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization

Paper • 2508.14811 • Published Aug 20, 2025 • 42

Charles Cai

AI & ML interests

Recent Activity

Organizations

charlescai2016's activity

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Train your ControlNet with diffusers

Jupyter Agents: training LLMs to reason with notebooks

Model2Vec: Distill a Small Fast Model from any Sentence Transformer