Charles Cai

charlescai2016

AI & ML interests

None yet

Recent Activity

liked a dataset about 24 hours ago

rasoul-nikbakht/TSpec-LLM

Updated May 6, 2025 • 1.21k • 68

liked a dataset 9 days ago

netop/TeleLogs

Viewer • Updated Aug 5, 2025 • 3.26k • 1.41k • 29

liked 2 models 11 days ago

HeartMuLa/HeartMuLa-oss-3B

Text-to-Audio • 4B • Updated 12 days ago • 9.17k • 237

HeartMuLa/HeartMuLaGen

Text-to-Audio • Updated 12 days ago • 23

upvoted a paper 11 days ago

HeartMuLa: A Family of Open Sourced Music Foundation Models

Paper • 2601.10547 • Published 16 days ago • 40

upvoted a paper 15 days ago

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published 19 days ago • 40

liked 2 models 18 days ago

cerebras/GLM-4.7-REAP-218B-A32B-FP8

Text Generation • Updated 21 days ago • 1.28k • 17

cerebras/GLM-4.7-REAP-268B-A32B

Text Generation • 269B • Updated 9 days ago • 60 • 18

liked a model 24 days ago

Lightricks/LTX-2

Image-to-Video • Updated 12 days ago • 2.5M • 1.38k

liked a model 26 days ago

tencent/HY-Motion-1.0

Text-to-3D • Updated about 1 month ago • 971 • 357

upvoted a paper 30 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published about 1 month ago • 291

updated a collection about 2 months ago

Papers

Collection

4 items • Updated Dec 16, 2025

liked a dataset 2 months ago

iteratehack/code19-dataset

Viewer • Updated Nov 30, 2025 • 3.06k • 11 • 1

liked a model 2 months ago

PrimeIntellect/INTELLECT-3

Text Generation • 107B • Updated Nov 27, 2025 • 1.66k • 205

liked a model 3 months ago

ByteDance/BindWeave

Image-to-Video • Updated Nov 28, 2025 • 901 • 88

upvoted an article 3 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

upvoted a paper 3 months ago

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Paper • 2510.04290 • Published Oct 5, 2025 • 19

upvoted an article 3 months ago

Article

Train your ControlNet with diffusers

Mar 24, 2023

upvoted a paper 3 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 117

liked a Space 3 months ago

The Smol Training Playbook

📚

2.95k

The secrets to building world-class LLMs