Joshua Chak

JoshuaChak

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

liked a model 18 days ago

lym00/Wan2.1_T2V_1.3B_SelfForcing_VACE

upvoted an article 18 days ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Organizations

upvoted a paper 2 days ago

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published 4 days ago • 44

upvoted an article 18 days ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

and 1 other • 23 days ago • 621

upvoted a paper 30 days ago

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Paper • 2506.20639 • Published Jun 25 • 27

upvoted a paper about 1 month ago

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published Jun 9 • 26

upvoted a paper 2 months ago

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published May 27 • 39

upvoted a paper 4 months ago

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Paper • 2503.10625 • Published Mar 13 • 32

upvoted a collection 5 months ago

OLMo 2

Collection

Artifacts for the OLMo 2 release. • 35 items • Updated May 1 • 135

upvoted a paper 6 months ago

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published Feb 3 • 24

upvoted a paper 7 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 153

upvoted 4 papers 9 months ago

upvoted 2 papers 10 months ago

Autonomous Character-Scene Interaction Synthesis from Text Instruction

Paper • 2410.03187 • Published Oct 4, 2024 • 7

Presto! Distilling Steps and Layers for Accelerating Music Generation

Paper • 2410.05167 • Published Oct 7, 2024 • 18

upvoted 4 papers about 1 year ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 34

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 68

Transformers meet Neural Algorithmic Reasoners

Paper • 2406.09308 • Published Jun 13, 2024 • 45

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Paper • 2406.07522 • Published Jun 11, 2024 • 41

upvoted an article about 1 year ago

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

Jun 20, 2024 • 26
