Slad

Sladwell

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

upvoted a paper 1 day ago

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

updated a collection 4 days ago

Deep Think

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published 16 days ago • 51

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published 16 days ago • 45

upvoted a paper 4 days ago

When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance

Paper • 2509.22193 • Published 9 days ago • 35

upvoted a paper 5 days ago

PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

Paper • 2509.19894 • Published 11 days ago • 31

upvoted a paper 10 days ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published 13 days ago • 124

upvoted a paper 14 days ago

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published 19 days ago • 76

upvoted 4 articles 15 days ago

Article

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 80

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

and 1 other •

Aug 18

• 75

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 227

Article

`LeRobotDataset`: Bringing large-scale datasets to lerobot

and 10 others •

19 days ago

• 35

upvoted 2 papers 22 days ago

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 47

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

Paper • 2508.16279 • Published Aug 22 • 51

upvoted 2 papers 25 days ago

LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Paper • 2509.05263 • Published 30 days ago • 10

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published 30 days ago • 45

upvoted 2 papers about 1 month ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 126

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 86

Slad

AI & ML interests

Recent Activity

Organizations

Sladwell's activity

Small Language Models (SLM): A Comprehensive Overview

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

`LeRobotDataset`: Bringing large-scale datasets to lerobot