Rosswill

Kutches

AI & ML interests

Recent Activity

liked a model about 7 hours ago

woctordho/ltx-lora-pruned

updated a model about 8 hours ago

Kutches/ImageZV2

liked a model about 11 hours ago

Zhongzhu/OSCAR-RotationZoo

View all activity

Organizations

None yet

upvoted 2 papers 4 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 6 days ago • 200

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Paper • 2605.22791 • Published 5 days ago • 24

upvoted a paper 10 days ago

Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Paper • 2605.14386 • Published 12 days ago • 59

upvoted 2 papers 11 days ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published 13 days ago • 217

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published 13 days ago • 96

upvoted 2 papers 14 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 19 days ago • 229

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published 18 days ago • 97

upvoted a paper 19 days ago

Video Generation with Predictive Latents

Paper • 2605.02134 • Published 22 days ago • 24

upvoted a paper 27 days ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 29 days ago • 118

upvoted a paper about 2 months ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 114

upvoted a collection about 2 months ago

Gemma 4 Uncensored

Collection

Abliterated Gemma 4 models with refusal behavior removed. Biprojection + EGA for MoE. Cross-validated against 686 prompts from 4 datasets. • 8 items • Updated Apr 5 • 86

upvoted 2 papers about 2 months ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published Mar 26 • 53

upvoted 3 papers 2 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 185

From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Paper • 2603.12648 • Published Mar 13 • 14

Can Vision-Language Models Solve the Shell Game?

Paper • 2603.08436 • Published Mar 9 • 39

upvoted a collection 3 months ago

Qwen3.5 Unredacted MAX

Collection

Continual “abliteration” models – experimental. • 8 items • Updated 29 days ago • 4

upvoted 3 papers 3 months ago

Rosswill

AI & ML interests

Recent Activity

Organizations

Kutches's activity