Building on HF

53 40 22

vansin PRO

vansin

AI & ML interests

None yet

Recent Activity

posted an update 4 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

commented on a paper 5 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

commented on a paper 5 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

View all activity

Organizations

posted an update 4 days ago

Post

209

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

commented 2 papers 5 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 7 days ago • 54 •

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 7 days ago • 54 •

commented 2 papers 22 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 22 days ago • 228 •

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

Paper • 2512.01816 • Published 23 days ago • 88 •

liked a model 23 days ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated 23 days ago • 93.9k • • 1.01k

New activity in opencompass/RISEBench_Gallery 29 days ago

cpu quota limit,can't start

#1 opened 29 days ago by

vansin

upvoted a paper about 2 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27 • 96

upvoted a paper 2 months ago

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

Paper • 2510.06303 • Published Oct 7 • 15

updated a dataset 2 months ago

paperscope/AIConf

Viewer • Updated Oct 12 • 19.7k • 26 • 1

published a dataset 2 months ago

paperscope/AIConf

Viewer • Updated Oct 12 • 19.7k • 26 • 1

updated a Space 2 months ago

README

🌍

published a Space 2 months ago

README

🌍

upvoted a paper 3 months ago

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

Paper • 2510.04212 • Published Oct 5 • 23

commented a paper 3 months ago

AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems

Paper • 2510.05432 • Published Oct 6 • 6 •

upvoted a paper 3 months ago

REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration

Paper • 2510.01879 • Published Oct 2 • 8

updated a Space 3 months ago

Bot 3c3j4n70

⚡

Join Hugging Face's 8-day AI learning journey

published a Space 3 months ago

Bot 3c3j4n70

⚡

Join Hugging Face's 8-day AI learning journey

updated a Space 3 months ago

Hugging Face Paper Quiz

👁

Take a quiz to test your understanding of the Intern-S1 model

published a Space 3 months ago

Hugging Face Paper Quiz

👁

Take a quiz to test your understanding of the Intern-S1 model

vansin PRO

AI & ML interests

Recent Activity

Organizations

vansin's activity

cpu quota limit,can't start

README

README

Bot 3c3j4n70

Bot 3c3j4n70

Hugging Face Paper Quiz

Hugging Face Paper Quiz