Vadim Kurochkin

Vadim21221

AI & ML interests

None yet

Organizations

None yet

Vadim21221's activity

updated a model 2 days ago

Vadim21221/sae_Qwen_Qwen2.5-1.5B_resid_post_layer_12_size_16384_topk_reg_coeff_0.0018

Updated 2 days ago

published a model 2 days ago

Vadim21221/sae_Qwen_Qwen2.5-1.5B_resid_post_layer_12_size_16384_topk_reg_coeff_0.0018

Updated 2 days ago

updated a model 2 days ago

Vadim21221/sae_Qwen_Qwen2.5-1.5B-Instruct_resid_post_layer_12_size_16384_topk_reg_coeff_0.0018

Updated 2 days ago

published a model 2 days ago

Vadim21221/sae_Qwen_Qwen2.5-1.5B-Instruct_resid_post_layer_12_size_16384_topk_reg_coeff_0.0018

Updated 2 days ago

upvoted a paper 3 days ago

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Paper • 2506.06276 • Published 6 days ago • 18

upvoted a collection 7 days ago

Qwen3

Collection

40 items • Updated 22 days ago • 751

upvoted a paper 8 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 29 days ago • 189

authored a paper 10 days ago

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Paper • 2505.22255 • Published 15 days ago • 21

upvoted a paper 13 days ago

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Paper • 2505.22255 • Published 15 days ago • 21

updated a model 29 days ago

Vadim21221/sae

Updated 29 days ago

published a model about 1 month ago

Vadim21221/sae

Updated 29 days ago

updated a dataset about 2 months ago

Vadim21221/agri-vision-2021-segmentation

Viewer • Updated Apr 26 • 75.3k • 165

published a dataset about 2 months ago

Vadim21221/agri-vision-2021-segmentation

Viewer • Updated Apr 26 • 75.3k • 165

updated a dataset about 2 months ago

Vadim21221/Agriculture_Vision

Viewer • Updated Apr 26 • 1.06M • 60

published a dataset about 2 months ago

Vadim21221/Agriculture_Vision

Viewer • Updated Apr 26 • 1.06M • 60

upvoted 2 papers 4 months ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 61

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 115

upvoted a paper about 1 year ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 123