Alexander Kovrigin
waleko
AI & ML interests
AI for Code
Recent Activity
upvoted a paper about 6 hours ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
upvoted a paper 8 days ago
ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models
updated a dataset 9 days ago
JetBrains-Research/EnvBench-trajectories