arxiv:2603.06082
Kuba Grudzien
iamkuba
ยท
AI & ML interests
Model Based Optimization; Reinforcement Learning
Recent Activity
authored a paper 3 days ago
Language Self-Play For Data-Free Training authored a paper 3 days ago
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion
Policies authored a paper 3 days ago
Offline Materials Optimization with CliqueFlowmer