Pawel Garbacki
pgarbacki
AI & ML interests
None yet
Recent Activity
updated
a collection
3 days ago
Long
updated
a collection
3 days ago
Long
updated
a collection
3 days ago
Long
Organizations
retrieval
image
optimizers
finetuning
routing
computer use
-
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Paper • 2412.04454 • Published • 66 -
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Paper • 2410.05243 • Published • 19 -
UI-TARS: Pioneering Automated GUI Interaction with Native Agents
Paper • 2501.12326 • Published • 62
data
tool use
multimodal
video
foundational models
-
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Paper • 2405.04434 • Published • 21 -
Titans: Learning to Memorize at Test Time
Paper • 2501.00663 • Published • 25 -
Transformer^2: Self-adaptive LLMs
Paper • 2501.06252 • Published • 55 -
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
Paper • 2502.11089 • Published • 160
reasoning
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 87 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 64 -
ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights
Paper • 2406.14596 • Published • 5 -
A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
Paper • 2407.16216 • Published
RL
data
retrieval
tool use
image
multimodal
optimizers
video
finetuning
foundational models
-
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Paper • 2405.04434 • Published • 21 -
Titans: Learning to Memorize at Test Time
Paper • 2501.00663 • Published • 25 -
Transformer^2: Self-adaptive LLMs
Paper • 2501.06252 • Published • 55 -
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
Paper • 2502.11089 • Published • 160
routing
reasoning
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 87 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 64 -
ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights
Paper • 2406.14596 • Published • 5 -
A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
Paper • 2407.16216 • Published
computer use
-
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Paper • 2412.04454 • Published • 66 -
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Paper • 2410.05243 • Published • 19 -
UI-TARS: Pioneering Automated GUI Interaction with Native Agents
Paper • 2501.12326 • Published • 62