Umar Azam

UmarAzam

Umar-Azam

AI & ML interests

Robotics and Simulations

Recent Activity

upvoted a paper 2 days ago

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

liked a model 11 days ago

Tongyi-MAI/MAI-UI-8B

upvoted a paper 12 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

View all activity

Organizations

None yet

upvoted a paper 2 days ago

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Paper • 2601.02204 • Published 4 days ago • 55

upvoted a paper 12 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 17 days ago • 60

upvoted a paper 30 days ago

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published Dec 2, 2025 • 36

upvoted 2 papers about 1 month ago

Monet: Reasoning in Latent Visual Space Beyond Images and Language

Paper • 2511.21395 • Published Nov 26, 2025 • 16

VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation

Paper • 2511.17199 • Published Nov 21, 2025 • 7

upvoted 2 papers about 2 months ago

RynnVLA-002: A Unified Vision-Language-Action and World Model

Paper • 2511.17502 • Published Nov 21, 2025 • 25

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Paper • 2508.14879 • Published Aug 20, 2025 • 69

upvoted 2 papers 2 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 42

Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects

Paper • 2511.01294 • Published Nov 3, 2025 • 13

upvoted a paper 3 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 55

upvoted 4 articles 3 months ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Oct 23, 2025

•

141

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.31k

Article

ScreenEnv: Deploy your full stack Desktop Agent

Jul 10, 2025

•

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

Sep 23, 2025

•

134

upvoted 3 papers 4 months ago

upvoted an article 5 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

583

upvoted a paper 5 months ago

SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension

Paper • 2508.01959 • Published Aug 3, 2025 • 59

upvoted an article 6 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15, 2025

•

222

Umar Azam

AI & ML interests

Recent Activity

Organizations

UmarAzam's activity

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Open-source DeepResearch – Freeing our search agents

ScreenEnv: Deploy your full stack Desktop Agent

Smol2Operator: Post-Training GUI Agents for Computer Use

Vision Language Models (Better, faster, stronger)

Train 400x faster Static Embedding Models with Sentence Transformers