Starstrek

Stars321123

Stars321

AI & ML interests

Recent Activity

upvoted a paper about 10 hours ago

SageBwd: A Trainable Low-bit Attention

upvoted a paper about 10 hours ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

upvoted a paper 1 day ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

View all activity

Organizations

upvoted 2 papers about 10 hours ago

SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published 7 days ago • 15

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published 3 days ago • 73

upvoted a paper 1 day ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 152

upvoted a paper 2 days ago

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Paper • 2603.03756 • Published 6 days ago • 83

upvoted a collection 2 days ago

To read

Collection

140 items • Updated about 7 hours ago • 4

upvoted a paper 2 days ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published 7 days ago • 151

upvoted 2 papers 3 days ago

RealWonder: Real-Time Physical Action-Conditioned Video Generation

Paper • 2603.05449 • Published 4 days ago • 9

DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Paper • 2603.04743 • Published 5 days ago • 45

upvoted 2 articles 3 days ago

Article

Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations

4 days ago

•

Article

NEO-unify: Building Native Multimodal Unified Models End to End

4 days ago

•

upvoted a collection 3 days ago

Bim

Collection

39 items • Updated 5 days ago • 5

upvoted a paper 3 days ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 6 days ago • 153

upvoted 5 collections 3 days ago

upvoted 2 papers 3 days ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 76

Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling

Paper • 2603.04791 • Published 5 days ago • 14

upvoted a collection 3 days ago

OpenThinkIMG

Collection

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images. • 7 items • Updated 4 days ago • 4

Starstrek

AI & ML interests

Recent Activity

Organizations

Stars321123's activity

Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations

NEO-unify: Building Native Multimodal Unified Models End to End