YJ

yjh415

AI & ML interests

None yet

Recent Activity

commented on a paper about 7 hours ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

commented on a paper about 11 hours ago

DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

commented on a paper about 11 hours ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

View all activity

Organizations

None yet

yjh415's activity

commented a paper about 7 hours ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 2 days ago • 30 •

commented 2 papers about 11 hours ago

DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

Paper • 2505.04410 • Published 9 days ago • 37 •

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published 1 day ago • 54 •

commented a paper 1 day ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published 4 days ago • 106 •

commented 4 papers 2 days ago

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Paper • 2505.07747 • Published 4 days ago • 56 •

commented 5 papers 3 days ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published 5 days ago • 119 •

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published 4 days ago • 72 •

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published 17 days ago • 87 •

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published 17 days ago • 87 •

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published 8 days ago • 132 •

commented 3 papers 8 days ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 10 days ago • 141 •

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published 11 days ago • 79 •

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published 10 days ago • 87 •

commented a paper 9 days ago

DeepCritic: Deliberate Critique with Large Language Models

Paper • 2505.00662 • Published 15 days ago • 49 •