Yongxin Guo

Yongxin-Guo

https://gyxxyg.github.io/yongxinguo/

gyxxyg

Recent Activity

upvoted a paper 2 days ago

TULIP: Towards Unified Language-Image Pretraining

upvoted a paper 2 days ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

upvoted a paper 2 days ago

ViSpeak: Visual Instruction Feedback in Streaming Videos

Yongxin-Guo's activity

upvoted a paper 12 days ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 16 days ago • 107

upvoted 2 papers about 2 months ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published Jan 29 • 55

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 109

upvoted a paper 2 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 276

upvoted 13 papers 3 months ago

OpenAI o1 System Card

Paper • 2412.16720 • Published Dec 21, 2024 • 31

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 55

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 97

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 92

Autoregressive Video Generation without Vector Quantization

Paper • 2412.14169 • Published Dec 18, 2024 • 14

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 135

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 51

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published Dec 19, 2024 • 73

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 356

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 93

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 17