11 36 16

Guowei Xu PRO

Xkev

https://xugw-kevin.github.io

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

Xkev/qwen-2.5-openr1-random-subset-rlsft-wrongcode

published a model about 2 months ago

Xkev/qwen-2.5-openr1-random-subset-rlsft-wrongcode

updated a Space about 2 months ago

Xkev/Llama-3.2V-11B-cot

View all activity

Organizations

None yet

updated a model about 1 month ago

Xkev/qwen-2.5-openr1-random-subset-rlsft-wrongcode

Text Generation • 8B • Updated Sep 15 • 4

published a model about 2 months ago

Xkev/qwen-2.5-openr1-random-subset-rlsft-wrongcode

Text Generation • 8B • Updated Sep 15 • 4

updated a Space about 2 months ago

Llama 3.2V 11B Cot

💬

Chat about images with text input

upvoted a paper about 2 months ago

metaTextGrad: Automatically optimizing language model optimizers

Paper • 2505.18524 • Published May 24 • 1

liked a model 2 months ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26 • 11.6k • 1k

liked a model 3 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 4.84M • • 3.82k

authored a paper 3 months ago

metaTextGrad: Automatically optimizing language model optimizers

Paper • 2505.18524 • Published May 24 • 1

upvoted 3 papers 3 months ago

MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning

Paper • 2410.14972 • Published Oct 19, 2024 • 1

ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Paper • 2402.14528 • Published Feb 22, 2024 • 1

Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?

Paper • 2307.07837 • Published Jul 15, 2023 • 1

upvoted a paper 4 months ago

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

Paper • 2507.01925 • Published Jul 2 • 38

upvoted a paper 5 months ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3 • 58

upvoted 3 papers 6 months ago

PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models

Paper • 2504.16074 • Published Apr 22 • 36

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 297

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published Apr 14 • 84

upvoted a paper 7 months ago

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published Apr 3 • 57

updated a Space 7 months ago

Llama 3.2V 11B Cot

💬

Chat about images with text input

New activity in Xkev/LLaVA-CoT-100k 8 months ago

Greetings! I have made a R1 format fork of this dataset!

👀 1

#2 opened 8 months ago by

di-zhang-fdu

upvoted 2 papers 8 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 150

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153

Guowei Xu PRO

AI & ML interests

Recent Activity

Organizations

Xkev's activity

Llama 3.2V 11B Cot

Llama 3.2V 11B Cot

Greetings! I have made a R1 format fork of this dataset!