Xie's picture

Xie

Zhihui

·

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

RekaAI/reka-flash-3.1

liked a model 22 days ago

POLARIS-Project/Polaris-4B-Preview

liked a Space 23 days ago

visionLMsftw/comparevlms

View all activity

Organizations

authored a paper 5 months ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

authored 3 papers 8 months ago

Pretraining in Deep Reinforcement Learning: A Survey

Paper • 2211.03959 • Published Nov 8, 2022 • 1

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment

Paper • 2410.09421 • Published Oct 12, 2024

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published Nov 26, 2024 • 11

authored 3 papers about 1 year ago

Jailbreaking as a Reward Misspecification Problem

Paper • 2406.14393 • Published Jun 20, 2024 • 13

Calibrating Reasoning in Language Models with Internal Consistency

Paper • 2405.18711 • Published May 29, 2024 • 6

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Paper • 2404.12387 • Published Apr 18, 2024 • 40

authored a paper over 1 year ago

Silkie: Preference Distillation for Large Visual Language Models

Paper • 2312.10665 • Published Dec 17, 2023 • 11

authored a paper almost 2 years ago

Future-conditioned Unsupervised Pretraining for Decision Transformer

Paper • 2305.16683 • Published May 26, 2023