1 8 4

zuijiang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Reinforcement Pre-Training

upvoted a paper about 2 months ago

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

upvoted a paper 3 months ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

View all activity

Organizations

upvoted a paper about 1 month ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 247

upvoted a paper about 2 months ago

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Paper • 2504.12764 • Published Apr 17 • 42

upvoted a paper 3 months ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published Apr 14 • 84

authored 4 papers 3 months ago

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

Paper • 2502.04675 • Published Feb 7

Expanding the Boundaries of Vision Prior Knowledge in Multi-modal Large Language Models

Paper • 2503.18034 • Published Mar 23

SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency

Paper • 2502.02458 • Published Feb 4

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Paper • 2504.00502 • Published Apr 1 • 24

liked a Space 5 months ago

2.82k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 5 months ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3 • 24

upvoted a paper 6 months ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3 • 18

commented a paper 6 months ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3 • 18 •

authored a paper 8 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 24

upvoted a paper 8 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 24

updated 2 datasets 11 months ago

zuijiang/alpaca-alpaca-clean

Viewer • Updated Aug 26, 2024 • 51.8k • 16

zuijiang/mistral-alpaca-clean

Viewer • Updated Aug 25, 2024 • 51.8k • 15

liked a dataset about 1 year ago

AIcell/MOSSBench

Updated Mar 4 • 234 • 4

liked a Space about 1 year ago

2.25k

Voice Clone

🗣

Clone voices using text and audio samples

updated a model about 1 year ago

zuijiang/llava-qwen1.5-14B-chat

Text Generation • 15B • Updated Jul 1, 2024 • 5

updated a dataset about 1 year ago

zuijiang/ocr_vqa

Viewer • Updated May 30, 2024 • 208k • 86