1 1 4

CHEN ZHENG

zhengchenphd

AI & ML interests

Natural Language Processing, Question Answering

Recent Activity

liked a model 9 days ago

deepseek-ai/DeepSeek-R1

new activity 9 days ago

deepseek-ai/DeepSeek-R1:Congratulating DeepSeek-R1 and Inviting Review of Our Team’s Early Research last year on Similar Ideas

authored a paper 8 months ago

Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy

View all activity

Organizations

None yet

zhengchenphd's activity

liked a model 9 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 498k • 5.33k

New activity in deepseek-ai/DeepSeek-R1 9 days ago

Congratulating DeepSeek-R1 and Inviting Review of Our Team’s Early Research last year on Similar Ideas

#17 opened 9 days ago by

zhengchenphd

authored 5 papers 8 months ago

Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy

Paper • 2310.04945 • Published Oct 7, 2023 • 1

ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers

Paper • 2401.02072 • Published Jan 4, 2024 • 11

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

Paper • 2311.10614 • Published Nov 17, 2023

Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF

Paper • 2403.02513 • Published Mar 4, 2024

Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs

Paper • 2406.08657 • Published Jun 12, 2024 • 9

upvoted a paper 8 months ago

Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs

Paper • 2406.08657 • Published Jun 12, 2024 • 9

liked a model 8 months ago

zhengchenphd/Mistral-C2F-7B

Text Generation • Updated Jun 14, 2024 • 7 • 2

updated a model 8 months ago

zhengchenphd/Mistral-C2F-7B

Text Generation • Updated Jun 14, 2024 • 7 • 2

updated 2 models 11 months ago

zhengchenphd/ICE-GRT

Text Generation • Updated Mar 18, 2024 • 63 • 5

zhengchenphd/Mistral-Plus-7B

Text Generation • Updated Mar 18, 2024 • 533 • 4

liked a model 11 months ago

zhengchenphd/Mistral-Plus-7B

Text Generation • Updated Mar 18, 2024 • 533 • 4

liked a model about 1 year ago

zhengchenphd/ICE-GRT

Text Generation • Updated Mar 18, 2024 • 63 • 5