renjie

renjiepi

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

upvoted a paper 4 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

upvoted a paper 5 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Organizations

None yet

Create a static web page by editing HTML

models 11

renjiepi/BPO-Lora-LLaVA-7B

Updated Aug 23, 2024 • 2

renjiepi/protector_detector_3b_lora

Updated Apr 21, 2024 • 13

renjiepi/G-LLaVA-13B-align

Text Generation • Updated Mar 25, 2024

renjiepi/G-LLaVA-7B-2

Text Generation • Updated Mar 25, 2024 • 1

renjiepi/G-LLaVA-13B

Text Generation • Updated Mar 25, 2024 • 5 • 1

renjiepi/G-LLaVA-7B-align

Text Generation • Updated Mar 25, 2024 • 1

renjiepi/G-LLaVA-7B

Text Generation • Updated Mar 25, 2024 • 12 • 4

renjiepi/protector_detector_7b_lora

Updated Mar 23, 2024

renjiepi/mllm_protector_detoxifier

Updated Mar 23, 2024

renjiepi/autoencoder

Updated Sep 9, 2023

datasets 3

renjiepi/magicmagic

Viewer • Updated Aug 21, 2024 • 2.64k • 4

renjiepi/BPO_Instruct

Viewer • Updated Aug 5, 2024 • 188k • 8

renjiepi/harmful_vs_unharmful

Viewer • Updated Mar 23, 2024 • 20.2k • 9 • 1

Papers 3

spaces 1

Encoder
