Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
14
4
renjie
renjiepi
Follow
XuankunRong's profile picture
YangCaoCS's profile picture
21world's profile picture
6 followers
·
18 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
upvoted
a
paper
4 months ago
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
upvoted
a
paper
5 months ago
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
View all activity
Organizations
None yet
Papers
3
arxiv:
2502.12084
arxiv:
2410.07113
arxiv:
2312.11370
spaces
1
Running
Encoder
📊
Create a static web page by editing HTML
models
11
Sort:Â Recently updated
renjiepi/BPO-Lora-LLaVA-7B
Updated
Aug 23, 2024
•
2
renjiepi/protector_detector_3b_lora
Updated
Apr 21, 2024
•
13
renjiepi/G-LLaVA-13B-align
Text Generation
•
Updated
Mar 25, 2024
renjiepi/G-LLaVA-7B-2
Text Generation
•
Updated
Mar 25, 2024
•
1
renjiepi/G-LLaVA-13B
Text Generation
•
Updated
Mar 25, 2024
•
5
•
1
renjiepi/G-LLaVA-7B-align
Text Generation
•
Updated
Mar 25, 2024
•
1
renjiepi/G-LLaVA-7B
Text Generation
•
Updated
Mar 25, 2024
•
12
•
4
renjiepi/protector_detector_7b_lora
Updated
Mar 23, 2024
renjiepi/mllm_protector_detoxifier
Updated
Mar 23, 2024
renjiepi/autoencoder
Updated
Sep 9, 2023
View 11 models
datasets
3
Sort:Â Recently updated
renjiepi/magicmagic
Viewer
•
Updated
Aug 21, 2024
•
2.64k
•
4
renjiepi/BPO_Instruct
Viewer
•
Updated
Aug 5, 2024
•
188k
•
8
renjiepi/harmful_vs_unharmful
Viewer
•
Updated
Mar 23, 2024
•
20.2k
•
9
•
1