-
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Paper • 2310.20624 • Published • 13 -
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Paper • 2310.20587 • Published • 18 -
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
Paper • 2311.00117 • Published -
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
Paper • 2303.08320 • Published • 3
Vikarti Anatra
vikarti-anatra
·
AI & ML interests
None yet
Recent Activity
liked
a model
5 days ago
ArliAI/QwQ-32B-ArliAI-RpR-v3
liked
a model
7 days ago
Wan-AI/Wan2.1-T2V-1.3B
liked
a model
9 days ago
SentientAGI/Dobby-Mini-Unhinged-Llama-3.1-8B
Organizations
None yet
Collections
2
Various quants of Faro-Yi-9B-DPO
models
3
datasets
0
None public yet