estnafinema0's Collections
PEFT variations
NER Extraction. Active Learning Approach.
SmolLM Variation: PPO & DPO Fine-Tuning for RLHF
PEFT variations
Updated Apr 11
estnafinema0/llm-course-hw3-dora • Text Generation • 0.3B • Updated Apr 11 • 5
estnafinema0/llm-course-hw3-lora • Text Generation • 0.3B • Updated Apr 11 • 4
estnafinema0/llm-course-hw3-tinyllama-qlora • Updated Apr 11
estnafinema0/llm-course-hw3-tinyllamma-qlora • Updated Apr 11