Aleksanian Ekaterina's picture

2

Aleksanian Ekaterina

estnafinema0

·

estnafinema0

AI & ML interests

None yet

Organizations

None yet

upvoted 2 collections 4 months ago

NER Extraction. Active Learning Approach.

2 items • Updated Apr 4 • 1

SmolLM Variation: PPO & DPO Fine-Tuning for RLHF

This collection presents the fine-tuning of the SmolLM model using two (RLHF) approaches: DPO and PPO. • 3 items • Updated Mar 30 • 1