A collection of models trained using deep RL for a variety of games.
Matt Boraske
MattBoraske
AI & ML interests
Reinforcement Learning, Natural Language Processing, LLM Finetuning
Organizations
Reddit AITA Finetuning V1
Datasets curated from the reddit r/amithea**hole subreddit and models finetuned on them using QLoRA.
-
MattBoraske/llama-2-7b-chat-reddit-AITA-multiclass
Text Generation • 7B • Updated • 17 -
MattBoraske/llama-2-7b-chat-reddit-AITA-multiclass-top-2k
Text Generation • 7B • Updated • 6 -
MattBoraske/llama-2-7b-chat-reddit-AITA-binary
Text Generation • 7B • Updated • 5 -
MattBoraske/llama-2-7b-chat-reddit-AITA-binary-top-2k
Text Generation • 7B • Updated • 6
Deep RL Agents
A collection of models trained using deep RL for a variety of games.
Reddit AITA Finetuning V1
Datasets curated from the reddit r/amithea**hole subreddit and models finetuned on them using QLoRA.
-
MattBoraske/llama-2-7b-chat-reddit-AITA-multiclass
Text Generation • 7B • Updated • 17 -
MattBoraske/llama-2-7b-chat-reddit-AITA-multiclass-top-2k
Text Generation • 7B • Updated • 6 -
MattBoraske/llama-2-7b-chat-reddit-AITA-binary
Text Generation • 7B • Updated • 5 -
MattBoraske/llama-2-7b-chat-reddit-AITA-binary-top-2k
Text Generation • 7B • Updated • 6