plaguss
·
AI & ML interests
None yet
Organizations
plaguss/Qwen2.5-Math-1.5B-Instruct-PRM-0.1
Token Classification
•
2B
•
Updated
•
7
plaguss/Qwen2.5-Math-7B-Instruct-PRM-0.1
Token Classification
•
7B
•
Updated
•
8
plaguss/Qwen2.5-Math-7B-PRM-0.1
Token Classification
•
7B
•
Updated
•
12
plaguss/Llama-3.1-8B-Math-Shepherd-PRM-0.2
Token Classification
•
8B
•
Updated
•
10
plaguss/Mistral-7B-v0.1-Math-Shepherd-PRM-0.2
Token Classification
•
7B
•
Updated
•
9
plaguss/Qwen2.5-0.5B-Math-Shepherd-PRM-0.2
Token Classification
•
0.5B
•
Updated
•
10
plaguss/mistal-7b-prm-openrlhf
Text Generation
•
7B
•
Updated
•
7
plaguss/Mistral-7B-v0.1-Math-Shepherd-PRM-token-0.1
Token Classification
•
7B
•
Updated
•
7
plaguss/Qwen2.5-0.5B-Math-Shepherd-PRM-token-0.1
Token Classification
•
0.5B
•
Updated
•
9
plaguss/Mistral-7B-v0.1-Math-Shepherd-PRM-0.1
Token Classification
•
7B
•
Updated
•
10
plaguss/Qwen2.5-0.5B-Math-Shepherd-PRM-0.1
Token Classification
•
0.5B
•
Updated
•
12
plaguss/Llama-3.1-8B-Instruct-FineTome-APO-zero-12epoch-rmsprop-2048
Text Generation
•
8B
•
Updated
•
10
plaguss/Llama-3.1-8B-Instruct-FineTome-APO-zero-6epoch-rmsprop
Text Generation
•
8B
•
Updated
•
8
plaguss/bge-base-argilla-sdk-matryoshka
Sentence Similarity
•
0.1B
•
Updated
•
6
•
5
plaguss/zephyr-7b-lora-dpo-dibt-v0
Text Generation
•
7B
•
Updated
•
8
plaguss/zephyr-7b-lora-adapter-dpo-dibt-v0
plaguss/stablelm-2-1.6-dpo-disticoder-v0.1
plaguss/stablelm-2-1_6-sft-disticoder-v01
Text Generation
•
2B
•
Updated
•
9
plaguss/phi-2-disticoder-v0.1
plaguss/mistral-7b-capybara-v0.1
Updated
plaguss/criticon_v0_lora
Feature Extraction
•
7B
•
Updated
•
7
plaguss/mocked
plaguss/test_model
Text Classification
•
Updated
•
9
plaguss/dialogpt_dwight
plaguss/dialogpt_dwight_small
plaguss/dialogpt_dwight2
plaguss/gpt2_dwight