SiMajid
AI & ML interests
None yet
Organizations
None yet
SiMajid/tiny_llama_orpo_v10
Updated
SiMajid/tiny_llama_orpo_v5
Updated
SiMajid/tiny_llama_orpo_v4
Updated
SiMajid/tiny_llama_orpo_v3
Updated
SiMajid/tiny_llama_orpo_v2
Updated
SiMajid/new-value-reward-model-deberta
Updated
SiMajid/tiny_llama_orpo_v1
Updated
SiMajid/tiny_llama_dpo_v1
Updated
SiMajid/tiny_llama_ppo_v1
Updated
SiMajid/new-value-reward-model-opt-350m-v4
Text Classification
•
0.4B
•
Updated
•
2
SiMajid/value-new-reward-model-opt-350m-v5
Text Classification
•
0.3B
•
Updated
SiMajid/new-value-reward-model-opt-350m-v3
Text Classification
•
0.4B
•
Updated
SiMajid/new-value-reward-model-opt-350m-v2
Text Classification
•
0.4B
•
Updated
SiMajid/new-value-reward-model-opt-350m-v1
Text Classification
•
0.3B
•
Updated
SiMajid/value-reward-model-opt-1.3B-v1
Updated
SiMajid/value-reward-model-opt-350m-v16
Text Classification
•
0.3B
•
Updated
•
1
SiMajid/value-reward-model-opt-350m-v15
Text Classification
•
0.3B
•
Updated
SiMajid/value-reward-model-opt-350m-v12
Text Classification
•
0.3B
•
Updated
•
2
SiMajid/value-reward-model-opt-350m-v11
Text Classification
•
0.3B
•
Updated
SiMajid/value-reward-model-opt-350m-v3
Text Classification
•
0.3B
•
Updated
SiMajid/value-reward-model-opt-350m
Updated
SiMajid/working
0.3B
•
Updated
SiMajid/last-reward-train-facebook-opt350m_v1
Updated
SiMajid/last_phi3_dpo_v1
Updated
SiMajid/llama3_orpo_v1
Updated
SiMajid/llama3_orpo_v2
Updated
SiMajid/phi3_new_dpo_v2
Updated
SiMajid/phi3_new_dpo_v1
Updated
SiMajid/llama3_dpo_v3
Updated
SiMajid/llama3_dpo_v2
Updated