AI & ML interests
None defined yet.
edinlp/qwen-2.5-base-rlhf-zero-iter6
edinlp/qwen-2.5-base-rlhf-zero-iter5
edinlp/qwen-2.5-base-rlhf-zero-iter4
edinlp/qwen-2.5-base-rlhf-zero-iter3
edinlp/qwen-2.5-base-rlhf-zero-iter2
edinlp/qwen-2.5-base-rlhf-zero-iter1
Updated
edinlp/qwen2-7b-offline-dpo
8B
•
Updated
•
5
edinlp/llama-3-8b-offline-dpo
8B
•
Updated
•
4
edinlp/mistral-7b-v0.3-dpo
Text Generation
•
7B
•
Updated
•
6
edinlp/mistral-7b-v0.3-sft
Text Generation
•
7B
•
Updated
•
35
edinlp/llama-3-average
8B
•
Updated
•
4
edinlp/Llama-3-8B-DPO-Iter6
8B
•
Updated
•
5
edinlp/Llama-3-8B-DPO-Iter5
8B
•
Updated
•
5
edinlp/Llama-3-8B-DPO-Iter1
8B
•
Updated
•
5
edinlp/Llama-3-8B-DPO-Iter4
8B
•
Updated
•
5
edinlp/Llama-3-8B-DPO-Iter3
8B
•
Updated
•
5
edinlp/Llama-3-8B-DPO-Iter2
8B
•
Updated
•
5
edinlp/mistral-7b-dpo_zephyr_dpo
7B
•
Updated
•
5
edinlp/mistral-7b-dpo_iter6
7B
•
Updated
•
5
edinlp/mistral-7b-dpo_iter5
7B
•
Updated
•
4
edinlp/mistral-7b-dpo_iter4
7B
•
Updated
•
5
edinlp/mistral-7b-dpo_iter3
7B
•
Updated
•
5
edinlp/mistral-7b-dpo_iter2
7B
•
Updated
•
5
edinlp/mistral-7b-dpo_iter1
7B
•
Updated
•
5
edinlp/qwen-2-7b-default_iter6
8B
•
Updated
•
5
edinlp/qwen-2-7b-default_iter5
8B
•
Updated
•
5
edinlp/qwen-2-7b-default_iter4
8B
•
Updated
•
5
edinlp/qwen-2-7b-default_iter3
8B
•
Updated
•
8
edinlp/qwen-2-7b-default_iter2
8B
•
Updated
•
5
edinlp/qwen-2-7b-default_iter1
8B
•
Updated
•
5