Zack Ankner
ankner
Base Models With Chat Templates
Llama base models with a chat template that still use eos_token
Hydra Decoding
Paper: https://arxiv.org/abs/2402.05109 | Code: https://github.com/zankner/Hydra
Oracle 2 Proxy Models
Oracle 2 Proxy Data
Multi Judgement Oversight
Critique-out-Loud Reward Models
Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud