Running 170 The ultimate guide to RL environments: building and scaling them in the LLM era π 170 Building and scaling RL environments for LLM training
Running on CPU Upgrade 14k Open LLM Leaderboard π 14k Track, rank and evaluate open LLMs and chatbots
mlx-community/Mistral-7B-Instruct-v0.2-4-bit Text Generation β’ Updated Dec 27, 2023 β’ 1.42k β’ 24