Running 3.69k The Ultra-Scale Playbook π 3.69k The ultimate guide to training LLM on large GPU Clusters
Running 1.49k Big Code Models Leaderboard π 1.49k Explore and submit code model evaluations on a leaderboard
MediaTek-Research/Breeze-7B-Instruct-v0_1 Text Generation β’ 7B β’ Updated Apr 24, 2024 β’ 844 β’ 90