Add model-index with comprehensive benchmark evaluations

#78
by davidlms

Added structured evaluation results, extracted from the README benchmark tables, covering four categories:

1. Reasoning & Factuality (11 benchmarks):

  • HellaSwag: 77.2, BoolQ: 72.3, PIQA: 79.6, SocialIQA: 51.9
  • TriviaQA: 65.8, Natural Questions: 20.0
  • ARC-c: 56.2, ARC-e: 82.4, WinoGrande: 64.7
  • BIG-Bench Hard: 50.9, DROP: 60.1

2. STEM & Code (8 benchmarks):

  • MMLU: 59.6, MMLU Pro COT: 29.2, AGIEval: 42.1
  • MATH: 24.2, GSM8K: 38.4, GPQA: 15.0
  • MBPP: 46.0, HumanEval: 36.0

3. Multilingual (7 benchmarks):

  • MGSM: 34.7, Global-MMLU-Lite: 57.0
  • WMT24++ (ChrF): 48.4, FloRes: 39.2, XQuAD: 68.0
  • ECLeKTic: 11.0, IndicGenBench: 57.2

4. Multimodal (15 benchmarks):

  • COCOcap: 102.0, DocVQA: 72.8, InfoVQA: 44.1, MMMU: 39.2
  • TextVQA: 58.9, RealWorldQA: 45.5, ReMI: 27.3
  • AI2D: 63.2, ChartQA: 63.6, VQAv2: 63.9
  • BLINK: 38.0, OKVQA: 51.0, TallyQA: 42.5
  • SpatialSense VQA: 50.9, CountBenchQA: 26.1

Total: 41 benchmarks across reasoning, STEM, code, multilingual, and multimodal capabilities.
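
For illustration, here is a minimal sketch of how one of these results could be encoded as a model-index entry in the card metadata (the model name, task type, and dataset/metric type identifiers below are illustrative placeholders, not copied from the actual diff):

```yaml
model-index:
- name: <model-name>          # placeholder for the repository's model name
  results:
  - task:
      type: text-generation   # assumed task type
    dataset:
      name: HellaSwag         # benchmark name as reported in the README table
      type: hellaswag         # assumed dataset identifier
    metrics:
    - name: Accuracy          # assumed metric name/type for this benchmark
      type: accuracy
      value: 77.2             # score from the Reasoning & Factuality list above
```

The remaining 40 benchmarks follow the same pattern, one results entry per benchmark, with the metric type adjusted where the README reports a different metric (e.g. ChrF for WMT24++).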

This enables the model to appear on leaderboards and makes it easier to compare it with other models.

Note: the existing PRs (#57, #49, #34) modify the README body text. This PR only adds structured metadata to the YAML frontmatter, so it should not conflict with them.
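
As a rough sketch of where the addition lands relative to the body text those PRs touch (the keys shown are placeholders, not the card's actual metadata):

```yaml
---
# ...existing frontmatter keys (license, tags, etc.) stay untouched
model-index:
- name: <model-name>   # placeholder
  results:
  # one entry per benchmark, structured as in the sketch above
---
```

Everything below the closing `---` (the benchmark tables and usage text that #57, #49, and #34 edit) is left as-is by this PR.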

