NewMind AI

Team

company

Verified

None defined yet.

nmmursit updated a Space about 16 hours ago

newmindai/Mezura

nmmursit updated a Space about 17 hours ago

newmindai/Mezura

zgrgr authored a paper 4 months ago

Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs

TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

newmindai 's Spaces 2

Compare and evaluate LLM performance across multiple benchmarks

Display benchmark results for embedding models