Best 175 Evaluation Open Source Projects

🤗 The largest hub of ready-to-use datasets for ML models with fast, eas...

🤘 awesome-semantic-segmentation

Arbitrary expression evaluation for golang

Test your prompts, agents, and RAGs. Use LLM evals to improve your app's...

Building a modern functional compiler from first principles. (http://dev...

Python package for the evaluation of odometry and SLAM

Klipse is a JavaScript plugin for embedding interactive code snippets in...

OpenCompass is an LLM evaluation platform, supporting a wide range of mo...

End-to-end Automatic Speech Recognition for Mandarin and English in Tens...

SuperCLUE: Comprehensive benchmark for Chinese general-purpose large models | A Benchmark for Foundation Models ...

The LLM Evaluation Framework

A unified evaluation framework for large language models

An open-source visual programming environment for battle-testing prompts...

UpTrain is an open-source unified platform to evaluate and improve Gener...