🤗 The largest hub of ready-to-use datasets for ML models with fast, eas...
:metal: awesome-semantic-segmentation
Arbitrary expression evaluation for golang
Test your prompts, agents, and RAGs. Use LLM evals to improve your app's...
Building a modern functional compiler from first principles. (http://dev...
Python package for the evaluation of odometry and SLAM
Klipse is a JavaScript plugin for embedding interactive code snippets in...
OpenCompass is an LLM evaluation platform, supporting a wide range of mo...
End-to-end Automatic Speech Recognition for Madarian and English in Tens...
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models ...
The LLM Evaluation Framework
A unified evaluation framework for large language models
An open-source visual programming environment for battle-testing prompts...
UpTrain is an open-source unified platform to evaluate and improve Gener...