Evaluation Metrics

Topic related to evaluation-metrics

The LLM Evaluation Framework

8,485734
Python

️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍

56928
Python

A Neural Framework for MT Evaluation

61993
Python

Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including OpenAI Agents SDK, CrewAI, Langchain, Autogen, AG2, and CamelAI

4,606425
Python

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

1,668300
Python

Open source RAG evaluation package

24813
Python

Python SDK for running evaluations on LLM generated responses

28821
Python

🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking

19958
Python

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

31618

Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13

18223
Python

STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지

6210
Python

The most comprehensive Python package for evaluating survival analysis models.

355
Python

Counting-Stars ()

832
Jupyter Notebook

Continuation of an abandoned project fast-coco-eval

11410
Python

Python client for Kolena's machine learning testing platform

465
Python

Topic Statistics

Related Topics