Apps tagged with 'evaluation-framework'

All apps in Apps tagged with 'evaluation-framework' category. Use the filters below to narrow down your search. 
Copy a direct link to this comment to your clipboard
  1. Opik icon
     Like

    Opik is an open-source platform for evaluating, testing and monitoring LLM applications. Built by Comet.

    Cost / License

    Platforms

    • Self-Hosted
    • Docker
    Opik screenshot 1
    3 alternatives
  2. LightEval icon
     Like

    LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Self-Hosted
    • Python
    3 alternatives
  3. Maxim AI icon
     Like

    Maxim is an end-to-end AI evaluation and observability platform that empowers modern AI teams to ship agents with quality, reliability, and speed.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Software as a Service (SaaS)
    • Online
    Maxim AI screenshot 1
    Maxim AI screenshot 1
    Maxim AI screenshot 2