🐢 Open-Source Evaluation & Testing for AI & LLM systems
-
Updated
Jul 8, 2025 - Python
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Agentic testing for agentic codebases
Deliver safe & effective language models
MIT-licensed Framework for LLMs, RAGs, Chatbots testing. Configurable via YAML and integrable into CI pipelines for automated testing.
GPT4Go: AI-Powered Test Case Generation for Golang 🧪
A Python library for verifying code properties using natural language assertions.
Übungsaufgaben zum Buch "Basiswissen KI-Testen"
A CLI for testing your UI. Easy
👁 零代码零标注 CV AI 自动化测试平台 🚀 免除大量人工画框和打标签等,直接零代码快速自动化测试 CV 计算机视觉 AI 人工智能图像识别算法:行人检测、动植物分类、人脸识别、OCR 车牌识别、旋转校正、舞蹈姿态、抠图分割 等
Agent testing library that uses an agent to test your agent, in Go.
Agent testing library that uses an agent to test your agent, in Typescript.
Burro is a command-line interface (CLI) tool built with Deno for evaluating Large Language Model (LLM) outputs. It provides a straightforward way to run different types of evaluations with secure API key management.
An automated approach for exploring and testing conversational agents using large language models. TRACER discovers chatbot functionalities, generates user profiles, and creates comprehensive test suites for conversational AI systems.
Evaluation results and experimental data for TRACER, demonstrating its effectiveness in discovering chatbot functionalities and detecting errors with coverage analysis and mutation testing.
A plug & play framework for generative ai projects to be tested & automated
Open-source tools, SDKs, and resources for AetherLab AI quality control platform
🚀 ARM64 Browser Automation for Claude Code - SaaS testing on 80 Raspberry Pi budget. The first solution that works where Playwright/Puppeteer fail on ARM64. Autonomous testing without human debugging.
AI Generated BDD for Java and Junit using ChatGPT4o code
Add a description, image, and links to the ai-testing topic page so that developers can more easily learn about it.
To associate your repository with the ai-testing topic, visit your repo's landing page and select "manage topics."