adversarial-testing

Here are 74 public repositories matching this topic...

0xSanei / darwinia

The Self-Evolving Agent Ecosystem — Trading agents that evolve through Darwinian selection and adversarial self-play

bitcoin trading genetic-algorithm quantitative-finance autonomous-agents backtesting ai-agents multi-agent-system evolutionary-computing streamlit adversarial-testing openclaw darwinian-evolution

Updated Apr 13, 2026
Python

IBM / ares

AI Robustness Evaluation System

security ai owasp owasp-top-10 red-teaming blue-teaming agentic-ai automated-red-teaming adversarial-testing

Updated May 21, 2026
Python

humanbound / humanbound

Open-source AI agent red-team engine, SDK, and CLI. Run offline or against the Humanbound Platform.

Updated May 20, 2026
Python

sherifkozman / the-red-council

LLM Adversarial Security Arena — Jailbreak → Detect → Defend → Verify

security gemini red-team llm langchain adversarial-testing

Updated May 9, 2026
Python

msu-denver / bili-core

bili-core is an open-source framework for LLM benchmarking using LangChain, LangGraph, Streamlit, and Flask. It enables effective LLM model comparisons, Retrieval-Augmented Generation (RAG), and customizable decision workflows. Part of MSU Denver’s Sustainability Hub, bili-core promotes data democracy and transparent, reproducible AI research. 🚀

Updated May 20, 2026
Python

audn-ai / skills

Red-team your AI agents from any coding IDE. Adversarial security testing skills for Claude Code, Cursor, Codex, and 40+ agents.

skills jailbreak red-team ai-security prompt-injection llm-security claude-code adversarial-testing agent-skill

Updated Apr 13, 2026

alejandrosaenz117 / bonfires-marketplace

A marketplace of Claude Code plugins for adversarial security and architectural code review.

security architecture code-review threat-modeling security-review claude-code adversarial-testing plugin-marketplace

Updated Mar 30, 2026

jhlee0409 / elenchus-mcp

Elenchus MCP Server - Adversarial verification system for code review

nodejs typescript ai mcp static-analysis code-review claude code-verification llm anthropic model-context-protocol mcp-server adversarial-testing

Updated Jan 29, 2026
TypeScript

stchakwdev / Gaslight_EVAL

AI safety evaluation framework testing LLM epistemic robustness under adversarial self-history manipulation

python ai-safety openrouter llm-evaluation adversarial-testing alignment-research epistemic-robustness

Updated Dec 18, 2025
Python

YaswanthGhanta / llm-logical-integrity-benchmark

Adversarial testing of LLMs on constraint satisfaction deadlocks

reinforcement-learning gemini grok claude hallucination prompt-engineering chain-of-thought chatgpt rlhf qwen llm-evaluation sycophancy deepseek safety-alignment ai-red-teaming kimi-k2 adversarial-testing

Updated Jan 27, 2026

dr-gareth-roberts / context-engineering

Context engineering toolkit for LLMs — pack, cache, debug, red-team, and orchestrate context windows. Council of Experts, adversarial testing, immune system, context compiler, drift detection, multi-agent entanglement. TypeScript + Python.

python typescript ai multi-agent rag llm prompt-engineering llm-tools context-window prefix-caching context-engineering adversarial-testing token-budget council-of-experts context-packing

Updated May 19, 2026
Python

vibheksoni / jailbench

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

tasumermaf / the-adversary

Agent-driven adversarial paper audit framework

python ai-agents scientific-writing research-tools adversarial-testing paper-audit

Updated Mar 17, 2026
Python

jhcdev / omc-codex

Cross-model orchestration for Claude Code — Claude builds, Codex validates. Blind TDD, adversarial stress testing, mixed-model teams, and automatic fallback. Two AI models enter, better code leaves.

plugin tdd developer-tools code-review codex cross-model ai-orchestration claude-code adversarial-testing oh-my-claudecode

Updated Apr 3, 2026
JavaScript

craigtrim / persona-api

API for generating LLM bot/agent personalities based on the Big Five personality model.

big-five-model adversarial-testing personality-api llm-agent-personas behavioral-profiles

Updated Jan 2, 2026
Python

zakky8 / llm-jailbreak-taxonomy

Mechanism-grounded taxonomy of 40 LLM jailbreak patterns across 10 categories. Full evaluation harness for 4 frontier models. AI safety research with responsible disclosure.

taxonomy jailbreak alignment ai-safety security-testing responsible-disclosure jailbreak-detection adversarial-attacks red-teaming ai-security model-robustness adversarial-ml prompt-injection red-teaming-tools llm-security llm-evaluation llm-jailbreaks ai-red-teaming adversarial-testing

Updated Mar 21, 2026
Jupyter Notebook

Nicholas-Kloster / VisorCorpus

Go toolkit + library: structured adversarial corpora for LLM/RAG safety + quality testing. Prompt injection, KB exfiltration, jailbreak, system-prompt probing. CI/CD-ready.

cli golang jailbreak corpus nuclide visor red-team ai-security go-cli adversarial-ml prompt-injection llm-security safety-evaluation rag-testing rag-security defensive-ai adversarial-testing corpus-generation nicholas-kloster

Updated May 5, 2026
Go

audn-ai / audn-cli

CLI for Audn.ai — CI/CD security gate and developer workflows for AI agent red-teaming

cli golang security cicd red-team ai-security voice-ai llm-testing adversarial-testing

Updated Apr 13, 2026
Go

bogdanticu88 / OmniFuzz-LLM

Adversarial testing and red-teaming framework for enterprise LLM deployments. Covers OWASP LLM Top 10 across 11 attack modules, RAG poisoning, tool-call abuse, PII leakage, credential harvesting, hallucination, and more. Built to run in CI/CD pipelines.

Updated Mar 22, 2026
Python

anotherben / claude-enterprise-skills

9-stage enterprise development pipeline for Claude Code. TDD, adversarial testing, mechanical verification. Any stack.

Updated Mar 14, 2026
Shell
