adversarial-evaluation

Here are 8 public repositories matching this topic...

tjhavranek / research-audit-duel-protocol

Human-in-the-loop adversarial workflows for high-stakes research audit: from ChatGPT-Gemini duels to 4-model MAD.

gemini meta-analysis grok claude peer-review large-language-models chatgpt multi-agent-debate adversarial-evaluation research-audit

Updated May 30, 2026

Madhur-1 / RevealVLLMSafetyEval

Star

RevealVLLMSafetyEval is a comprehensive pipeline for evaluating Vision-Language Models (VLMs) on their compliance with harm-related policies. It automates the creation of adversarial multi-turn datasets and the evaluation of model responses, supporting responsible AI development and red-teaming efforts.

red-teaming responsible-ai llava vllm vision-language-models qwen2 responsible-ai-techniques llama3 phi3 gpt-4o qwen2-vl pixtral adversarial-evaluation multimodal-safety

Updated May 12, 2025
Python

tjhavranek / mad-research

Star

Three Claude Code skills for working with Codex CLI: codex-bridge (one-shot Codex calls), mad-build (Claude+Codex collaboration with cross-review), and mad-research (three-stream adversarial audit of papers, grants, reports with anonymized cross-critique and fresh-Codex synthesis).

codex claude peer-review ai-tools multi-agent-debate claude-code adversarial-evaluation research-audit

Updated Jun 3, 2026

alexsds / ade-workflow

Star

Claude Code plugin implementing Anthropic's 3-agent harness (Planner, Generator, Evaluator) for long-running app development with pluggable rubrics and adversarial evaluation

code-generation ai-agents anthropic claude-code adversarial-evaluation claude-code-plugin agent-harness

Updated Apr 3, 2026
Shell

Ziqing110 / rag-evidence-attack-lab

Star

Scientific QA robustness evaluation pipeline for evidence-missing RAG scenarios on PeerQA, with EM/F1 reliability analysis.

python rag openai-api llm-evaluation hallucination-detection adversarial-evaluation

Updated Mar 18, 2026
Python

Darv0n / sia-research-engine

Star

Multi-agent deep research engine with SIA (Semantic Intelligence Architecture) — thermodynamic entropy control, adversarial critique, multi-reactor swarm orchestration

python research entropy multi-agent knowledge-graph swarm-intelligence rag anthropic langgraph adversarial-evaluation

Updated Feb 25, 2026
Python

SHRAVANIRANE / GuardMCP

Star

GuardMCP - Deterministic Runtime Semantic Enforcement for Agentic Tool Execution using Directional Intent–Action Alignment

semantic-alignment nlp-research adversarial-evaluation agent-safety prompt-injection-detection research-benchmark embedding-based-methods vector-space-analysis

Updated Apr 4, 2026
Python

tahamsi / cider-deliberation

Star

CIDeR: a reproducible benchmark framework for causal exposure control in multi-agent LLM deliberation, comparing exposure-aware aggregation against voting, self-consistency, debate, causal-credit, social-choice, diversity, and adversarial baselines.

debate multi causal-inference multi-agent-systems self-consistency deliberation social-choice calibration-toolbox llm llm-evaluation-framework adversarial-evaluation

Updated Jun 1, 2026
Python

Improve this page

Add a description, image, and links to the adversarial-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the adversarial-evaluation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adversarial-evaluation

Here are 8 public repositories matching this topic...

tjhavranek / research-audit-duel-protocol

Madhur-1 / RevealVLLMSafetyEval

tjhavranek / mad-research

alexsds / ade-workflow

Ziqing110 / rag-evidence-attack-lab

Darv0n / sia-research-engine

SHRAVANIRANE / GuardMCP

tahamsi / cider-deliberation

Improve this page

Add this topic to your repo