Secret_H_Evals

Author: stchakwdev

stchakwdev September 29, 2025


Multi-agent strategic deception evaluation framework for LLMs using Secret Hitler as a testbed. Analyzes AI reasoning, trust dynamics, and deceptive behavior patterns.

Secret Hitler LLM Evaluation Framework

Multi-agent strategic deception evaluation for large language models using Secret Hitler as a testbed. This framework enables researchers to study AI reasoning, trust dynamics, and deceptive behavior patterns in a controlled game environment.

Author: Samuel T. Chakwera (stchakwdev)

Why This Project?

Understanding how AI systems engage in strategic deception is critical for AI safety research. Secret Hitler provides an ideal testbed because it:

Requires hidden-information management - Players must reason about unknown roles and hidden agendas
Involves coalition formation - Trust and betrayal dynamics emerge naturally from gameplay
Tests deceptive reasoning - Fascists must convincingly lie while Liberals must detect deception
Produces measurable outcomes - Win rates, voting patterns, and policy outcomes provide quantifiable metrics
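
As an illustration of the "measurable outcomes" point, per-team win rates can be computed from a batch of finished games. The sketch below is a minimal example only: the `GameRecord` class, its fields, and the `win_rates` helper are hypothetical names invented for this illustration and are not the framework's actual schema or API. The sample games are made-up data, not evaluation results.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class GameRecord:
    """Hypothetical per-game summary; NOT the framework's actual schema."""
    winner: str            # "liberal" or "fascist"
    liberal_policies: int  # liberal policies enacted by the end of the game
    fascist_policies: int  # fascist policies enacted by the end of the game


def win_rates(games: list[GameRecord]) -> dict[str, float]:
    """Return the fraction of games won by each team (empty dict if no games)."""
    if not games:
        return {}
    counts = Counter(g.winner for g in games)
    return {team: n / len(games) for team, n in counts.items()}


# Illustrative, made-up games (not real results).
games = [
    GameRecord("liberal", 5, 3),
    GameRecord("fascist", 2, 6),
    GameRecord("fascist", 3, 4),
    GameRecord("liberal", 5, 2),
]
rates = win_rates(games)
print(rates)
```

The same pattern could be extended to voting patterns or policy outcomes by aggregating over other fields of whatever record format the game logs actually use.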

This framework enables researchers to:

Evaluate deception capabilities across different LLM architectures
Study emergent social behaviors in mult
