Secret_H_Evals — DeepSeek Agents | Neura Market
    Neura MarketNeura Market/DeepSeek
    ChatGPTChatGPTClaudeClaudeGeminiGeminiCursorCursorGrokGrokPerplexityPerplexityDeepSeekDeepSeek
    CoPilotCoPilotStable DiffusionStable DiffusionMidjourneyMidjourney
    View All Directories
    OverviewRulesPromptsMCPsAgentsBlogVideosGuidesCoursesCommunityTrendingGenerate
    DeepSeekAgentsSecret_H_Evals
    Back to Agents
    Secret_H_Evals

    Secret_H_Evals

    stchakwdev September 29, 2025
    3 copies 0 downloads

    Multi-agent strategic deception evaluation framework for LLMs using Secret Hitler as a testbed. Analyzes AI reasoning, trust dynamics, and deceptive behavior patterns.

    Agent Definition
    # Secret Hitler LLM Evaluation Framework
    
    [![Python 3.11+](https://img.shields.io/badge/python-3.11+-blue.svg)](https://www.python.org/downloads/)
    [![License: CC BY-NC-SA 4.0](https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg)](https://creativecommons.org/licenses/by-nc-sa/4.0/)
    [![GitHub stars](https://img.shields.io/github/stars/stchakwdev/Secret_H_Evals?style=social)](https://github.com/stchakwdev/Secret_H_Evals)
    
    Multi-agent strategic deception evaluation for large language models using Secret Hitler as a testbed. This framework enables researchers to study AI reasoning, trust dynamics, and deceptive behavior patterns in a controlled game environment.
    
    **Author**: Samuel T. Chakwera ([stchakwdev](https://github.com/stchakwdev))
    
    ---
    
    ## Table of Contents
    
    - [Why This Project?](#why-this-project)
    - [Quick Start](#quick-start)
    - [Batch Evaluation Monitor](#batch-evaluation-monitor)
    - [Evaluation Results](#evaluation-results-300-games)
    - [Visual Analytics](#visual-analytics)
    - [Features](#features)
    - [Architecture](#architecture)
    - [Documentation](#documentation)
    - [Citation](#citation)
    - [Recent Updates](#recent-updates)
    - [Acknowledgments](#acknowledgments)
    - [License](#license)
    - [Contact](#contact)
    
    ---
    
    ## Why This Project?
    
    Understanding how AI systems engage in strategic deception is critical for AI safety research. Secret Hitler provides an ideal testbed because it:
    
    - **Requires hidden information management** - Players must reason about unknown roles and hidden agendas
    - **Involves coalition formation** - Trust and betrayal dynamics emerge naturally from gameplay
    - **Tests deceptive reasoning** - Fascists must convincingly lie while Liberals must detect deception
    - **Produces measurable outcomes** - Win rates, voting patterns, and policy outcomes provide quantifiable metrics
    
    This framework enables researchers to:
    1. **Evaluate deception capabilities** across different LLM architectures
    2. **Study emergent social behaviors** in mult

    Tags

    ai-safetydeception-detectiondeepseekgame-theoryllm-evaluationmachine-learningmulti-agentpythonsocial-deduction

    Comments

    More Agents

    View all
    hybrid-model-workflow

    hybrid-model-workflow

    HAL 分层混合模型工作流 — 强模型(Claude)负责理解/拆解/验收,低成本模型(DeepSeek)负责检索/提取/清洗。Hermes Agent skill。

    P
    ph4ble
    1
    Dynamic-Review-Agent

    Dynamic-Review-Agent

    An LLM agent fine-tuned on DeepSeek for spaced repetition, dynamically integrating knowledge points based on the Ebbinghaus forgetting curve.

    1
    1838177
    1
    StellarOS-Watch

    StellarOS-Watch

    基于 STM32F103 构建的端到端 AI 智能手表生态。自研“零重定位”原生机器码动态加载引擎与页面栈式 UI 框架;集成生产级 OTA 回滚保护机制与高带宽(921600 baud)串口协议栈。通过 Node.js 中继实现 DeepSeek AI 语义控制及 ASRPRO 语音全双工交互,是一个集成了分布式计算、现代存储管理与 AI Agent 的嵌入式全栈工程。

    C
    chenshuang888
    1
    UAVagent1.0deepseek

    UAVagent1.0

    A Meta-Agent-Driven Self-Evolving Multi-Agent System for UAV Detection and Tracking

    S
    StarlitPupils
    2
    hermes-goai-agent

    hermes-go

    One command to run Hermes AI Agent with a browser UI. Zero prerequisites. 一行命令,AI 就位。

    L
    LAI-755
    1
    Agent

    Agent

    网页应用Agent,接入DeepSeek、Mimo等模型

    C
    Cosmos-815
    1

    Stay up to date

    Get the latest DeepSeek prompts, rules, and resources delivered to your inbox weekly.

    Neura Market LogoNeura Market

    Discover the best AI prompts, plugins, and resources for DeepSeek and more.

    Content Types

    • Rules
    • Prompts
    • MCPs
    • Agents
    • Guides

    Platforms

    • ChatGPT Directory
    • Claude Directory
    • Gemini Directory
    • Cursor Directory
    • Grok Directory
    • Perplexity Directory
    • DeepSeek Directory
    • CoPilot Directory
    • Stable Diffusion Directory
    • Midjourney Directory
    • All Directories

    Resources

    • Blog
    • Documentation
    • Help Center
    • Marketplace

    Legal

    • Privacy Policy
    • Terms of Service

    © 2026 Neura Market. All rights reserved.

    |

    Not affiliated with any AI platform vendors.