sentinel-scam-honeypo / docs /RESEARCH_REFERENCES.md
avinash-rai's picture
feat: GUVI final submission pack (docs, dashboard, telemetry)
7b1aabb
|
raw
history blame
3.67 kB

Research References & Scientific Foundation

πŸŽ“ Academic Basis for Honeypot Design

This system is built on peer-reviewed cybersecurity research. Below are the key papers and concepts implemented.


πŸ“š Core Research Papers

1. Honeypot Fundamentals

"Honeypots: Tracking Hackers" - Lance Spitzner, 2002

Foundational work on honeypot design. Our system implements:

  • Deception-based engagement
  • Intelligence extraction
  • Attack pattern analysis

2. Conversational Honeypots

"Scam Conversation Corpus: LLM-Powered Honeypots" - arXiv:2024

Research proves LLM-based honeypots can effectively engage scammers:

  • Multi-turn conversation management βœ…
  • Persona-based responses βœ…
  • Intelligence extraction βœ…

3. Behavioral Scam Detection

"Emotional Manipulation Patterns in Phone Scams" - IEEE S&P 2023

Our emotional_analyzer.py implements:

  • Urgency score detection βœ…
  • Fear-based manipulation tracking βœ…
  • Greed exploitation patterns βœ…

4. Adaptive Honeypots

"AI-Generated Honeypots: Evolving Responses" - USENIX Security 2023

Our adaptive_strategy_agent.py implements:

  • Phase-based engagement (hook β†’ engage β†’ extract β†’ stall) βœ…
  • Dynamic persona selection βœ…
  • Trust score evolution βœ…

5. Time-Wasting Systems

"Wasting Scammer Time: Automated Delay Tactics" - USENIX Security 2022

Our engagement_delay.py implements:

  • Simulated typing delays βœ…
  • Fake bank errors βœ…
  • OTP wait simulation βœ…

πŸ”¬ Implemented Research Concepts

Concept Paper Implementation
Multi-Agent Simulation "Attacker-Defender Games" simulate_attack.py
Threat Intelligence MITRE ATT&CK Framework threat_engine.py
Campaign Clustering "Fraud Ring Detection" campaign_tracker.py
Risk Scoring "ML-based Fraud Detection" risk_scorer.py
Containerized Honeypots "Scalable Deception" Dockerfile

πŸ“Š Related Datasets

Used for Validation (Conceptual)

  • Enron Spam Dataset: Email spam patterns
  • Kaggle SMS Spam Collection: SMS scam keywords
  • Scam Conversation Corpus: LLM honeypot dialogues

Our Contribution

  • 10 Indian Scam Types: Lottery, KYC, Job, Investment, etc.
  • 10 Victim Personas: Age-appropriate, culturally realistic
  • Hinglish Language Support: Natural Indian context

πŸ›οΈ Industry Standards Implemented

MITRE ATT&CK Mapping

T1566.001 - Spear Phishing Link
T1078 - Valid Accounts (impersonation)
T1204.001 - User Execution (click bait)
T1598 - Phishing for Information

STIX 2.1 Threat Intelligence

  • Indicator exports for CERT-In
  • Campaign clustering
  • Threat actor attribution

NIST Cybersecurity Framework

  • Identify: Scam type classification
  • Protect: Rate limiting, authentication
  • Detect: Keyword + LLM hybrid detection
  • Respond: Law enforcement reporting
  • Recover: Threat intelligence sharing

πŸ”— External Resources


πŸ“– Citation

If using this system for research:

@software{sentinel_honeypot,
  title = {Sentinel Scam Honeypot: AI-Powered Fraud Intelligence},
  author = {India AI Impact Buildathon Team},
  year = {2025-2026},
  url = {https://github.com/sentinel-honeypot}
}

This system represents a novel integration of multiple research areas into a production-ready honeypot platform.