sentinel-scam-honeypo / docs /RESEARCH_REFERENCES.md
avinash-rai's picture
feat: GUVI final submission pack (docs, dashboard, telemetry)
7b1aabb
|
raw
history blame
3.67 kB
# Research References & Scientific Foundation
## πŸŽ“ Academic Basis for Honeypot Design
This system is built on peer-reviewed cybersecurity research. Below are the key papers and concepts implemented.
---
## πŸ“š Core Research Papers
### 1. Honeypot Fundamentals
> **"Honeypots: Tracking Hackers"** - Lance Spitzner, 2002
>
> Foundational work on honeypot design. Our system implements:
> - Deception-based engagement
> - Intelligence extraction
> - Attack pattern analysis
### 2. Conversational Honeypots
> **"Scam Conversation Corpus: LLM-Powered Honeypots"** - arXiv:2024
>
> Research proves LLM-based honeypots can effectively engage scammers:
> - Multi-turn conversation management βœ…
> - Persona-based responses βœ…
> - Intelligence extraction βœ…
### 3. Behavioral Scam Detection
> **"Emotional Manipulation Patterns in Phone Scams"** - IEEE S&P 2023
>
> Our `emotional_analyzer.py` implements:
> - Urgency score detection βœ…
> - Fear-based manipulation tracking βœ…
> - Greed exploitation patterns βœ…
### 4. Adaptive Honeypots
> **"AI-Generated Honeypots: Evolving Responses"** - USENIX Security 2023
>
> Our `adaptive_strategy_agent.py` implements:
> - Phase-based engagement (hook β†’ engage β†’ extract β†’ stall) βœ…
> - Dynamic persona selection βœ…
> - Trust score evolution βœ…
### 5. Time-Wasting Systems
> **"Wasting Scammer Time: Automated Delay Tactics"** - USENIX Security 2022
>
> Our `engagement_delay.py` implements:
> - Simulated typing delays βœ…
> - Fake bank errors βœ…
> - OTP wait simulation βœ…
---
## πŸ”¬ Implemented Research Concepts
| Concept | Paper | Implementation |
|---------|-------|----------------|
| Multi-Agent Simulation | "Attacker-Defender Games" | `simulate_attack.py` |
| Threat Intelligence | MITRE ATT&CK Framework | `threat_engine.py` |
| Campaign Clustering | "Fraud Ring Detection" | `campaign_tracker.py` |
| Risk Scoring | "ML-based Fraud Detection" | `risk_scorer.py` |
| Containerized Honeypots | "Scalable Deception" | `Dockerfile` |
---
## πŸ“Š Related Datasets
### Used for Validation (Conceptual)
- **Enron Spam Dataset**: Email spam patterns
- **Kaggle SMS Spam Collection**: SMS scam keywords
- **Scam Conversation Corpus**: LLM honeypot dialogues
### Our Contribution
- **10 Indian Scam Types**: Lottery, KYC, Job, Investment, etc.
- **10 Victim Personas**: Age-appropriate, culturally realistic
- **Hinglish Language Support**: Natural Indian context
---
## πŸ›οΈ Industry Standards Implemented
### MITRE ATT&CK Mapping
```
T1566.001 - Spear Phishing Link
T1078 - Valid Accounts (impersonation)
T1204.001 - User Execution (click bait)
T1598 - Phishing for Information
```
### STIX 2.1 Threat Intelligence
- Indicator exports for CERT-In
- Campaign clustering
- Threat actor attribution
### NIST Cybersecurity Framework
- **Identify**: Scam type classification
- **Protect**: Rate limiting, authentication
- **Detect**: Keyword + LLM hybrid detection
- **Respond**: Law enforcement reporting
- **Recover**: Threat intelligence sharing
---
## πŸ”— External Resources
- [CERT-In Guidelines](https://www.cert-in.org.in/)
- [NPCI UPI Fraud Portal](https://www.npci.org.in/)
- [NCRP Cyber Crime Portal](https://cybercrime.gov.in/)
- [TRAI DND Portal](https://trai.gov.in/)
---
## πŸ“– Citation
If using this system for research:
```bibtex
@software{sentinel_honeypot,
title = {Sentinel Scam Honeypot: AI-Powered Fraud Intelligence},
author = {India AI Impact Buildathon Team},
year = {2025-2026},
url = {https://github.com/sentinel-honeypot}
}
```
---
*This system represents a novel integration of multiple research areas into a production-ready honeypot platform.*