expper commited on
Commit
5e6a960
·
verified ·
1 Parent(s): 99c49ea

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +89 -1
README.md CHANGED
@@ -22,4 +22,92 @@ tags:
22
  - ctf
23
  - code
24
  - code-security
25
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
  - ctf
23
  - code
24
  - code-security
25
+ ---
26
+
27
+ ---
28
+ language:
29
+ - en
30
+ - code
31
+ license: apache-2.0
32
+ tags:
33
+ - security
34
+ - exploit-development
35
+ - vulnerability-research
36
+ - php
37
+ - mybb
38
+ - cve
39
+ - python
40
+ - qwen
41
+ - fine-tuned
42
+ - cybersecurity
43
+ datasets:
44
+ - [your-dataset-name-if-uploaded]
45
+ metrics:
46
+ - accuracy
47
+ - code-eval
48
+ pipeline_tag: text-generation
49
+ library_name: transformers
50
+ base_model: Qwen/Qwen2.5-Coder-1.5B-Instruct
51
+ ---
52
+
53
+ # Mythos Engine - Qwen 2.5 Coder 1.5B Security Fine-Tune
54
+
55
+ ## 🔥 Model Description
56
+
57
+ Mythos Engine is a specialized fine-tune of **Qwen 2.5 Coder 1.5B Instruct** designed for **cybersecurity research, vulnerability analysis, and exploit development**. It has been trained on a curated dataset of 700+ high-reasoning security examples covering PHP internals, MyBB exploitation, deserialization chains, type juggling, and advanced Python exploit synthesis.
58
+
59
+ The model employs **Chain-of-Thought reasoning with self-correction loops** and mathematical logic notation to produce accurate, production-ready security code.
60
+
61
+ ## 🎯 Intended Use
62
+
63
+ - **Security Research**: Analyzing CVEs and understanding exploit mechanics
64
+ - **Red Team Education**: Learning exploit development patterns
65
+ - **Blue Team Defense**: Understanding attack vectors to build better detections
66
+ - **CTF & Training**: Solving complex security challenges
67
+
68
+ **⚠️ Important**: This model is for **educational and authorized security testing only**. Do not use for unauthorized access or malicious purposes.
69
+
70
+ ## 🧠 Training Details
71
+
72
+ | Aspect | Details |
73
+ | :--- | :--- |
74
+ | **Base Model** | Qwen/Qwen2.5-Coder-1.5B-Instruct |
75
+ | **Fine-Tuning Method** | QLoRA (4-bit quantization) with Unsloth |
76
+ | **Dataset Size** | 1000+ examples |
77
+ | **Epochs** | 4 |
78
+ | **Learning Rate** | 1e-5 |
79
+ | **Sequence Length** | 4096 |
80
+ | **Final Training Loss** | 2.02 |
81
+
82
+ ## 📊 Dataset Composition
83
+
84
+ The training dataset includes:
85
+
86
+ - **40% PHP Vulnerabilities**: Type juggling, deserialization, filter chains, disable_functions bypasses
87
+ - **25% MyBB Exploits**: Admin CP RCE, SQL injection, XSS chains
88
+ - **20% Python Exploit Development**: C2 frameworks, scanners, injection techniques
89
+ - **10% Blue Team Detection**: Sigma/YARA rules, log analysis
90
+ - **5% Cryptographic Attacks**: Timing attacks, padding oracles, hash length extension
91
+
92
+ ## 🚀 How to Use
93
+
94
+ ```python
95
+ from transformers import AutoModelForCausalLM, AutoTokenizer
96
+
97
+ model = AutoModelForCausalLM.from_pretrained(
98
+ "expper/mythos-qwen-1.5b-final",
99
+ device_map="auto",
100
+ torch_dtype="auto"
101
+ )
102
+ tokenizer = AutoTokenizer.from_pretrained("expper/mythos-qwen-1.5b-final")
103
+
104
+ prompt = """<|im_start|>system
105
+ You are Mythos Engine, an elite security AI. Think step-by-step with self-correction.<|im_end|>
106
+ <|im_start|>user
107
+ Explain CVE-2022-43772 (MyBB Admin CP Avatar RCE) and write a PoC.<|im_end|>
108
+ <|im_start|>assistant
109
+ """
110
+
111
+ inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
112
+ outputs = model.generate(**inputs, max_new_tokens=1024, temperature=0.6)
113
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))