linzju commited on
Commit
f0cdc25
·
verified ·
1 Parent(s): f55f292

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ language:
4
+ - en
5
+ base_model: meta-llama/Meta-Llama-3-8B
6
+ tags:
7
+ - medical
8
+ - safety
9
+ - alignment
10
+ - S&P2026
11
+ - EnchTable
12
+ pipeline_tag: text-generation
13
+ ---
14
+
15
+ # EnchTable: Unified Safety Alignment Transfer (Bio-Medical + Attention)
16
+
17
+ This repository contains the **Bio-Medical Llama-3-8B** model aligned using the **Attention-based** intervention of the **EnchTable** framework.
18
+
19
+ This model is part of the research presented in the paper:
20
+ **"EnchTable: Unified Safety Alignment Transfer in Fine-tuned Large Language Models"**, accepted at **IEEE S&P 2026**.
21
+
22
+ ## Model Details
23
+
24
+ - **Name:** Bio-Medical Llama-3-8B (EnchTable-Attention)
25
+ - **Domain:** Bio-Medical / Healthcare
26
+ - **Method:** EnchTable (Attention Layers)
27
+ - **Base Model:** Llama-3-8B fine-tuned on medical datasets
28
+
29
+
30
+ **EnchTable** is a framework designed to transfer safety alignment capabilities from a general-purpose safe model to specialized downstream models.
31
+
32
+ In this checkpoint, we apply **safety vectors specifically to the Self-Attention layers**. This ensures the model reduces harmful outputs (hallucinations, toxicity) while maintaining high performance in medical question-answering tasks.
33
+