Instructions to use SoufianeDahimi/child_trauma_assessment_gemma-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use SoufianeDahimi/child_trauma_assessment_gemma-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="SoufianeDahimi/child_trauma_assessment_gemma-GGUF", filename="child_trauma_assessment_gemma.Q8_0.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use SoufianeDahimi/child_trauma_assessment_gemma-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0 # Run inference directly in the terminal: llama-cli -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0 # Run inference directly in the terminal: llama-cli -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0 # Run inference directly in the terminal: ./llama-cli -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0 # Run inference directly in the terminal: ./build/bin/llama-cli -hf SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
Use Docker
docker model run hf.co/SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
- LM Studio
- Jan
- Ollama
How to use SoufianeDahimi/child_trauma_assessment_gemma-GGUF with Ollama:
ollama run hf.co/SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
- Unsloth Studio
How to use SoufianeDahimi/child_trauma_assessment_gemma-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for SoufianeDahimi/child_trauma_assessment_gemma-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for SoufianeDahimi/child_trauma_assessment_gemma-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for SoufianeDahimi/child_trauma_assessment_gemma-GGUF to start chatting
- Docker Model Runner
How to use SoufianeDahimi/child_trauma_assessment_gemma-GGUF with Docker Model Runner:
docker model run hf.co/SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
- Lemonade
How to use SoufianeDahimi/child_trauma_assessment_gemma-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull SoufianeDahimi/child_trauma_assessment_gemma-GGUF:Q8_0
Run and chat with the model
lemonade run user.child_trauma_assessment_gemma-GGUF-Q8_0
List all available models
lemonade list
Child Trauma Assessment Gemma - Specialized Model for Trauma Assessment
This is a fine-tuned version of Gemma 3N specifically optimized for conducting trauma assessments with children from conflict zones. The model is designed to facilitate empathetic, culturally-sensitive conversations and generate professional psychological reports.
Model Description
- Base Model: Gemma 3N (unsloth/gemma-3n-E2B-it)
- Fine-tuning Method: LoRA (r=16, alpha=16)
- Training Focus: Specialized for trauma assessment conversations and report generation
- Languages: Multilingual support (Arabic dialects, Ukrainian, English)
- Context Length: 2048 tokens
Training Details
Dataset and Training Script: Details about training script and datast curation can be found here: https://github.com/Dahimi/Gemma3n_Finetune_Child_Trauma
Fine-tuning Configuration:
- Batch Size: 1 with gradient accumulation (4 steps)
- Learning Rate: 2e-4 with linear scheduler
- Training Focus: Conversation responses only (loss masked on user inputs)
- LoRA Parameters: r=16, alpha=16
- Context Length: 2048 tokens
Intended Uses
This model is designed for:
- Conducting initial trauma assessments with children from conflict zones
- Supporting mental health professionals and volunteers
- Generating standardized trauma assessment reports
- Facilitating multilingual mental health support
Primary Functions
Conversational Assessment:
- Empathetic dialogue with parents/caregivers
- Culturally-appropriate questioning
- Trauma indicator identification
Report Generation:
- Structured professional assessments
- Severity scoring (1-10 scale)
- Risk indicator analysis
- Cultural context documentation
Language Support
The model supports:
- Palestinian/Levantine Arabic
- Sudanese Arabic
- Ukrainian
- English
Note: All assessment reports are generated in professional English regardless of conversation language
Limitations & Biases
- This model should not replace professional mental health assessment
- Should be used as a support tool under professional supervision
- May have limitations in understanding extremely specific cultural contexts
- Not a substitute for emergency mental health intervention
Training Data
The model was trained on a specialized dataset containing:
- Conversation Examples: Multi-turn dialogues between parents and AI
- Report Generation Examples: Professional assessment report templates
Data format follows Gemma's chat template with alternating roles.
Usage
from transformers import TextStreamer
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
"SoufianeDahimi/child_trauma_assessment_gemma-GGUF",
max_seq_length = 2048,
load_in_4bit = True,
)
def generate_response(prompt):
messages = [{
"role": "user",
"content": [{"type": "text", "text": prompt}]
}]
inputs = tokenizer.apply_chat_template(
messages,
add_generation_prompt = True,
return_tensors = "pt",
tokenize = True,
return_dict = True,
).to("cuda")
return model.generate(
**inputs,
max_new_tokens = 512,
temperature = 0.7,
top_p = 0.95,
top_k = 64,
streamer = TextStreamer(tokenizer, skip_prompt = True),
)
Output Format
The model generates two types of responses:
Conversational Responses:
- Empathetic dialogue
- Follow-up questions
- Guidance and support
Assessment Reports:
- Parent observations summary
- Trauma indicator analysis
- Severity scoring
- Risk assessment
- Cultural context notes
Ethical Considerations
This model is designed for sensitive mental health contexts and should be used with appropriate care:
- Always use under professional supervision
- Maintain strict privacy and data protection
- Consider cultural and contextual sensitivities
- Follow ethical guidelines for mental health assessment
Citations
If you use this model in your research or application, please cite:
- Downloads last month
- 7
8-bit