PEFT
Safetensors

Model Card for Model ID

This LoRA model is a product of the research paper "LLM QLoRA Fine-Tuning of Llama, DeepSeek, and Qwen: A Skyrim Case Study" (to appear in IEEE Access). The study investigates the interplay between model scale (~8B, ~13B, ~33B), architecture, and data formatting in adapting Large Language Models to knowledge-intensive domains.

By utilizing a multi-stage data cycling strategy and 4-bit Quantized Low-Rank Adaptation (QLoRA), this model was fine-tuned to master the lore of The Elder Scrolls V: Skyrim. The training process involved rigorous hyperparameter optimization, comparing unstructured, structured, and summarized datasets across varying LoRA ranks (16, 32, 64) to determine the optimal configuration for factual recall and narrative fluency. The resulting model demonstrates the capability to generate high-fidelity, hallucination-resistant character biographies and attribute lists, as validated by a comprehensive ensemble LLM-as-a-Judge evaluation framework.

Model Details

Model Description

  • Skyrim Lore - Llama-2-13B
  • Description: This model proves that legacy architectures can still compete when trained correctly. By using the Summarized dataset and a high LoRA Rank of 64, it overcame its older foundation to achieve a competitive Factual Score (3.2), significantly outperforming its base baseline.
  • Best Configuration: Summarized Dataset | Rank 64
  • Base Model: unsloth/Llama-2-13b-chat-hf
  • Note: You can also use the pre-quantized version unsloth/Llama-2-13b-chat-hf-bnb-4bit.

Framework versions

  • PEFT 0.15.2
Downloads last month
1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MarcosEdu/Skyrim_Llama-2-13B

Adapter
(146)
this model

Collection including MarcosEdu/Skyrim_Llama-2-13B