Safetensors
GGUF
English
GGUF
qwen2
conversational

🇮🇳 Indian Legal Instruction Fine-Tuning

Qwen 2.5 (0.5B) — Domain-Specific LLM

Overview

This project fine-tunes Qwen 2.5 (0.5B) on a custom-built Indian legal instruction dataset to improve structured reasoning, drafting, and legal Q&A.

Core idea: maximize small model capability using high-quality, domain-specific data.


📊 Training Metrics

Training Metrics (Loss + Grad Norm)

Observations:

  • Smooth loss convergence
  • Stable gradient norm
  • Training stabilizes after early-stage fixes

🧪 Before vs After Fine-Tuning

Example 1

Before vs After 1

Example 2

Before vs After 2

Example 3

Before vs After 3

Overall Improvements:

  • Outputs become more structured and readable
  • Stronger legal terminology and accuracy
  • Better long-form coherence
  • Reduced vague / generic responses

🧠 Dataset

  • Domain: Indian Law (Acts, provisions, procedures)
  • Format: Instruction → Response
  • Language: English
  • Size: ~171K samples (v2)

Characteristics:

  • High information density
  • Structured legal reasoning
  • Long-form responses
  • Reduced noise and redundancy

📂 Data Format

{
  "instruction": "Question about a legal concept",
  "response": "Structured legal answer with explanation"
}
Downloads last month
207
Safetensors
Model size
0.5B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kaushik-harsh-99/Qwen-IndianLegal-Instruct-v1

Quantizations
1 model

Dataset used to train kaushik-harsh-99/Qwen-IndianLegal-Instruct-v1

Collection including kaushik-harsh-99/Qwen-IndianLegal-Instruct-v1