Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

pankajpandey-devย 
posted an update 1 day ago
view post
Post
7382
๐Ÿ‡ฎ๐Ÿ‡ณ Qwen3-4B Hindi Instruct v2 โ€” a Hindi LLM that runs on your own machine
Most strong Hindi-capable models are either huge or cloud-only. I wanted one that's small enough to run locally but actually follows instructions in Hindi โ€” so I fine-tuned Qwen3-4B on 10K Hindi instruction pairs and shipped it with a full GGUF quant ladder.
โœ… Fine-tune (16-bit): huggingface.co/pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2
โœ… GGUF (Q4/Q5/Q8): huggingface.co/pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2-GGUF
Runs in Ollama, llama.cpp, and LM Studio. The Q4_K_M is just 2.5 GB โ€” fits comfortably on a laptop, CPU or GPU.
Part of my Hindi LLM Series โ€” building openly-licensed Indic models for local and edge use. More coming (Gemma next). Feedback welcome ๐Ÿ™
#Hindi #IndicNLP #GGUF #LocalLLM #Qwen
  • 2 replies
ยท
prithivMLmodsย 
posted an update 3 days ago
view post
Post
5917
PiD โ€” Pixel Diffusion Decoder Image Edit Upscale and Image Generation Upscale, an all-in-one demo, is now live on Spaces! Great improvements in realism-based image generation and editing are powered by FLUX.2-Klein, while image generation is paired with Z-Image, and upscaling is enabled by default!

๐Ÿค— Space: prithivMLmods/PiD-Image-Upscaler
๐Ÿ”— Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

๐Ÿค— > To learn more, visit the app page or the respective model pages.
RiverRiderย 
posted an update 3 days ago
view post
Post
4747
SRT-introspect: Live Token-by-Token Readout of LLM Internal Reasoning

I have released SRT-introspect, a new public demonstration that makes the hidden reasoning process of a frozen large language model visible in real time.

The interface runs a Qwen-2.5-7B backbone equipped with the SRT Adapter and Activation Verbalizer. As the model generates each token, the system continuously measures divergence across attention heads, identifies high-signal moments, and translates the corresponding hidden-state object representations into natural-language verbalizations. You see exactly what the model is internally representing at the precise points where its computation is most active, complete with divergence scores, reflexivity estimates, and per-layer traces.

This is not a summary of the final output. It is a direct window into the modelโ€™s latent conceptual landscape, showing the dominant training-data attractors that activate even when the prompt asks for first-principles reasoning. The adaptive scheduler concentrates verbalizations precisely where the real internal work occurs, turning what used to be opaque black-box generation into observable, analyzable data.

The result is the clearest public demonstration yet that modern LLMs possess a rich, structured semiotic infrastructure that can now be audited without retraining or fine-tuning.

Try it:
RiverRider/srt-introspect
pankajpandey-devย 
posted an update 6 days ago
view post
Post
2614
๐Ÿงฌ Just uploaded K-quants of Carbon-3B for llama.cpp users!
@HuggingFaceBio released the original GGUF in bf16 only โ€” so I added the full quant ladder for CPU/edge inference:
โ€ข Q2_K โ†’ 1.4 GB
โ€ข Q3_K_M โ†’ 1.8 GB
โ€ข Q4_K_M โ†’ 2.1 GB โญ
โ€ข Q5_K_M โ†’ 2.4 GB
โ€ข Q6_K โ†’ 2.7 GB
โ€ข Q8_0 โ†’ 3.5 GB
๐Ÿ”— pankajpandey-dev/Carbon-3B-GGUF
Now you can generate DNA sequences on your laptop. Needs a llama.cpp build with PR #23410 (HybridDNATokenizer support).
Huge thanks to the HuggingFaceBio team for the original model ๐Ÿ™
#GGUF #llamacpp #genomics #DNA

ovi054ย 
posted an update about 7 hours ago
view post
Post
29
Color Grading Transferโšก

ovi054/Color-Grade-Transfer-Qwen-Image

What if you could steal color grade from your favorite films or any still image and apply it to your own content. And no, you don't need to be a professional colorist.

Input 1: Source Image - Content to be preserve
Input 2: Reference Image - Any still from films
Output: Color graded output image

๐Ÿ‘‰ Try it now: ovi054/Color-Grade-Transfer-Qwen-Image
prithivMLmodsย 
posted an update about 9 hours ago
kanaria007ย 
posted an update about 17 hours ago
view post
Post
72
โœ… Article highlight: *Worlds as Training Substrates* (art-60-167, v0.1)

TL;DR:
This article argues that gameplay is not automatically a training dataset.

A persistent world can generate incredibly rich traces of action, conflict, coordination, failure, and recovery. But turning that into a learning corpus is a governance problem, not a data-hoarding problem. If you want to say *โ€œModel M was trained on World Wโ€*, you need pinned corpus manifests plus receipted extraction, consent/redaction, decontamination, and training runs.

Read:
kanaria007/agi-structural-intelligence-protocols

Why it matters:
โ€ข turns โ€œworld dataโ€ into a governed learning substrate instead of a vibes dataset
โ€ข makes provenance, canon, and performance posture part of training honesty
โ€ข prevents extraction pipelines from silently rewriting what the world was
โ€ข treats contamination, leakage, and consent as first-class training-governance issues

Whatโ€™s inside:
โ€ข *training corpus manifests* that pin world identity, canon snapshot, and performance posture
โ€ข *learning trace extraction contracts* for what may be pulled from world history
โ€ข *dataset build receipts* and *training run receipts* for provenance
โ€ข *decontamination receipts* for leak prevention and train/eval hygiene
โ€ข governed rules for changing extraction or normalization surfaces without laundering history

Key idea:
Do not say:

*โ€œwe trained on gameplay data.โ€*

Say:

*โ€œthis model was trained on a governed corpus built from this world, under these extraction, redaction, decontamination, and training receipts.โ€*

That is how learning stops being data scavenging and becomes governance with receipts.
  • 2 replies
ยท
pankajpandey-devย 
posted an update 5 days ago
view post
Post
616
๐Ÿ‡ฎ๐Ÿ‡ณ Just shipped: MiniCPM5-1B-Hindi-Instruct (+ GGUF quants)

First Hindi instruction-tuned fine-tune of OpenBMB's brand-new MiniCPM5-1B (released this week).

Trained with Unsloth + LoRA (r=32) on AI4Bharat's anudesh + dolly Hindi splits โ€” ~4k high-quality examples, 2 epochs on a single T4 in 60 minutes.

๐Ÿ”— Model (16-bit + LoRA adapter):
pankajpandey-dev/MiniCPM5-1B-Hindi-Instruct

๐Ÿ“ฆ GGUF quants for llama.cpp / Ollama / LM Studio:
pankajpandey-dev/MiniCPM5-1B-Hindi-Instruct-v1-GGUF

5 quant levels โ€” from Q3_K_M (~560 MB, runs on a Raspberry Pi) to Q8_0 (~1.2 GB, near-lossless). Q4_K_M is the recommended default.

Part of my ongoing ๐Ÿ‡ฎ๐Ÿ‡ณ Hindi LLM Series โ€” bringing strong open-source LLMs to Indian languages.

#Hindi #IndicNLP #MiniCPM5 #LoRA #Unsloth #GGUF #llamacpp #Ollama #LocalLLM
pankajpandey-devย 
posted an update 8 days ago
view post
Post
249
Just released Qwen3-0.6B fine-tuned on Hindi instruction data ๐Ÿ‡ฎ๐Ÿ‡ณ

โœ… Full model: pankajpandey-dev/Qwen3-0.6B-Hindi-Instruct-v1
โœ… GGUF versions (Q2/Q4/Q5/Q8): pankajpandey-dev/Qwen3-0.6B-Hindi-Instruct-v1-GGUF

Smallest Hindi-capable GGUF โ€” runs on any laptop at 0.37GB.
Next: v2 with more data, better responses.

#Hindi #LLM #GGUF #OpenSource
danielhanchenย 
posted an update 11 days ago