yozkut committed · Commit b5cddd7 · verified · 1 Parent(s): d123186

Sync from GitHub via huggingface-sync-action

Files changed (2)
  1. README.md +12 -3
  2. demo.js +1 -0
README.md CHANGED
@@ -61,13 +61,16 @@ Judgy Reachy No Phone combines **NVIDIA-accelerated computer vision**, **LLM-gen
  - **Responds** with personality-matched interventions via 8 distinct AI personalities
  - **Adapts** its reactions based on your offense count and streak performance

+ > **💎 Built From Scratch:** This is not a fork or modification of existing app. Everything is designed and implemented specifically for this project. 100% original architecture and code.
+
  ---

  ## 🚀 Key Features

- - **NVIDIA GPU Acceleration**: TensorRT optimization for 2-3x faster inference
+ - **NVIDIA Technologies**: TensorRT (2-3x speedup) + ONNX Runtime Web (browser inference)
  - **Advanced Object Tracking**: ByteTrack algorithm with adaptive confidence thresholds
  - **8 AI Personalities**: From Angry Boss to Pure Reachy (robot sounds only)
+ - **Browser-Based Demo**: Transformers.js + ONNX Runtime Web with WebGPU/WASM
  - **Multi-Voice TTS**: ElevenLabs premium or Edge TTS free tier
  - **Smart Detection**: Robust phone pickup/putdown with anti-flicker
  - **Behavior Tracking**: Streaks, pickup counts, session stats
@@ -116,14 +119,19 @@ This app is designed to be **100% accessible** regardless of your hardware or bu

  This project leverages the full stack of contest technologies:

- ### ⚡ NVIDIA GPU Acceleration
+ ### ⚡ NVIDIA Technologies

  **TensorRT & CUDA:**
  - **2-3x performance boost** with automatic TensorRT optimization
  - **Auto-detection** of NVIDIA GPUs with CUDA support
- - **FP16 precision** for faster inference
+ - **FP16 precision** for faster inference on desktop/laptop
  - **Automatic fallback** to CPU/MPS when GPU unavailable

+ **ONNX Runtime Web:**
+ - **[ONNX Runtime Web](https://onnxruntime.ai/docs/tutorials/web/)** with WebGPU/WASM in browser demo
+ - **Browser-side inference** using [Transformers.js](https://huggingface.co/docs/transformers.js) (built on ONNX Runtime)
+ - **[ONNX YOLO model](https://huggingface.co/onnx-community/yolo26m-ONNX)** for cross-platform deployment
+
  **→ Detailed technical explanation in [NVIDIA GPU Acceleration](#-nvidia-gpu-acceleration) section below**

  ### 🤗 Hugging Face Ecosystem
@@ -716,6 +724,7 @@ This project was built for the **NVIDIA GTC 2026 Golden Ticket Contest**. Contri
  - **Meta**: Llama 3.1-8B model
  - **ElevenLabs**: High-quality TTS voices
  - **Microsoft**: Edge TTS (free tier)
+ - **webml-community**: WebGPU demo implementation inspired by [YOLO26-WebGPU](https://huggingface.co/spaces/webml-community/YOLO26-WebGPU)

  ### Datasets & Models
  - **Hugging Face**: [pollen-robotics/reachy-mini-emotions-library](https://huggingface.co/datasets/pollen-robotics/reachy-mini-emotions-library)
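
Note on the README's **Smart Detection** bullet: the diff mentions anti-flicker handling for phone pickup/putdown but does not show how it works. One common approach is to require several consecutive frames that disagree with the current state before reporting a change. The sketch below illustrates that idea only; `createPhoneDebouncer`, `framesToConfirm`, and the event names are hypothetical and are not taken from the project.

```javascript
// Illustrative sketch, not the project's implementation.
// All names and the default threshold are assumptions.
function createPhoneDebouncer(framesToConfirm = 3) {
  let state = false; // confirmed state: true means the phone is considered picked up
  let pending = 0;   // consecutive frames that disagree with the confirmed state

  // Feed one per-frame detection result; returns 'pickup', 'putdown', or null.
  return function update(detected) {
    if (detected === state) {
      pending = 0; // the frame agrees with the confirmed state, so reset the counter
      return null;
    }
    pending += 1;
    if (pending < framesToConfirm) return null; // not enough evidence yet
    state = detected; // enough consecutive disagreeing frames: flip the state
    pending = 0;
    return state ? 'pickup' : 'putdown';
  };
}
```

With a threshold of 3, a single stray detection is ignored and an event fires only on the third consecutive frame that contradicts the current state. A larger threshold suppresses more flicker at the cost of slower reactions.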
demo.js CHANGED
@@ -1,5 +1,6 @@
  // Judgy Reachy No Phone - Browser Demo
  // Uses Transformers.js for YOLO detection in the browser
+ // WebGPU implementation inspired by: https://huggingface.co/spaces/webml-community/YOLO26-WebGPU

  import { AutoModel, AutoProcessor, RawImage } from 'https://cdn.jsdelivr.net/npm/@huggingface/transformers@3.8.1';
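
Note on the WebGPU/WASM wording in this commit: the diff does not show how demo.js chooses between the two backends. A minimal sketch of one way to do it is below. It relies on the standard WebGPU entry point `navigator.gpu.requestAdapter()`. The helpers `pickBackend` and `detectBackend` are hypothetical names introduced for illustration.

```javascript
// Sketch only: pickBackend and detectBackend are hypothetical helpers, not part of demo.js.

// Map the result of navigator.gpu.requestAdapter() to a backend name.
function pickBackend(adapter) {
  return adapter ? 'webgpu' : 'wasm';
}

// Probe for WebGPU support. `nav` defaults to the global navigator when one exists,
// and can be replaced with a stub for testing.
async function detectBackend(nav = globalThis.navigator) {
  if (!nav || !nav.gpu) return 'wasm'; // WebGPU API is not exposed in this environment
  try {
    // requestAdapter() resolves to null when no suitable adapter is available.
    const adapter = await nav.gpu.requestAdapter();
    return pickBackend(adapter);
  } catch {
    return 'wasm'; // treat any probing failure as "no WebGPU"
  }
}
```

The returned string could then be passed as the `device` option when loading the model with Transformers.js. Whether demo.js does this is an assumption and should be checked against the file itself.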