Sync from GitHub via huggingface-sync-action
Browse files
README.md
CHANGED
|
@@ -61,13 +61,16 @@ Judgy Reachy No Phone combines **NVIDIA-accelerated computer vision**, **LLM-gen
|
|
| 61 |
- **Responds** with personality-matched interventions via 8 distinct AI personalities
|
| 62 |
- **Adapts** its reactions based on your offense count and streak performance
|
| 63 |
|
|
|
|
|
|
|
| 64 |
---
|
| 65 |
|
| 66 |
## 🚀 Key Features
|
| 67 |
|
| 68 |
-
- **NVIDIA
|
| 69 |
- **Advanced Object Tracking**: ByteTrack algorithm with adaptive confidence thresholds
|
| 70 |
- **8 AI Personalities**: From Angry Boss to Pure Reachy (robot sounds only)
|
|
|
|
| 71 |
- **Multi-Voice TTS**: ElevenLabs premium or Edge TTS free tier
|
| 72 |
- **Smart Detection**: Robust phone pickup/putdown with anti-flicker
|
| 73 |
- **Behavior Tracking**: Streaks, pickup counts, session stats
|
|
@@ -116,14 +119,19 @@ This app is designed to be **100% accessible** regardless of your hardware or bu
|
|
| 116 |
|
| 117 |
This project leverages the full stack of contest technologies:
|
| 118 |
|
| 119 |
-
### ⚡ NVIDIA
|
| 120 |
|
| 121 |
**TensorRT & CUDA:**
|
| 122 |
- **2-3x performance boost** with automatic TensorRT optimization
|
| 123 |
- **Auto-detection** of NVIDIA GPUs with CUDA support
|
| 124 |
-
- **FP16 precision** for faster inference
|
| 125 |
- **Automatic fallback** to CPU/MPS when GPU unavailable
|
| 126 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 127 |
**→ Detailed technical explanation in [NVIDIA GPU Acceleration](#-nvidia-gpu-acceleration) section below**
|
| 128 |
|
| 129 |
### 🤗 Hugging Face Ecosystem
|
|
@@ -716,6 +724,7 @@ This project was built for the **NVIDIA GTC 2026 Golden Ticket Contest**. Contri
|
|
| 716 |
- **Meta**: Llama 3.1-8B model
|
| 717 |
- **ElevenLabs**: High-quality TTS voices
|
| 718 |
- **Microsoft**: Edge TTS (free tier)
|
|
|
|
| 719 |
|
| 720 |
### Datasets & Models
|
| 721 |
- **Hugging Face**: [pollen-robotics/reachy-mini-emotions-library](https://huggingface.co/datasets/pollen-robotics/reachy-mini-emotions-library)
|
|
|
|
| 61 |
- **Responds** with personality-matched interventions via 8 distinct AI personalities
|
| 62 |
- **Adapts** its reactions based on your offense count and streak performance
|
| 63 |
|
| 64 |
+
> **💎 Built From Scratch:** This is not a fork or modification of existing app. Everything is designed and implemented specifically for this project. 100% original architecture and code.
|
| 65 |
+
|
| 66 |
---
|
| 67 |
|
| 68 |
## 🚀 Key Features
|
| 69 |
|
| 70 |
+
- **NVIDIA Technologies**: TensorRT (2-3x speedup) + ONNX Runtime Web (browser inference)
|
| 71 |
- **Advanced Object Tracking**: ByteTrack algorithm with adaptive confidence thresholds
|
| 72 |
- **8 AI Personalities**: From Angry Boss to Pure Reachy (robot sounds only)
|
| 73 |
+
- **Browser-Based Demo**: Transformers.js + ONNX Runtime Web with WebGPU/WASM
|
| 74 |
- **Multi-Voice TTS**: ElevenLabs premium or Edge TTS free tier
|
| 75 |
- **Smart Detection**: Robust phone pickup/putdown with anti-flicker
|
| 76 |
- **Behavior Tracking**: Streaks, pickup counts, session stats
|
|
|
|
| 119 |
|
| 120 |
This project leverages the full stack of contest technologies:
|
| 121 |
|
| 122 |
+
### ⚡ NVIDIA Technologies
|
| 123 |
|
| 124 |
**TensorRT & CUDA:**
|
| 125 |
- **2-3x performance boost** with automatic TensorRT optimization
|
| 126 |
- **Auto-detection** of NVIDIA GPUs with CUDA support
|
| 127 |
+
- **FP16 precision** for faster inference on desktop/laptop
|
| 128 |
- **Automatic fallback** to CPU/MPS when GPU unavailable
|
| 129 |
|
| 130 |
+
**ONNX Runtime Web:**
|
| 131 |
+
- **[ONNX Runtime Web](https://onnxruntime.ai/docs/tutorials/web/)** with WebGPU/WASM in browser demo
|
| 132 |
+
- **Browser-side inference** using [Transformers.js](https://huggingface.co/docs/transformers.js) (built on ONNX Runtime)
|
| 133 |
+
- **[ONNX YOLO model](https://huggingface.co/onnx-community/yolo26m-ONNX)** for cross-platform deployment
|
| 134 |
+
|
| 135 |
**→ Detailed technical explanation in [NVIDIA GPU Acceleration](#-nvidia-gpu-acceleration) section below**
|
| 136 |
|
| 137 |
### 🤗 Hugging Face Ecosystem
|
|
|
|
| 724 |
- **Meta**: Llama 3.1-8B model
|
| 725 |
- **ElevenLabs**: High-quality TTS voices
|
| 726 |
- **Microsoft**: Edge TTS (free tier)
|
| 727 |
+
- **webml-community**: WebGPU demo implementation inspired by [YOLO26-WebGPU](https://huggingface.co/spaces/webml-community/YOLO26-WebGPU)
|
| 728 |
|
| 729 |
### Datasets & Models
|
| 730 |
- **Hugging Face**: [pollen-robotics/reachy-mini-emotions-library](https://huggingface.co/datasets/pollen-robotics/reachy-mini-emotions-library)
|
demo.js
CHANGED
|
@@ -1,5 +1,6 @@
|
|
| 1 |
// Judgy Reachy No Phone - Browser Demo
|
| 2 |
// Uses Transformers.js for YOLO detection in the browser
|
|
|
|
| 3 |
|
| 4 |
import { AutoModel, AutoProcessor, RawImage } from 'https://cdn.jsdelivr.net/npm/@huggingface/transformers@3.8.1';
|
| 5 |
|
|
|
|
| 1 |
// Judgy Reachy No Phone - Browser Demo
|
| 2 |
// Uses Transformers.js for YOLO detection in the browser
|
| 3 |
+
// WebGPU implementation inspired by: https://huggingface.co/spaces/webml-community/YOLO26-WebGPU
|
| 4 |
|
| 5 |
import { AutoModel, AutoProcessor, RawImage } from 'https://cdn.jsdelivr.net/npm/@huggingface/transformers@3.8.1';
|
| 6 |
|