judgy_reachy_no_phone

Running

File size: 30,631 Bytes

---
title: Judgy Reachy No Phone
emoji: 📱
colorFrom: red
colorTo: purple
sdk: static
pinned: false
hf_oauth: true
license: apache-2.0
short_description: Robot shames you for phone addiction with AI vision
tags:
 - reachy_mini
 - reachy_mini_python_app
 - productivity
models:
 - onnx-community/yolo26m-ONNX
 - Ultralytics/YOLO26
 - meta-llama/Llama-3.1-8B
datasets:
 - pollen-robotics/reachy-mini-emotions-library
---

# 📱 Judgy Reachy No Phone 🤖

**A Reachy Mini app that uses NVIDIA-accelerated computer vision to detect phone usage and deliver personalized robot interventions through 8 distinct AI personalities.**

Built for the **NVIDIA GTC 2026 Golden Ticket Contest** in partnership with Pollen Robotics & Hugging Face.

[![Demo Video](https://img.shields.io/badge/Watch-Demo-red?style=for-the-badge&logo=youtube)](https://www.linkedin.com/feed/update/urn:li:activity:7420180578961907712/)
[![Try it Live](https://img.shields.io/badge/Try-Live%20Demo-blue?style=for-the-badge&logo=huggingface)](https://huggingface.co/spaces/yozkut/judgy_reachy_no_phone)
[![Tech Stack](https://img.shields.io/badge/NVIDIA%20%2B%20Partners-Tech%20Stack-green?style=for-the-badge&logo=nvidia)](#-nvidia-and-partner-technologies-integration)

<div align="center">

<img src="quick_demo.gif" alt="Judgy Reachy No Phone Demo" width="700">

*Real-time phone detection with YOLO26m + TensorRT, 8 AI personalities, and expressive robot reactions*

</div>

---

## ⚡ Quick Start

**Want to try it right now?**
- 🌐 **[Try Web Demo](https://huggingface.co/spaces/yozkut/judgy_reachy_no_phone#demo)** - No installation, runs in browser (Transformers.js + ONNX)
- 🚀 **[Install Locally](#️-installation)** - Full experience with all 8 personalities (multiple install options)

📖 **[Usage Instructions](#-usage)** • ⚙️ **[Configuration](#️-configuration)**

---

## 🎯 The Problem

Phone addiction is a growing productivity killer. Traditional app blockers fail because they're easy to bypass or disable. What if a physical robot could intervene with personalized, funny, and emotionally engaging feedback?

## 💡 The Solution

Judgy Reachy No Phone combines **NVIDIA-accelerated computer vision**, **LLM-generated responses**, and **expressive robotics** to create a physical productivity guardian that:
- **Detects** phone pickups in real-time using YOLO26m with TensorRT optimization
- **Tracks** your behavior patterns with ByteTrack persistent object tracking
- **Responds** with personality-matched interventions via 8 distinct AI personalities
- **Adapts** its reactions based on your offense count and streak performance

> **💎 Built From Scratch:** This is not a fork or modification of existing app. Everything is designed and implemented specifically for this project. 100% original architecture and code.

---

## 🚀 Key Features

- **NVIDIA Technologies**: TensorRT (2-3x speedup) + ONNX Runtime Web (browser inference)
- **Advanced Object Tracking**: ByteTrack algorithm with adaptive confidence thresholds
- **8 AI Personalities**: From Angry Boss to Pure Reachy (robot sounds only)
- **Browser-Based Demo**: Transformers.js + ONNX Runtime Web with WebGPU/WASM
- **Multi-Voice TTS**: ElevenLabs premium or Edge TTS free tier
- **Smart Detection**: Robust phone pickup/putdown with anti-flicker
- **Behavior Tracking**: Streaks, pickup counts, session stats
- **Expressive Animations**: Personality-matched robot reactions
- **100% Free Tier**: Works without any API keys or NVIDIA GPU

---

## 🌐 Accessibility - Multiple Ways to Try It

This app is designed to be **100% accessible** regardless of your hardware or budget:

### 💰 **100% Free Tier** (No API Keys Required)
- **Responses**: Pre-written personality lines (no LLM needed)
- **Voice**: Edge TTS (unlimited, free forever)
- **Cost**: $0 - Works completely offline for responses

### ⚡ **Optional Premium Tier** (Free APIs Available)
- **LLM Responses**: Groq API - Llama 3.1-8B (free tier available)
- **Premium Voice**: ElevenLabs API - 10k chars/month free
- **Dynamic**: AI-generated responses that adapt to context
- **Cost**: $0 with free API tiers

### 🖥️ **Hardware Flexibility** (GPU Optional)
- **NVIDIA GPU**: TensorRT acceleration (2-3x faster)
- **Apple Silicon**: MPS GPU support
- **CPU Only**: Full functionality, slightly slower inference
- **Auto-detection**: Automatically uses best available hardware

### 🤖 **Robot Options** (Physical Robot Optional)
- **[Try it NOW - Web Demo](https://huggingface.co/spaces/yozkut/judgy_reachy_no_phone#demo)**: No robot needed! Runs in your browser using **[Transformers.js](https://huggingface.co/docs/transformers.js/en/index)** from Hugging Face + **[ONNX YOLO](https://huggingface.co/onnx-community/yolo26m-ONNX)** (Pure Reachy mode only)
- **Simulation Mode**: Full app with laptop webcam (all 8 personalities, no physical robot)
- **Reachy Mini Lite**: Complete experience with wired robot connection
- **Reachy Mini Wireless**: Full wireless robot experience

### 🎨 **Engaging UX**
- **8 personalities** make intervention fun, not annoying
- **Customizable**: Add your own personalities, voices, animations
- **Extensible**: Easy to modify and adapt to your needs

**→ Anyone can try this right now, for free, without any hardware, API keys, or setup!**

---

## 🤝 NVIDIA and Partner Technologies Integration

This project leverages the full stack of contest technologies:

### ⚡ NVIDIA Technologies

**TensorRT & CUDA:**
- **2-3x performance boost** with automatic TensorRT optimization
- **Auto-detection** of NVIDIA GPUs with CUDA support
- **FP16 precision** for faster inference on desktop/laptop
- **Automatic fallback** to CPU/MPS when GPU unavailable

**ONNX Runtime Web:**
- **[ONNX Runtime Web](https://onnxruntime.ai/docs/tutorials/web/)** with WebGPU/WASM in browser demo
- **Browser-side inference** using [Transformers.js](https://huggingface.co/docs/transformers.js) (built on ONNX Runtime)
- **[ONNX YOLO model](https://huggingface.co/onnx-community/yolo26m-ONNX)** for cross-platform deployment

**→ Detailed technical explanation in [NVIDIA GPU Acceleration](#-nvidia-gpu-acceleration) section below**

### 🤗 Hugging Face Ecosystem

**Model Hub & Inference:**
- **[ONNX YOLO](https://huggingface.co/onnx-community/yolo26m-ONNX)** - Used in web demo via Transformers.js
- **[Transformers.js](https://huggingface.co/docs/transformers.js)** - Browser-based ML inference (no server needed!)

**Dataset:**
- **[reachy-mini-emotions-library](https://huggingface.co/datasets/pollen-robotics/reachy-mini-emotions-library)** - Pre-recorded robot emotions for Pure Reachy mode

**Deployment:**
- **[HF Spaces](https://huggingface.co/spaces/yozkut/judgy_reachy_no_phone)** - Web demo hosting with instant deployment
- **GitHub Actions → HF Sync** - Automatic synchronization using [custom fork](https://github.com/yaseminozkut/huggingface-sync-action)

### 🤖 Reachy Mini (Pollen Robotics)

**SDK Integration:**
- Full integration with [Reachy Mini SDK](https://github.com/pollen-robotics/reachy_mini)
- Supports **Simulation**, **Lite**, and **Wireless** modes
- Multi-platform installation (macOS, Windows, Linux)

**Robot Capabilities:**
- **Expressive animations** - Head movements, antenna gestures
- **Emotion library** - Access to 20+ pre-recorded emotional reactions
- **Multiple deployment options** - SDK app store, Desktop app, or pip install

**App Store Integration:**
- One-click install via Reachy Mini dashboard (localhost:8000)
- Available in [Reachy Mini Desktop App](https://github.com/pollen-robotics/reachy-mini-desktop-app)
- Community apps distribution

---

## 🎮 NVIDIA GPU Acceleration

### **TensorRT Optimization** (2-3x Speed Boost!)
- **Auto-detection** of NVIDIA GPUs with CUDA support
- **One-time export** to TensorRT engine for maximum performance
- **Automatic fallback** to PyTorch/CPU if NVIDIA GPU unavailable
- **FP16 precision** for faster inference without accuracy loss

```python
# Automatic TensorRT optimization on NVIDIA GPUs
if torch.cuda.is_available():
    device = 'cuda'
    # Export YOLO to TensorRT (one-time, ~1-2 min)
    model.export(format='engine', device=0, half=True, workspace=4)
    # Inference is now 2-3x faster! 🚀
```

### **Performance Benchmarks**

Measured on **NVIDIA Tesla T4** (Google Colab) for YOLO26m:

| Backend | Hardware | FPS | Latency | TensorRT Speedup | vs CPU |
|---------|----------|-----|---------|------------------|--------|
| **TensorRT** | NVIDIA T4 GPU | **132.7** | 7.5ms | **2.69x** | 121.4x |
| PyTorch | NVIDIA T4 GPU | 49.4 | 20.3ms | 1.0x | 45.1x |
| PyTorch | CPU | 1.1 | 914.3ms | - | 1.0x |

**Key Insights:**
- 🚀 **TensorRT optimization provides 2.69x speedup** over PyTorch on the same NVIDIA GPU
- ⚡ **NVIDIA GPU acceleration** provides 45x speedup over CPU (PyTorch)
- 🎯 **Combined effect**: 121x faster than CPU inference

*Real-time phone detection at 132+ FPS enables responsive, sub-8ms reaction times.*

---

## 👁️ Computer Vision & Object Tracking

### **YOLO26m Object Detection**
- Latest YOLO model from Ultralytics (2026 release)
- Trained on COCO dataset (class 67: "cell phone")
- Optimized for edge deployment (runs faster on NVIDIA hardware with TensorRT)
- Links: [Ultralytics/YOLO26](https://huggingface.co/Ultralytics/YOLO26), [ONNX version](https://huggingface.co/onnx-community/yolo26m-ONNX)

### **ByteTrack Object Tracking**
- Industry-standard multi-object tracking with persistent IDs
- **Adaptive Confidence Thresholds**: 0.5 for initial detection, 0.2 when tracking existing objects
- **Robust to Occlusion**: Maintains track IDs even when phone temporarily hidden
- **Real-time Performance**: ~100 FPS camera capture, ~33 FPS detection rate

---

## 🤖 AI-Powered Personality System

**8 Distinct Robot Personalities** powered by Meta's **Llama 3.1-8B-instant** (via [Groq](https://console.groq.com) - free API), each with carefully selected Edge TTS and ElevenLabs voices:

| Personality | Example Shame | Example Praise |
|------------|---------------|----------------|
| 🤖 **Pure Reachy** | *disgusted1.wav* (robot sound) | *success1.wav* (robot sound) |
| 😠 **Angry Boss** | "We have deadlines!" | "About time." |
| 🎭 **Sarcastic** | "Work can wait, obviously." | "Shocking development." |
| 😔 **Disappointed Parent** | "Expected more from you." | "So proud of you." |
| 💪 **Motivational Coach** | "Champions don't quit!" | "YES! That's it!" |
| 🤡 **Absurdist** | "Screen goblins summon you?" | "The desk thanks you." |
| 🤖 **Corporate AI** | "Productivity declining." | "Status: compliant." |
| 🎩 **British Butler** | "If I may suggest..." | "Very good, sir." |
| 🐣 **Chaos Baby** | *Random personality each time* | *Unpredictable!* |

**Pure Reachy Mode**: Uses [pollen-robotics/reachy-mini-emotions-library](https://huggingface.co/datasets/pollen-robotics/reachy-mini-emotions-library) dataset for emotion-based interactions without text-to-speech.

### 🎨 Expressive Robot Animations

**TTS Personalities** (Angry Boss, Sarcastic, etc.):
- **Curious Look** (1st offense): Gentle head tilt with antenna twitch
- **Disappointed Shake** (2-3 offenses): Triple head shake with drooping antennas
- **Dramatic Sigh** (4+ offenses): Exasperated look-up, slump, and turn away
- **Approving Nod** (phone down): Enthusiastic double-nod celebration
- **Idle Breathing** (monitoring): Gentle antenna movements while watching

**Pure Reachy Mode**:
- Uses pre-recorded emotion animations from [pollen-robotics/reachy-mini-emotions-library](https://huggingface.co/datasets/pollen-robotics/reachy-mini-emotions-library)
- **Shame emotions**: disgusted1, resigned1, displeased1/2, rage1, no1, reprimand1/3, dying1, surprised1/2
- **Praise emotions**: welcoming2, inquiring1/2, proud1/3, success1/2, enthusiastic1/2, grateful1, yes1, cheerful1
- Each emotion includes synchronized sound + animation

### 📊 Smart Behavior Tracking

- **Phone Pickup Counter**: Total pickups in current session
- **Shame Counter**: How many times robot intervened
- **Current Streak**: Time since last phone pickup
- **Best Streak**: Longest phone-free period achieved
- **Continue/Pause**: Preserve stats when stopping monitoring

### 🔊 Multi-Voice TTS System

Each personality has carefully selected voices that match their speaking style and tone:

**Free Tier (Unlimited) - Edge TTS**:
- 🤖 **Pure Reachy**: Robot sounds only (no TTS)
- 😠 **Angry Boss**: `en-US-EricNeural` (deep, stern male)
- 🎭 **Sarcastic**: `en-US-AvaMultilingualNeural` (dry wit)
- 😔 **Disappointed Parent**: `en-US-AvaNeural` (soft, empathetic)
- 💪 **Motivational Coach**: `en-US-GuyNeural` (energetic male)
- 🤡 **Absurdist**: `en-US-AriaNeural` (playful, expressive)
- 🤖 **Corporate AI**: `en-US-MichelleNeural` (neutral, professional)
- 🎩 **British Butler**: `en-GB-RyanNeural` (polite British male)
- 🐣 **Chaos Baby**: `en-US-AnaNeural` (versatile)

**Premium Tier (Optional) - ElevenLabs**:
- 🤖 **Pure Reachy**: Robot sounds only (no ElevenLabs)
- 😠 **Angry Boss**: Jerry B. (Gruff Commander) → Eric (Smooth, Trustworthy)
- 🎭 **Sarcastic**: Laura (Enthusiast, Quirky Attitude)
- 😔 **Disappointed Parent**: Alice (Clear, Engaging)
- 💪 **Motivational Coach**: Charlie (Deep, Confident, Energetic)
- 🤡 **Absurdist**: Jessica (Playful, Bright, Warm)
- 🤖 **Corporate AI**: Eva (Futuristic Robot Helper) → Sarah (Mature, Reassuring)
- 🎩 **British Butler**: George (Warm, Captivating Storyteller)
- 🐣 **Chaos Baby**: Custom Voice → Candy (Young and Sweet) → Jessica (Playful)

*Note: Multiple voices per personality ensure fallback if one is unavailable. System tries voices in order.*
- Voice validation with automatic fallback to Edge TTS
- 10k characters/month free tier → **[Get free API key](https://elevenlabs.io)**

### 🎯 Detection Features

- **Smart Pickup Detection**: 3 consecutive frames to confirm (avoids false positives)
- **Smart Putdown Detection**: 15 frames to confirm (avoids flicker)
- **Adaptive Cooldown**: Configurable time between interventions (10-120s)
- **Periodic Reminders**: Continuous shaming while phone in hand
- **Praise Mode**: Optional celebration when phone is put down

## 🏗️ Architecture

```
┌─────────────────────────────────────────────────────────────┐
│  NVIDIA GPU (CUDA + TensorRT)                               │
│  ├─ YOLO26m Detection (30-60 FPS)                           │
│  ├─ ByteTrack Tracking (Persistent IDs)                     │
│  └─ Adaptive Confidence Thresholds                          │
└─────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────┐
│  Behavior Analysis Engine                                   │
│  ├─ Pickup/Putdown State Machine                            │
│  ├─ Streak Tracking                                         │
│  └─ Cooldown Management                                     │
└─────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────┐
│  LLM Response Generation (Groq / Prewritten)                │
│  ├─ Llama 3.1-8B-instant (Groq API)                         │
│  ├─ Personality-matched prompts                             │
│  └─ Context-aware shame/praise                              │
└─────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────┐
│  Text-to-Speech (ElevenLabs / Edge TTS)                     │
│  ├─ Voice validation & fallback                             │
│  ├─ Personality-matched voices                              │
│  └─ Emotion library (Pure Reachy mode)                      │
└─────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────┐
│  Reachy Mini Robot                                          │
│  ├─ Expressive Animations (head, antennas, body)            │
│  ├─ Synchronized Audio Playback                             │
│  └─ Real-time Camera Feed                                   │
└─────────────────────────────────────────────────────────────┘
```

---

## 💻 Technical Details

### Performance & Design Parameters

| Component | Configuration | Notes |
|-----------|--------------|-------|
| **Camera Capture** | Laptop/Robot Camera | Max ~100 FPS (0.01s sleep) |
| **Detection Rate** | Every 3rd frame | Max ~33 FPS detection |
| **TensorRT Speedup** | NVIDIA GPU optimization | **2-3x faster vs PyTorch** |
| **Pickup Detection** | 3 consecutive frames | Fast response (~90ms at 33 FPS) |
| **Putdown Detection** | 15 consecutive frames | Anti-flicker delay (~450ms) |
| **LLM Response** | Groq (Llama 3.1-8B) | Varies by API load |
| **TTS Generation** | Edge TTS / ElevenLabs | Varies by text length |

*Note: Actual FPS depends on hardware (camera quality, CPU/GPU), lighting conditions, and system load.*

### NVIDIA GPU Support

**Automatic Device Detection**:
```python
if torch.cuda.is_available():
    device = 'cuda'  # NVIDIA GPU → TensorRT
elif torch.backends.mps.is_available():
    device = 'mps'   # Apple Silicon GPU
else:
    device = 'cpu'   # Fallback to CPU
```

**TensorRT Export** (one-time setup):
```python
# Export PyTorch model to TensorRT engine
model.export(
    format='engine',
    device=0,           # GPU 0
    half=True,          # FP16 precision
    workspace=4         # 4GB workspace
)
# Result: yolo26m.engine (2-3x faster inference!)
```

### ByteTrack Object Tracking

```python
# YOLO's built-in ByteTrack integration
results = model.track(
    frame,
    persist=True,                  # Maintain track IDs across frames
    conf=adaptive_confidence,      # 0.5 initial, 0.2 tracking
    tracker="bytetrack.yaml",      # ByteTrack algorithm
    classes=[67]                   # Phone class only
)
```

---

## 🛠️ Installation

### Choose Your Installation Method

There are **multiple ways** to install and run this app:

#### **Option 1: Clone from GitHub** (Recommended for Development)

```bash
# Clone repository
git clone https://github.com/yaseminozkut/judgy_reachy_no_phone
cd judgy_reachy_no_phone

# Install base (free tier)
pip install .

# OR install everything (LLM + Premium TTS)
pip install .[llm,premium-tts]
```

#### **Option 2: Clone from Hugging Face**

```bash
# Clone from Hugging Face Spaces
git clone https://huggingface.co/spaces/yozkut/judgy_reachy_no_phone
cd judgy_reachy_no_phone

# Install (same as GitHub)
pip install .

# OR install everything (LLM + Premium TTS)
pip install .[llm,premium-tts]
```

> **Note:** GitHub and Hugging Face repositories are automatically synced via GitHub Actions using a [custom fork](https://github.com/yaseminozkut/huggingface-sync-action) of [huggingface-sync-action](https://github.com/alozowski/huggingface-sync-action). Both sources are always up to date!

#### **Option 3: Install via Reachy Mini SDK App Store** (Easiest!)

1. Start Reachy Mini daemon ([see guide](https://github.com/pollen-robotics/reachy_mini?tab=readme-ov-file#user-guides))
2. Go to **http://localhost:8000** (Reachy Mini dashboard)
3. Check **"Community Apps"** box
4. Find **"Judgy Reachy No Phone"**
5. Click **Install**
6. Toggle **ON** to start
7. Access at **http://localhost:8042**

#### **Option 4: Install via Reachy Mini Desktop App**

1. Download [Reachy Mini Desktop App](https://github.com/pollen-robotics/reachy-mini-desktop-app)
2. Open the app and go to **App Store**
3. Find **"Judgy Reachy No Phone"**
4. Click **Install**
5. Start the app
6. Access at **http://localhost:8042**

### Prerequisites (for Options 1 & 2)

1. **Reachy Mini SDK**: [Installation Guide](https://github.com/pollen-robotics/reachy_mini/blob/develop/docs/SDK/installation.md)
2. **Python 3.10+**
3. **(Optional) NVIDIA GPU with CUDA** for TensorRT acceleration

### Optional: Get Free API Keys

- **Groq** (LLM): [console.groq.com](https://console.groq.com) - Free Llama 3.1-8B access
- **ElevenLabs** (Premium TTS): [elevenlabs.io](https://elevenlabs.io) - 10k chars/month free

---

## 🎮 Usage

### 1. Start Reachy Mini Daemon

See [Reachy Mini Quickstart](https://github.com/pollen-robotics/reachy_mini/blob/develop/docs/SDK/quickstart.md) for:
- **Simulation** vs. **Lite** vs. **Wireless** mode
- **macOS** vs. **Windows/Linux** setup

### 2. Launch the App

```bash
# App auto-detects simulation mode and uses appropriate camera:
# - Simulation: Laptop webcam
# - Real robot: Robot's camera
```

### 3. Access Web UI

Open **http://localhost:8042** in your browser

### 4. Configure & Start

1. **(Optional)** Enter API keys for LLM/Premium TTS
2. **Select personality** (Pure Reachy, Angry Boss, Sarcastic, etc.)
3. **Adjust cooldown** (10-120 seconds between shames)
4. **Enable/disable praise** for putting phone down
5. **Click "Start Monitoring"**

### 5. Get Judged!

Pick up your phone and watch Reachy react! 📱🤖

---

## 🎛️ Configuration

### Web UI Settings

| Setting | Options | Default |
|---------|---------|---------|
| **Personality** | 8 personalities + Pure Reachy | Pure Reachy |
| **Cooldown** | 10-120 seconds | 30s |
| **Praise Mode** | On/Off | On |
| **Groq API Key** | Optional (for LLM) | - |
| **ElevenLabs API Key** | Optional (premium TTS) | - |
| **Edge Voice** | Custom voice ID | Personality default |
| **ElevenLabs Voice** | Custom voice ID | Personality default |

### Advanced: Custom Personalities

Edit `config.py` to add your own personalities:

```python
PERSONALITIES = {
    "your_personality": {
        "name": "🎨 Your Personality",
        "voice": "Description of speaking style...",
        "default_voice": "en-US-VoiceName",
        "default_eleven_voices": ["voice_id_1", "voice_id_2"],
        "prewritten_shame": ["Line 1", "Line 2", ...],
        "shame": {
            "tone": "Description...",
            "examples": ["Example 1", ...]
        },
        # ... see config.py for full schema
    }
}
```

---

## 📈 How It Works (Technical Deep Dive)

### 1. **Camera Thread** (100 FPS)
```python
while not stop_event.is_set():
    frame = webcam.read()  # or reachy.media.get_frame()
    latest_frame = frame.copy()

    # Detection every 3rd frame (~33 FPS)
    if frame_count % 3 == 0:
        event = detector.process_frame(frame)

    # Encode as JPEG for web UI
    latest_frame_jpeg = encode_jpeg(frame)
    time.sleep(0.01)  # ~100 FPS
```

### 2. **Phone Detection** (YOLO26m + TensorRT)
```python
# Auto-detect NVIDIA GPU and use TensorRT
if cuda_available:
    model = YOLO("yolo26m.engine")  # TensorRT (2-3x faster!)
else:
    model = YOLO("yolo26m.pt")      # PyTorch fallback

# ByteTrack for persistent tracking
results = model.track(
    frame,
    persist=True,
    conf=adaptive_threshold,  # 0.5 → 0.2 when tracking
    tracker="bytetrack.yaml"
)
```

### 3. **State Machine** (Pickup/Putdown)
```python
# Pickup detection (fast: 3 frames)
if consecutive_phone >= 3 and not phone_visible:
    phone_visible = True
    return "picked_up"  # Trigger shame!

# Putdown detection (slow: 15 frames, anti-flicker)
if consecutive_no_phone >= 15 and phone_visible:
    phone_visible = False
    return "put_down"  # Trigger praise!
```

### 4. **LLM Response** (Groq + Llama 3.1-8B)
```python
response = groq_client.chat.completions.create(
    model="llama-3.1-8b-instant",
    max_tokens=20,
    temperature=1.1,  # High creativity
    messages=[
        {"role": "system", "content": personality_prompt},
        {"role": "user", "content": f"Phone pickup #{count}"}
    ]
)
```

### 5. **Text-to-Speech** (Multi-Voice)
```python
# Try ElevenLabs first (if API key + under quota)
for voice_id in eleven_voices:
    try:
        audio = eleven.text_to_speech.convert(
            text=text,
            voice_id=voice_id,
            model_id="eleven_multilingual_v2"
        )
        return audio  # Success!
    except:
        continue  # Try next voice

# Fallback to Edge TTS (always works, unlimited)
audio = edge_tts.Communicate(text, edge_voice).save()
```

### 6. **Robot Animation** (Synchronized)
```python
# Play audio
reachy.media.play_sound(audio_path)

# Animate based on offense count
if count == 1:
    curious_look(reachy)          # Gentle tilt
elif count <= 3:
    disappointed_shake(reachy)     # Head shake
else:
    dramatic_sigh(reachy)          # Full-body exasperation
```

---

## 🎯 Impact & Use Cases

### 🏢 **Productivity Enhancement**
- **Home office / Private workspace**: Stay focused during work sessions
- **Study sessions**: Break the phone-checking habit while studying
- **Personal accountability**: Physical reminder to stay off your phone

### 🏥 **Behavior Modification**
- **Digital wellness**: Reduce screen time naturally
- **Habit formation**: Build phone-free streaks
- **Mindfulness**: Awareness of unconscious phone checks

### 🎓 **Education & Research**
- **Human-Robot Interaction**: Study emotional engagement with robots
- **Behavior Psychology**: Test intervention effectiveness with different personalities
- **Computer Vision**: Real-time object detection demos
- **AI Ethics**: Explore persuasive technology boundaries

### 🤖 **Robotics Applications**
- **Social Robotics**: Emotional feedback systems
- **Assistive Technology**: Habit coaching robots
- **Edge AI**: Real-time vision on consumer hardware

---

## 🔧 Requirements

### Hardware
- Reachy Mini robot with camera
- **(Optional)** NVIDIA GPU with CUDA for TensorRT acceleration

### Software
- Python 3.10+
- Reachy Mini SDK
- Internet connection (first-time model download, LLM/TTS APIs)

### Dependencies

**Core** (always required):
```
reachy_mini
ultralytics
opencv-python
torch
numpy
edge-tts
fastapi
uvicorn
pydantic
```

**Optional - LLM**:
```
groq
```

**Optional - Premium TTS**:
```
elevenlabs
```

---

## 📝 Project Structure

```
judgy_reachy_no_phone/
├── judgy_reachy_no_phone/
│   ├── __init__.py
│   ├── main.py              # Main app loop, UI endpoints
│   ├── detection.py         # YOLO + TensorRT + ByteTrack
│   ├── audio.py             # LLM + TTS (Groq, ElevenLabs, Edge)
│   ├── animations.py        # Robot movements
│   └── config.py            # Personalities, settings
├── README.md                # This file
├── pyproject.toml           # Package config
└── .github/
    └── workflows/
        └── sync-hf-space.yml  # Auto-sync to Hugging Face
```

---

## 🤝 Contributing

This project was built for the **NVIDIA GTC 2026 Golden Ticket Contest**. Contributions welcome after contest ends!

### Ideas for Future Enhancements
- [ ] Multi-person tracking (shame multiple people!)
- [ ] Gesture recognition (phone in pocket vs. actively using)
- [ ] Dashboard analytics (daily/weekly reports)
- [ ] Mobile app integration (sync with phone screen-time data)
- [ ] Custom shame schedules (stricter during work hours)
- [ ] Gamification (achievements, leaderboards)
- [ ] Voice recognition (personalized responses per user)
- [ ] Integration with productivity tools (Slack, Calendar)

---

## 📜 License

**Apache 2.0** - Feel free to use, modify, and distribute!

---

## 🙏 Acknowledgments

### Technologies
- **NVIDIA**: CUDA, TensorRT optimization
- **Ultralytics**: YOLO26m object detection model
- **ByteTrack**: Multi-object tracking algorithm
- **Groq**: Free Llama 3.1-8B-instant API
- **Meta**: Llama 3.1-8B model
- **ElevenLabs**: High-quality TTS voices
- **Microsoft**: Edge TTS (free tier)
- **webml-community**: WebGPU demo implementation inspired by [YOLO26-WebGPU](https://huggingface.co/spaces/webml-community/YOLO26-WebGPU)

### Datasets & Models
- **Hugging Face**: [pollen-robotics/reachy-mini-emotions-library](https://huggingface.co/datasets/pollen-robotics/reachy-mini-emotions-library)
- **Ultralytics**: [YOLO26](https://huggingface.co/Ultralytics/YOLO26)
- **ONNX Community**: [yolo26m-ONNX](https://huggingface.co/onnx-community/yolo26m-ONNX)

### Partners
- **Pollen Robotics**: Reachy Mini robot platform
- **Hugging Face**: Hosting & model distribution
- **NVIDIA**: GTC Golden Ticket Contest sponsor

---

## 👤 Author

**Yasemin Ozkut**

Built for the **NVIDIA GTC 2026 Golden Ticket Contest** (Jan 27 - Feb 15, 2026)

Partnership: **Pollen Robotics Reachy Mini x Hugging Face x NVIDIA**

---

## 🎥 Demo

**[Watch Demo Video →](https://your-demo-link)**

**[Try Live Demo on Hugging Face →](https://huggingface.co/spaces/pollen-robotics/Reachy_Mini)**

---

## 📧 Contact & Links

- **GitHub**: [yaseminozkut/judgy_reachy_no_phone](https://github.com/yaseminozkut/judgy_reachy_no_phone)
- **Hugging Face**: [@yaseminozkut](https://huggingface.co/yaseminozkut)
- **Contest**: [NVIDIA GTC Golden Ticket](https://www.nvidia.com/gtc)

---

<div align="center">

**Built with ❤️ using NVIDIA TensorRT, YOLO26m, Llama 3.1, and Reachy Mini**

*Get off your phone and get back to work! 📱→🤖→💪*

</div>