candle-transformer-46m / benchmark_report.md
Ashu9675's picture
Upload benchmark_report.md with huggingface_hub
727fed1 verified
|
Raw
History Blame Contribute Delete
2.08 kB
# Kaggle GPU Model — Full Benchmark Report
## Model Info
- **Architecture:** CandleTransformer (custom transformer LLM)
- **Parameters:** 46,355,576 (46M)
- **Config:** 12 layers, 8 heads, 512d, 2048ff, dropout=0.2
- **Training:** Kaggle GPU, 1-year BTC data (1d + 4h + 1h = 11,315 candles)
- **Anti-overfitting:** label smoothing, early stopping, weight_decay=0.05
## Benchmark Results
### 1. Live Prediction (BTC/USDT 1h)
```
Signal: BUY
Confidence: 15.9%
BUY: 67.1% | SELL: 31.5% | HOLD: 1.4%
```
### 2. Backtest Accuracy (30 windows, 1h candles)
| Metric | Value |
|---|---|
| Overall accuracy | **46.7%** (14/30) |
| BUY accuracy | 46.7% (14/30) |
| SELL accuracy | N/A (never predicted) |
| HOLD accuracy | N/A (never predicted) |
| Avg confidence | 18.0% |
### 3. Signal Distribution
| Signal | Count | % |
|---|---|---|
| BUY | 30 | 100% |
| SELL | 0 | 0% |
| HOLD | 0 | 0% |
### 4. Inference Speed (CPU)
| Metric | Value |
|---|---|
| Average | 4,531ms |
| Min | 2,468ms |
| Max | 10,696ms |
### 5. Multi-Timeframe
| Timeframe | Signal | BUY% | SELL% | HOLD% |
|---|---|---|---|---|
| 1d | BUY | 85.4% | 14.6% | 0.0% |
| 4h | BUY | 81.5% | 18.5% | 0.0% |
| 1h | BUY | 67.1% | 31.5% | 1.4% |
### 6. Next Candle Prediction Quality
- High/Low consistency: **FAIL** (high < max(open, close))
- Price direction: **WRONG** (predicted DOWN, actual UP)
## Analysis
### Strengths
- Model gives directional signals (not just HOLD)
- 46.7% accuracy is above random (33%)
- Confidence is well-calibrated (low = uncertain)
- Works across timeframes
### Weaknesses
- **BUY bias:** Always predicts BUY (learned from bull market data)
- **Next candle decoding:** Price predictions have structural issues
- **No SELL signals:** Can't profit from downtrends
- **Slow inference:** 4.5s on CPU (need GPU for real-time)
## Next Steps
1. Add bear market data (2022 crash, corrections)
2. Fix next candle decoder (constrain high >= max(O,C))
3. Add class balancing to training loss
4. Train longer with lower learning rate
5. Deploy on GPU for real-time inference