Tombiczek commited on
Commit
3fc2f8e
·
0 Parent(s):

Initial commit with Xet-backed binaries

Browse files
.DS_Store ADDED
Binary file (6.15 kB). View file
 
.gitattributes ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.png filter=lfs diff=lfs merge=lfs -text
37
+ *.gif filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Model Card: DQN Traffic Light Controller (Belgradzka-KEN, Warsaw)
2
+
3
+ ## Model Summary
4
+ This project contains a Deep Q-Network (DQN) model trained to control traffic lights at the **Belgradzka-KEN intersection in Warsaw**.
5
+
6
+ The model learns signal switching behavior in a SUMO simulation to improve traffic flow metrics such as queue length and delay.
7
+
8
+ ## Intended Use
9
+ - **Primary use**: Research and demonstration of reinforcement learning for traffic signal control in SUMO.
10
+ - **Deployment scope**: This model is trained for **one specific intersection only** (Belgradzka-KEN).
11
+ - **Important limitation**: It is **not** expected to work correctly on other intersections without retraining.
12
+
13
+ ## Training Setup
14
+ - Algorithm: DQN
15
+ - Final selected model:
16
+ - Training steps: **225,000**
17
+ - Learning rate: **0.0005**
18
+ - Traffic during training:
19
+ - Random traffic with fixed car arrival interval
20
+ - Random traffic with random car arrival interval (closer to real-world variability)
21
+
22
+ ## Demonstration
23
+ GUI recording showing how the trained model controls traffic lights:
24
+
25
+ ![DQN GUI demo](media/demo.gif)
26
+
27
+ ## Training Results
28
+
29
+ ### Effect of Training Steps (Fixed Arrival Interval)
30
+ This plot shows performance trends for models trained with different numbers of steps on random traffic with fixed car arrival interval:
31
+
32
+ ![Fixed interval training steps trend](media/fixed_steps_trend.png)
33
+
34
+ ### Effect of Training Steps (Random Arrival Interval)
35
+ This plot shows performance trends for models trained with different numbers of steps on random traffic with random car arrival interval, intended to better simulate real-world traffic:
36
+
37
+ ![Random interval training steps trend](media/random_steps_trend.png)
38
+
39
+ ### Hyperparameter Tuning
40
+ This plot compares how different hyperparameter settings affect model performance, tested on:
41
+ - low fixed arrival interval
42
+ - medium fixed arrival interval
43
+ - high fixed arrival interval
44
+ - truly random traffic (mix of low/medium/high, closer to real world)
45
+
46
+ ![Hyperparameter tuning results](media/tuning_results.png)
47
+
48
+ ## Final Evaluation and Stability Comparison
49
+ Two models were compared:
50
+ - **Stable/best model**: selected final model
51
+ - **Unstable model**: model with `target_update_interval = 1000` (instead of 500)
52
+
53
+ The comparison highlights the importance of choosing proper hyperparameters.
54
+
55
+ ### Mean Delay by Method
56
+ ![Mean delay comparison](media/mean_delay_by_method.png)
57
+
58
+ ### Mean Queue Length by Method
59
+ ![Mean queue length comparison](media/mean_queue_length_by_method.png)
60
+
61
+ ## Usage
62
+ If you want to train, evaluate, or use these models yourself, see the repository:
63
+
64
+ https://github.com/Tombiczek/rl-traffic-control-sumo
media/demo.gif ADDED

Git LFS Details

  • SHA256: 4309be5cc5e8dd9f905e93711eb49ac45a4395d6ac25b40586922652e8d05cd0
  • Pointer size: 132 Bytes
  • Size of remote file: 5.4 MB
media/fixed_steps_trend.png ADDED

Git LFS Details

  • SHA256: 4eecd355feaf94807bb87fef7157aa752f83e16c45f920e85c3fab9f2ccf8e46
  • Pointer size: 131 Bytes
  • Size of remote file: 103 kB
media/mean_delay_by_method.png ADDED

Git LFS Details

  • SHA256: 21efe7fe8454bf59d6336ff1655a63114163c1eefb43a2f37e5067d5e1078954
  • Pointer size: 130 Bytes
  • Size of remote file: 22.9 kB
media/mean_queue_length_by_method.png ADDED

Git LFS Details

  • SHA256: ac7b8507afbc82a5d7340e801e249f50ee48de404bf45344b824621dafa03f20
  • Pointer size: 130 Bytes
  • Size of remote file: 25.8 kB
media/random_steps_trend.png ADDED

Git LFS Details

  • SHA256: fb1d7d0756bb08fa7e9d483b04fd7b0f9705987c92a5645a7eb5a9c9e91c86eb
  • Pointer size: 131 Bytes
  • Size of remote file: 111 kB
media/tuning_results.png ADDED

Git LFS Details

  • SHA256: 0b1a9db5d635465775047e5ca09a5f9852ca1a0ff6fdf13ddd2cede36151a946
  • Pointer size: 130 Bytes
  • Size of remote file: 94.5 kB
models/dqn_fixed_lr_5e-4_225k_final.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:196ae63860c320734fcba71a289c41659c5f2120395a5348c6d551d53f23c631
3
+ size 130637
models/dqn_fixed_tui_1e3_225k_unstable.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:50c386ddd29f8901cf5c64a109b13616903f5d5324fdf014d4527c6b6448f892
3
+ size 130636