JPxxx/url-benchmark-dataset
Viewer • Updated • 3M • 19
Official collection of models and datasets from paper "From Lab to Production: Malicious URL Detection on Real-World Data".
Note Benchmarking dataset of 3 million labeled URLs with site-aware folding
Note Pre-training corpus of 68 million URLs