Gotham Dataset 2025: A Reproducible Large-Scale IoT Network Dataset for Intrusion Detection
Abstract
"Large-scale reproducible IoT network dataset with traffic from 100+ diverse IoT devices including smart home, wearable, and industrial sensors, featuring multiple attack scenarios and benign behavior for intrusion detection research."
Description
Overview
The Gotham Dataset 2025 is a comprehensive, reproducible IoT network security dataset designed to advance intrusion detection and security research in large-scale heterogeneous IoT environments.
Data Collection
- Network traffic captured from a testbed containing over 100 diverse IoT devices spanning smart home (cameras, thermostats, lights), wearables (fitness trackers), industrial sensors, and smart appliances.
- Collected in a distributed manner with traffic captured separately for each device at the IoT gateway interface, enabling device-level analysis.
- The dataset includes both normal operational behavior across multiple days and various attack scenarios injected into the network.
Attack Scenarios
- Botnet attacks: Mirai and other IoT botnets performing DDoS, scanning, and propagation.
- Reconnaissance: Network scanning and device fingerprinting attacks.
- Man-in-the-Middle: Traffic interception and manipulation between IoT devices and cloud services.
- Data exfiltration: Unauthorized data transmission from compromised devices.
Dataset Structure
- Over 23.8 GB of network traffic data in PCAP and processed CSV formats.
- Per-device traffic captures enabling fine-grained analysis of individual IoT device behavior.
- Rich flow-based features including packet sizes, inter-arrival times, protocol distributions, and behavioral statistics.
- Labeled data with attack types, timestamps, and device identifiers.
Use Cases
- Developing and benchmarking IoT-specific intrusion detection systems at scale.
- Research on device fingerprinting, behavioral profiling, and anomaly detection in heterogeneous IoT networks.
- Evaluating machine learning models for IoT security in realistic multi-device environments.
View Data Structure
To explore column names, data types, and sample rows, visit the official dataset page on Zenodo / ArXiv.
Preview on Zenodo / ArXivCite This Dataset
Belarbi, Othmane, & others (2025). Gotham Dataset 2025: A Reproducible Large-Scale IoT Network Dataset for Intrusion Detection. [Dataset]. Zenodo. https://doi.org/10.5281/zenodo.14502760
Source: Zenodo (2025) · DOI: 10.5281/zenodo.14502760
Indexed by IoTDataset.com on Jan 30, 2026
Ready to Start Your Research?
Download this dataset directly from the official repository and start building your next breakthrough project.