Dutch Railway Datasets
Examples in this documentation often use datasets based on the Dutch Railway datasets. These high-quality datasets are maintained by the team behind the Rijden de Treinen (Are the trains running?) application. This page contains download links to our mirrors to the datasets.
In 2024, we have published a blog post on the analysis of these datasets.
Loading the Datasets
You can load the datasets directly as follows:
CREATE TABLE services AS
FROM 'https://${uri}/services-2025-03.csv.gz';
DESCRIBE services;
| column_name | column_type | null | key | default | extra |
|---|---|---|---|---|---|
| Service:RDT-ID | BIGINT | YES | NULL | NULL | NULL |
| Service:Date | DATE | YES | NULL | NULL | NULL |
| Service:Type | VARCHAR | YES | NULL | NULL | NULL |
| Service:Company | VARCHAR | YES | NULL | NULL | NULL |
| Service:Train number | BIGINT | YES | NULL | NULL | NULL |
| Service:Completely cancelled | BOOLEAN | YES | NULL | NULL | NULL |
| Service:Partly cancelled | BOOLEAN | YES | NULL | NULL | NULL |
| Service:Maximum delay | BIGINT | YES | NULL | NULL | NULL |
| Stop:RDT-ID | BIGINT | YES | NULL | NULL | NULL |
| Stop:Station code | VARCHAR | YES | NULL | NULL | NULL |
| Stop:Station name | VARCHAR | YES | NULL | NULL | NULL |
| Stop:Arrival time | TIMESTAMP WITH TIME ZONE | YES | NULL | NULL | NULL |
| Stop:Arrival delay | BIGINT | YES | NULL | NULL | NULL |
| Stop:Arrival cancelled | BOOLEAN | YES | NULL | NULL | NULL |
| Stop:Departure time | TIMESTAMP WITH TIME ZONE | YES | NULL | NULL | NULL |
| Stop:Departure delay | BIGINT | YES | NULL | NULL | NULL |
| Stop:Departure cancelled | BOOLEAN | YES | NULL | NULL | NULL |