Dutch Railway Datasets

Examples in this documentation often use datasets based on the Dutch Railway datasets. These high-quality datasets are maintained by the team behind the Rijden de Treinen ("Are the trains running?") application. This page contains download links to our mirrors of the datasets.

In 2024, we published a blog post on the analysis of these datasets.

Loading the Datasets

You can load the datasets directly as follows:

CREATE TABLE services AS
FROM 'https://${uri}/services-2025-03.csv.gz';
DESCRIBE services;
| column_name | column_type | null | key | default | extra |
|---|---|---|---|---|---|
| Service:RDT-ID | BIGINT | YES | NULL | NULL | NULL |
| Service:Date | DATE | YES | NULL | NULL | NULL |
| Service:Type | VARCHAR | YES | NULL | NULL | NULL |
| Service:Company | VARCHAR | YES | NULL | NULL | NULL |
| Service:Train number | BIGINT | YES | NULL | NULL | NULL |
| Service:Completely cancelled | BOOLEAN | YES | NULL | NULL | NULL |
| Service:Partly cancelled | BOOLEAN | YES | NULL | NULL | NULL |
| Service:Maximum delay | BIGINT | YES | NULL | NULL | NULL |
| Stop:RDT-ID | BIGINT | YES | NULL | NULL | NULL |
| Stop:Station code | VARCHAR | YES | NULL | NULL | NULL |
| Stop:Station name | VARCHAR | YES | NULL | NULL | NULL |
| Stop:Arrival time | TIMESTAMP WITH TIME ZONE | YES | NULL | NULL | NULL |
| Stop:Arrival delay | BIGINT | YES | NULL | NULL | NULL |
| Stop:Arrival cancelled | BOOLEAN | YES | NULL | NULL | NULL |
| Stop:Departure time | TIMESTAMP WITH TIME ZONE | YES | NULL | NULL | NULL |
| Stop:Departure delay | BIGINT | YES | NULL | NULL | NULL |
| Stop:Departure cancelled | BOOLEAN | YES | NULL | NULL | NULL |
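
Because the column names contain colons and spaces, they must be wrapped in double quotes when referenced in a query. The following is a minimal sketch of such a query, assuming the `services` table was created as shown above. The aliases `station`, `num_stops`, and `avg_arrival_delay` are illustrative names only, and the unit of the delay columns is not stated on this page, so the average is reported as-is.

```sql
-- Sketch only: assumes the services table from the CREATE TABLE statement above.
-- Identifiers with colons or spaces need double quotes.
SELECT
    "Stop:Station name" AS station,                -- illustrative alias
    count(*) AS num_stops,                         -- one row per stop in the table
    avg("Stop:Arrival delay") AS avg_arrival_delay -- unit not stated on this page
FROM services
WHERE NOT "Stop:Arrival cancelled"                 -- also drops rows where the flag is NULL
GROUP BY "Stop:Station name"
ORDER BY num_stops DESC
LIMIT 5;
```

Single quotes would turn `'Stop:Station name'` into a string literal rather than a column reference, so double quotes are the safer habit with this schema.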