NYCTrip Data by Relatix™ Analytics

Data Dictionary

Schema definitions for all tables in the D1 analytics database. Raw data is stored as Parquet files in R2; the tables below contain pre-aggregated summaries of that data.

datasets

Data source inventory

| Column | Type | Description |
| --- | --- | --- |
| id | TEXT | Primary key: taxi, subway, citibike, air, weather |
| display_name | TEXT | Human-readable name |
| description | TEXT | Dataset description |
| earliest_month | TEXT | Earliest available YYYY-MM |
| latest_month | TEXT | Most recent YYYY-MM |
| total_records | INTEGER | Total row count across all files |
| updated_at | TEXT | Last update timestamp |
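
As a rough illustration of how the datasets table might be declared and queried, the sketch below uses Python's built-in sqlite3 module with an in-memory database as a local stand-in for D1. The DDL is inferred from the column list (the dictionary does not give it), the CHECK constraint and the `months_covered` helper are hypothetical, and the sample row is invented.

```python
import sqlite3

# Hypothetical DDL inferred from the column list; the real schema may differ.
DATASETS_DDL = """
CREATE TABLE datasets (
    id             TEXT PRIMARY KEY
                   CHECK (id IN ('taxi', 'subway', 'citibike', 'air', 'weather')),
    display_name   TEXT,
    description    TEXT,
    earliest_month TEXT,     -- YYYY-MM
    latest_month   TEXT,     -- YYYY-MM
    total_records  INTEGER,
    updated_at     TEXT
)
"""

def months_covered(earliest: str, latest: str) -> int:
    """Inclusive number of months between two YYYY-MM strings."""
    ey, em = map(int, earliest.split("-"))
    ly, lm = map(int, latest.split("-"))
    return (ly - ey) * 12 + (lm - em) + 1

conn = sqlite3.connect(":memory:")  # local stand-in, not the real database
conn.execute(DATASETS_DDL)

# Invented sample row, for illustration only.
conn.execute(
    "INSERT INTO datasets VALUES (?, ?, ?, ?, ?, ?, ?)",
    ("taxi", "Taxi trips", "Example row", "2023-01", "2023-12", 1000,
     "2024-01-15T00:00:00Z"),
)

row = conn.execute(
    "SELECT earliest_month, latest_month FROM datasets WHERE id = 'taxi'"
).fetchone()
coverage = months_covered(*row)
print(coverage)
```

Because both month columns are zero-padded YYYY-MM strings, they also sort correctly as plain text, so range filters such as `year_month BETWEEN ? AND ?` work without date parsing.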

taxi_zone_monthly

Taxi zone-level monthly aggregations

| Column | Type | Description |
| --- | --- | --- |
| taxi_type | TEXT | yellow, green, or fhvhv |
| year_month | TEXT | YYYY-MM |
| location_id | INTEGER | TLC taxi zone ID (1-263) |
| pickup_count | INTEGER | Number of pickups |
| dropoff_count | INTEGER | Number of dropoffs |
| avg_fare | REAL | Average fare amount (USD) |
| avg_distance | REAL | Average trip distance (miles) |
| avg_tip | REAL | Average tip amount (USD) |
| total_fare | REAL | Sum of all fares (USD) |
| total_trips | INTEGER | Total trip count |
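
Because each row is already an average over one zone, rolling several zones up into a single figure needs some care. The sketch below assumes that avg_fare equals total_fare divided by total_trips within a row (the dictionary does not state this explicitly) and shows why a trip-weighted average differs from a plain average of the per-zone averages. The rows are invented and an in-memory SQLite database stands in for D1.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # local stand-in, not the real database
conn.execute("""
CREATE TABLE taxi_zone_monthly (
    taxi_type TEXT, year_month TEXT, location_id INTEGER,
    pickup_count INTEGER, dropoff_count INTEGER,
    avg_fare REAL, avg_distance REAL, avg_tip REAL,
    total_fare REAL, total_trips INTEGER
)""")

# Invented sample rows: one busy, expensive zone and one quiet, cheap zone.
rows = [
    ("yellow", "2024-01", 132, 900, 100, 50.0, 15.0, 8.0, 45000.0, 900),
    ("yellow", "2024-01", 161, 100, 300, 10.0, 1.5, 2.0, 1000.0, 100),
]
conn.executemany(
    "INSERT INTO taxi_zone_monthly VALUES (?,?,?,?,?,?,?,?,?,?)", rows
)

# AVG(avg_fare) treats every zone equally; SUM/SUM weights by trip volume.
naive, weighted = conn.execute("""
    SELECT AVG(avg_fare), SUM(total_fare) / SUM(total_trips)
    FROM taxi_zone_monthly
    WHERE taxi_type = 'yellow' AND year_month = '2024-01'
""").fetchone()

print(naive, weighted)
```

With these sample rows the unweighted figure is 30.0 while the trip-weighted figure is 46.0, which is the value a per-trip average over the underlying data would give.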

taxi_hourly

Taxi hourly demand patterns

| Column | Type | Description |
| --- | --- | --- |
| taxi_type | TEXT | yellow, green, or fhvhv |
| year_month | TEXT | YYYY-MM |
| hour | INTEGER | Hour of day (0-23) |
| trip_count | INTEGER | Number of trips |
| avg_fare | REAL | Average fare (USD) |
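
One typical use of this table is finding the busiest hour for each taxi type in a given month. The sketch below assumes one row per (taxi_type, year_month, hour), which the dictionary implies but does not state. The `peak_hours` helper is hypothetical, the rows are invented, and an in-memory SQLite database stands in for D1.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # local stand-in, not the real database
conn.execute("""
CREATE TABLE taxi_hourly (
    taxi_type TEXT, year_month TEXT,
    hour INTEGER CHECK (hour BETWEEN 0 AND 23),
    trip_count INTEGER, avg_fare REAL
)""")

# Invented sample rows.
conn.executemany("INSERT INTO taxi_hourly VALUES (?,?,?,?,?)", [
    ("yellow", "2024-01", 8, 500, 14.0),
    ("yellow", "2024-01", 18, 800, 16.5),
    ("green", "2024-01", 8, 120, 12.0),
    ("green", "2024-01", 18, 90, 13.0),
])

def peak_hours(conn: sqlite3.Connection, year_month: str) -> dict:
    """Return {taxi_type: hour with the highest trip_count} for one month."""
    query = """
        SELECT taxi_type, hour FROM taxi_hourly
        WHERE year_month = ?
        ORDER BY taxi_type, trip_count DESC, hour
    """
    peaks = {}
    for taxi_type, hour in conn.execute(query, (year_month,)):
        # Rows are sorted busiest-first, so the first row per type is its peak.
        peaks.setdefault(taxi_type, hour)
    return peaks

peaks = peak_hours(conn, "2024-01")
print(peaks)
```

Ties are broken in favour of the earlier hour by the final `hour` sort key; that choice is arbitrary.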

subway_station_monthly

Subway station monthly ridership

| Column | Type | Description |
| --- | --- | --- |
| station_id | TEXT | MTA station identifier |
| year_month | TEXT | YYYY-MM |
| total_entries | INTEGER | Total turnstile entries |
| total_exits | INTEGER | Total turnstile exits |
| avg_daily_entries | REAL | Average entries per day |
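
The dictionary does not say how avg_daily_entries is derived. One plausible reading, shown below purely as an assumption, is total_entries divided by the number of calendar days in the month. If the pipeline instead divides by the number of days that actually have data, the stored values will differ from this sketch.

```python
import calendar

def avg_daily_entries(total_entries: int, year_month: str) -> float:
    """Hypothetical derivation: monthly entries / calendar days in that month."""
    year, month = map(int, year_month.split("-"))
    days_in_month = calendar.monthrange(year, month)[1]  # handles leap years
    return total_entries / days_in_month

# Invented totals: February 2024 has 29 days, January 2024 has 31.
feb = avg_daily_entries(290_000, "2024-02")
jan = avg_daily_entries(310_000, "2024-01")
print(feb, jan)
```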

citibike_station_monthly

Citi Bike station monthly activity

| Column | Type | Description |
| --- | --- | --- |
| station_id | TEXT | Citi Bike station ID |
| year_month | TEXT | YYYY-MM |
| trip_starts | INTEGER | Trips starting here |
| trip_ends | INTEGER | Trips ending here |
| avg_duration_min | REAL | Average trip duration (minutes) |
| member_trips | INTEGER | Trips by annual members |
| casual_trips | INTEGER | Trips by casual riders |
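
Two quantities that fall out of these columns are a station's net bike flow and its share of member trips. The sketch below models one row as a dataclass. The class name and both derived properties are hypothetical and not part of the schema, and the sample values are invented.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class CitibikeStationMonth:
    """One row of citibike_station_monthly (field names follow the table)."""
    station_id: str
    year_month: str
    trip_starts: int
    trip_ends: int
    avg_duration_min: float
    member_trips: int
    casual_trips: int

    @property
    def net_flow(self) -> int:
        # Positive: more bikes arrive than leave, so the station fills up.
        return self.trip_ends - self.trip_starts

    @property
    def member_share(self) -> float:
        # Fraction of member + casual trips made by members; 0.0 if no trips.
        total = self.member_trips + self.casual_trips
        return self.member_trips / total if total else 0.0

# Invented sample row.
row = CitibikeStationMonth("station-A", "2024-06", 1200, 1500, 14.2, 900, 300)
print(row.net_flow, row.member_share)
```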

air_monthly

Airport monthly operations

| Column | Type | Description |
| --- | --- | --- |
| airport | TEXT | Airport code: JFK, LGA, EWR |
| year_month | TEXT | YYYY-MM |
| departures | INTEGER | Number of departures |
| arrivals | INTEGER | Number of arrivals |
| avg_delay_min | REAL | Average delay (minutes) |
| cancelled | INTEGER | Cancelled flights |
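
The value constraints in this table (three airport codes, a YYYY-MM month, integer counts) lend themselves to a simple row check before loading. The following is a minimal sketch. The `validate_air_row` function and its rules are assumptions drawn from the descriptions above rather than a documented validation step, and it deliberately leaves the sign of avg_delay_min unchecked, since the dictionary does not say whether early flights can produce negative values.

```python
import re

VALID_AIRPORTS = {"JFK", "LGA", "EWR"}
YEAR_MONTH_RE = re.compile(r"^\d{4}-(0[1-9]|1[0-2])$")

def validate_air_row(row: dict) -> list[str]:
    """Return a list of problems found in one air_monthly row (empty if none)."""
    problems = []
    if row.get("airport") not in VALID_AIRPORTS:
        problems.append(f"unknown airport: {row.get('airport')!r}")
    if not YEAR_MONTH_RE.match(str(row.get("year_month", ""))):
        problems.append(f"year_month is not YYYY-MM: {row.get('year_month')!r}")
    for col in ("departures", "arrivals", "cancelled"):
        value = row.get(col)
        if not isinstance(value, int) or value < 0:
            problems.append(f"{col} must be a non-negative integer")
    if not isinstance(row.get("avg_delay_min"), (int, float)):
        problems.append("avg_delay_min must be numeric")
    return problems

# Invented sample rows.
good = {"airport": "JFK", "year_month": "2024-03", "departures": 100,
        "arrivals": 98, "avg_delay_min": 12.5, "cancelled": 2}
bad = {"airport": "BOS", "year_month": "2024-13", "departures": -1,
       "arrivals": 98, "avg_delay_min": 12.5, "cancelled": 2}

print(validate_air_row(good))
print(len(validate_air_row(bad)))
```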

weather_daily

Daily weather observations

| Column | Type | Description |
| --- | --- | --- |
| station | TEXT | NOAA weather station ID |
| date | TEXT | YYYY-MM-DD |
| temp_max_f | REAL | Daily high temperature (°F) |
| temp_min_f | REAL | Daily low temperature (°F) |
| temp_avg_f | REAL | Daily average temperature (°F) |
| precip_in | REAL | Precipitation (inches) |
| snow_in | REAL | Snowfall (inches) |
| wind_avg_mph | REAL | Average wind speed (mph) |
| conditions | TEXT | Weather condition description |
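
This is the only daily table, so comparing it with the monthly tables means first rolling it up to a year_month key. Since date is stored as YYYY-MM-DD text, its first seven characters give that key directly. The sketch below shows one way to do this. The output column names, the station ID, and the rows are invented, and an in-memory SQLite database stands in for D1.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # local stand-in, not the real database
conn.execute("""
CREATE TABLE weather_daily (
    station TEXT, date TEXT,
    temp_max_f REAL, temp_min_f REAL, temp_avg_f REAL,
    precip_in REAL, snow_in REAL, wind_avg_mph REAL, conditions TEXT
)""")

# Invented sample rows for a placeholder station ID.
conn.executemany("INSERT INTO weather_daily VALUES (?,?,?,?,?,?,?,?,?)", [
    ("STATION-X", "2024-01-01", 40.0, 30.0, 35.0, 0.0, 0.0, 5.0, "clear"),
    ("STATION-X", "2024-01-02", 38.0, 28.0, 33.0, 0.5, 0.0, 7.0, "rain"),
    ("STATION-X", "2024-02-01", 30.0, 20.0, 25.0, 0.25, 2.0, 9.0, "snow"),
])

# substr(date, 1, 7) turns 'YYYY-MM-DD' into the 'YYYY-MM' key that the
# monthly tables use, so the result can be joined on year_month.
monthly = conn.execute("""
    SELECT substr(date, 1, 7) AS year_month,
           SUM(precip_in)     AS total_precip_in,
           AVG(temp_avg_f)    AS mean_temp_f,
           SUM(precip_in > 0) AS wet_days
    FROM weather_daily
    WHERE station = ?
    GROUP BY substr(date, 1, 7)
    ORDER BY year_month
""", ("STATION-X",)).fetchall()

for row in monthly:
    print(row)
```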