diff --git a/docs/R/ieei.R b/docs/R/ieei.R index 5852f6c5..efc30d85 100644 --- a/docs/R/ieei.R +++ b/docs/R/ieei.R @@ -4,7 +4,7 @@ data_dir <- "data/input/ieei/" output_dir <- "data/output/ieei/" master_dir <- "../master/" -taz_shape_filename <- paste0(master_dir, "tazs/master_tazs.shp") +taz_shape_filename <- "data/input/tazs/master_tazs.shp" socec_filename <- paste0(data_dir, "se_2016.csv") consolidated_streetlight_filename <- paste0(private_dir, "streetlight-ieei-flows.RDS") ncstm_filename <- paste0(data_dir, "ncstm-demand.rds") diff --git a/docs/_site.yml b/docs/_site.yml index 03b70f0a..6cec2b50 100644 --- a/docs/_site.yml +++ b/docs/_site.yml @@ -31,6 +31,8 @@ navbar: href: parking.html - text: "Destination Choice" href: resident_dest_choice.html + - text: "Mode Choice" + href: resident_mode_choice.html - text: "Non-homebased" href: nhb.html - text: Other Models diff --git a/docs/assignment_roadway.Rmd b/docs/assignment_roadway.Rmd index 59283388..a954793b 100644 --- a/docs/assignment_roadway.Rmd +++ b/docs/assignment_roadway.Rmd @@ -71,54 +71,32 @@ added to HOV3+. ## Value of time -Once trips are collapsed into the primary vehicle classes, they are split into -different value of time categories. The procedure for doing this is borrowed -from the NC Statewide Model -([documentation](other_docs/NCSTM_TollDocumentation_Section11-2.docx)). For -residents, this stratification is informed by household income, time of day of -the trip, and by the length of the trip. +Once trips are collapsed into the primary vehicle classes, they are assigned +values of time. The base values were borrowed from the NC Statewide Model, but +were then calibrated to match NC 540 traffic counts. -Qualitatively, value of time goes up: - -- As income increases -- As trip length increases -- In the peak periods (where work trips are most common) - -The NCSTM/TRMG2 procedure divides each class into five VOT categories, except -SUT which is divided into three. These are shown in the table below. ```{r, include=FALSE} tbl <- read_csv("../master/assignment/vot_params.csv") ``` ```{r} -tbl <- tbl[1:18, c("Name", "Value")] tbl <- tbl %>% - separate(Name, into = c("a", "b", "c"), fill = "right") %>% - mutate( - Name = ifelse(!is.na(c), paste(a, b), a), - VOT = ifelse(is.na(c), b, c) - ) %>% - select(Name, VOT, Value) %>% - pivot_wider(names_from = Name, values_from = Value) %>% - mutate(VOT = gsub("vot", "", VOT)) %>% - rename( - `VOT Group` = VOT, - `PK Auto` = `pk auto`, - `OP Auto` = `op auto`, - SUT = sut, - MUT = mut - ) + select(Name, Value) %>% + mutate(Name = case_when( + Name == "pk_auto" ~ "Autos/CV (PK)", + Name == "op_auto" ~ "Autos/CV (OP)", + Name == "sut" ~ "SUT", + Name == "mut" ~ "MUT", + TRUE ~ "Calibration Factor" + )) %>% + rename(`VOT ($/hr)` = Value) + tbl %>% kable() %>% kable_styling(full_width = FALSE) ``` -Such a high number of classes significantly increases assignment run time. To -mitigate this, categories 1-3 for the auto classes are collapsed and use the -values from VOT group 2. This trade off is reasonable given that it is the -higher VOT categories that are most likely to consider toll facilities. - ## Assignment parameters Relevant assignment details are listed below: diff --git a/docs/assignment_roadway.html b/docs/assignment_roadway.html index c54f5b39..24597841 100644 --- a/docs/assignment_roadway.html +++ b/docs/assignment_roadway.html @@ -190,6 +190,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -205,6 +208,9 @@
  • Accessibility
  • +
  • + University +
  • Airport
  • @@ -285,123 +291,61 @@

    Matrix creation

    Value of time

    -

    Once trips are collapsed into the primary vehicle classes, they are split into different value of time categories. The procedure for doing this is borrowed from the NC Statewide Model (documentation). For residents, this stratification is informed by household income, time of day of the trip, and by the length of the trip.

    -

    Qualitatively, value of time goes up:

    - -

    The NCSTM/TRMG2 procedure divides each class into five VOT categories, except SUT which is divided into three. These are shown in the table below.

    +

    Once trips are collapsed into the primary vehicle classes, they are assigned values of time. The base values were borrowed from the NC Statewide Model, but were then calibrated to match NC 540 traffic counts.

    - - - - - - - - - - - - - - - - - -
    -VOT Group - -PK Auto +Name -OP Auto - -SUT - -MUT +VOT ($/hr)
    -1 - -3.28 +Autos/CV (PK) -3.22 - -20.52 - -12.57 +17.11
    -2 - -7.15 - -7.08 - -38.79 +Autos/CV (OP) -31.12 +17.00
    -3 - -12.29 - -12.21 - -56.75 +SUT -49.54 +26.82
    -4 - -19.53 - -19.41 - -NA +MUT -67.95 +40.48
    -5 - -38.58 - -36.37 - -NA +Calibration Factor -88.34 +2.00
    -

    Such a high number of classes significantly increases assignment run time. To mitigate this, categories 1-3 for the auto classes are collapsed and use the values from VOT group 2. This trade off is reasonable given that it is the higher VOT categories that are most likely to consider toll facilities.

    Assignment parameters

    @@ -427,83 +371,83 @@

    Alphas

    Downtown -Urban +Rural Suburban -Rural +Urban -Freeway +Arterial -0.60 +1.15 -0.60 +1.15 -0.45 +1.15 -0.45 +1.15 -MLHighway +Collector -0.90 +1.60 -0.80 +1.75 -0.70 +1.60 -0.70 +1.60 -TLHighway +Freeway 0.90 -0.90 +0.65 -0.75 +0.65 -0.90 +0.65 -Superstreet +Local -0.75 +2.00 -0.75 +4.00 -0.75 +1.90 -0.75 +1.90 @@ -525,7 +469,7 @@

    Alphas

    -Arterial +MajorCollector 1.25 @@ -542,53 +486,53 @@

    Alphas

    -MajorCollector +MLHighway -1.50 +0.90 -1.50 +0.70 -1.50 +0.70 -1.50 +0.80 -Collector +Superstreet -1.75 +0.75 -1.75 +0.75 -1.75 +0.75 -1.75 +0.75 -Local +TLHighway -2.00 +1.15 -2.00 +1.00 -2.00 +1.15 -4.00 +1.15 @@ -610,34 +554,34 @@

    Betas

    -Freeway +Arterial -9 +5 -MLHighway +Collector -5 +4 -TLHighway +Freeway -5 +7 -Superstreet +Local -4 +3 @@ -650,34 +594,34 @@

    Betas

    -Arterial +MajorCollector -5 +4 -MajorCollector +MLHighway -3 +5 -Collector +Superstreet -3 +4 -Local +TLHighway -3 +5 diff --git a/docs/auto_ownership.Rmd b/docs/auto_ownership.Rmd index 15fe8530..41d31cbf 100644 --- a/docs/auto_ownership.Rmd +++ b/docs/auto_ownership.Rmd @@ -56,15 +56,19 @@ write_csv( ## Introduction -Vehicle ownership is a long-term decision that directly impacts daily mode +Auto ownership is a long-term decision that directly impacts daily mode choice. The choice is influenced by household factors like income and number -of workers, but also by where people choose to live. The auto-ownership model -allows the TRMG2 to be sensitive to these factors and respond to changes in the +of workers, but also by where people choose to live. The auto ownership model +allows TRMG2 to be sensitive to these factors and respond to changes in the future. +Auto ownership is predicted using a discrete choice model. For more details on +general model form, click +[here](https://tfresource.org/topics/Choice_models.html). + ## Model Structure -The auto-ownership model in the TRMG2 makes use of variables from the synthetic +The auto ownership model in TRMG2 makes use of variables from the synthetic population and zonal accessibility to make predictions. The model structure is a simple multinomial logistic (MNL) regression model with five alternatives: @@ -80,8 +84,11 @@ Model estimation was performed using TransCADs built-in logit model engine, and the table below shows the utility terms, coefficients, and goodness of fit. ```{r, include=FALSE} -coeffs <- read_csv("data/input/auto_ownership/ao_final_coefficients.csv") %>% - mutate(across(.cols = c(veh0:veh4), .fns = ~round(.x, 3))) %>% +coeffs <- read_csv("../master/resident/auto_ownership/ao_coefficients.csv") %>% + select(Term = Description, Alternative,Coefficient) %>% + mutate(Coefficient = round(Coefficient, 1)) %>% + pivot_wider(id_cols = c(Term), names_from = Alternative, values_from = Coefficient) %>% + mutate(across(.cols = c(v0:v4), .fns = ~round(.x, 3))) %>% mutate(across(.fns = ~ifelse(is.na(.x), "", .x))) ``` @@ -91,25 +98,32 @@ coeffs %>% kable_styling(full_width = FALSE) ``` +Household income categories are defined [here](population_synthesis.html#household-income). Accessibility terms are discussed in detail [here](accessibility.html). + +As an example, the 1-vehicle coefficients in the table above translate into +a utility formula that starts like this: + +$U = 4.88 + 0.7 * Workers + 0.1 * Seniors - 0.4 * WalkAccess + ...$ + The coefficients all have the right sign and the relative sizes are intuitive. -One particularly encouraging result of this model is that households with strong +For example, higher-income households are more likely to own more cars. One +particularly encouraging result of this model is that households with strong walk and transit accessibility are less likely to own a vehicle and even less likely to own multiple vehicles. This adds another dimension of model sensitivity to transit investments. New transit routes will affect long-term household decisions about auto ownership, which further influence their daily -decisions about mode choice. +transportation decisions. Looking in more detail, the trend for coefficients across alternatives is also -intuitive. Large numbers of workers in a household has a small positive impact on the -utility of owning 1 auto, but a large impact on owning 2 or more. High income -households are more likely to own more vehicles. Children make it less likely to -own 3 or more cars, because there are fewer drivers. Finally, the model's Rho^2 -shows strong predictive power. +intuitive. Large numbers of workers in a household have a small positive impact +on the utility of owning 1 auto, but a large impact on owning 2 or more. High +income households are more likely to own more vehicles. Finally, the model's +Rho^2 of 0.42 shows strong predictive power. ## Calibration -Even before calibration, the estimated coefficients produced results that -closely matched the survey as shown in the table below. +The estimated coefficients produce results that closely matched the survey as +shown in the table below. ```{r, include=FALSE} # These should come from an application of the model before any calibration diff --git a/docs/auto_ownership.html b/docs/auto_ownership.html index 35131ee1..f24611a8 100644 --- a/docs/auto_ownership.html +++ b/docs/auto_ownership.html @@ -190,6 +190,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -205,6 +208,9 @@
  • Accessibility
  • +
  • + University +
  • Airport
  • @@ -255,11 +261,12 @@

    Auto Ownership

    Introduction

    -

    Vehicle ownership is a long-term decision that directly impacts daily mode choice. The choice is influenced by household factors like income and number of workers, but also by where people choose to live. The auto-ownership model allows the TRMG2 to be sensitive to these factors and respond to changes in the future.

    +

    Auto ownership is a long-term decision that directly impacts daily mode choice. The choice is influenced by household factors like income and number of workers, but also by where people choose to live. The auto ownership model allows TRMG2 to be sensitive to these factors and respond to changes in the future.

    +

    Auto ownership is predicted using a discrete choice model. For more details on general model form, click here.

    Model Structure

    -

    The auto-ownership model in the TRMG2 makes use of variables from the synthetic population and zonal accessibility to make predictions. The model structure is a simple multinomial logistic (MNL) regression model with five alternatives:

    +

    The auto ownership model in TRMG2 makes use of variables from the synthetic population and zonal accessibility to make predictions. The model structure is a simple multinomial logistic (MNL) regression model with five alternatives:

    • 0 Vehicles
    • 1 Vehicle
    • @@ -278,76 +285,79 @@

      Coefficients

      Term -veh0 +v0 - -veh1 + +v1 - -veh2 + +v2 -veh3 +v3 -veh4 +v4 -Workers +Constant +0 - -0.665 + +4.9 - -3.24 + +2.4 -4.249 +1.7 -5.215 +0.7 -Non-Working Adults +Workers - -0 + +0.7 - -2.06 + +3.3 -2.806 +4.3 -3.598 +5.3 -Children +Non-working adults - + +0.0 - + +2.1 --0.153 +2.8 --0.153 +3.6 @@ -356,11 +366,11 @@

      Coefficients

      - -0.144 + +0.1 - -0.144 + +0.1 @@ -369,117 +379,186 @@

      Coefficients

      -Income +Walk access - -1.687 + +-0.4 - -2.531 + +-0.6 -2.724 +-0.6 -2.958 +-0.7 -Walk Accessibility +Transit access - + +-0.1 - --0.395 + +-0.1 --0.605 +-0.2 --1.061 +-0.2 -Transit Accessibility +Nearby access - --0.37 + +-0.4 - --0.425 + +-0.5 --0.536 +-0.8 --0.536 +-1.1 -ASC +Walkability (intrazonal) + +-0.5 + + +-0.5 + + +-0.5 + + +-0.5 + + + --0.441 +Income Category 1 HHs --5.751 + + +-0.6 + + +-1.8 --8.939 +-1.8 --12.61 +-3.2 +Income Category 2 HHs + + + + +1.6 + + +1.6 + + +1.6 +1.6 + + +Income Category 3 HHs + +1.8 + + +2.6 + +2.7 +2.7 -Adjusted Rho^2 +Income Category 4 HHs + + + + +2.2 + + +3.7 + + +4.1 -0.41 +4.5 + + +Calibration constant + +-0.1 + + +0.0 + +0.1 +0 -

      The coefficients all have the right sign and the relative sizes are intuitive. One particularly encouraging result of this model is that households with strong walk and transit accessibility are less likely to own a vehicle and even less likely to own multiple vehicles. This adds another dimension of model sensitivity to transit investments. New transit routes will affect long-term household decisions about auto ownership, which further influence their daily decisions about mode choice.

      -

      Looking in more detail, the trend for coefficients across alternatives is also intuitive. Large numbers of workers in a household has a small positive impact on the utility of owning 1 auto, but a large impact on owning 2 or more. High income households are more likely to own more vehicles. Children make it less likely to own 3 or more cars, because there are fewer drivers. Finally, the model’s Rho^2 shows strong predictive power.

      +

      Household income categories are defined here. Accessibility terms are discussed in detail here.

      +

      As an example, the 1-vehicle coefficients in the table above translate into a utility formula that starts like this:

      +

      \(U = 4.88 + 0.7 * Workers + 0.1 * Seniors - 0.4 * WalkAccess + ...\)

      +

      The coefficients all have the right sign and the relative sizes are intuitive. For example, higher-income households are more likely to own more cars. One particularly encouraging result of this model is that households with strong walk and transit accessibility are less likely to own a vehicle and even less likely to own multiple vehicles. This adds another dimension of model sensitivity to transit investments. New transit routes will affect long-term household decisions about auto ownership, which further influence their daily transportation decisions.

      +

      Looking in more detail, the trend for coefficients across alternatives is also intuitive. Large numbers of workers in a household have a small positive impact on the utility of owning 1 auto, but a large impact on owning 2 or more. High income households are more likely to own more vehicles. Finally, the model’s Rho^2 of 0.42 shows strong predictive power.

    Calibration

    -

    Even before calibration, the estimated coefficients produced results that closely matched the survey as shown in the table below.

    +

    The estimated coefficients produce results that closely matched the survey as shown in the table below.

    @@ -570,7 +649,7 @@

    Calibration

    v1 @@ -578,7 +657,7 @@

    Calibration

    v2 @@ -586,7 +665,7 @@

    Calibration

    v3 @@ -594,7 +673,7 @@

    Calibration

    v4 diff --git a/docs/commercial_vehicles.Rmd b/docs/commercial_vehicles.Rmd index e1d662f1..e9ad2cbe 100644 --- a/docs/commercial_vehicles.Rmd +++ b/docs/commercial_vehicles.Rmd @@ -21,13 +21,13 @@ library(kableExtra) ## Introduction -Commercial vehicles and trucks can be a significant source of congestion on -major facilities in an urban area, and in most instances, their travel behavior -distinct from residents. To better capture the commercial vehicle element of -traffic in the TRMG2, a separate set of commercial vehicle models were +Commercial vehicles and trucks contribute significantly to congestion on +major facilities in an urban area. In most instances, their travel behavior is +distinct from residents of that urban area. To better capture commercial vehicle +traffic in TRMG2, a separate set of commercial vehicle models were developed. These include trip generation, time of day, and distribution for -three commercial vehicle types: commercial autos, vans, and pickups (CV); single -unit trucks (SUT); and multi-unit trucks (MUT). +three commercial vehicle types: commercial autos, vans, and pickups (CV), single +unit trucks (SUT), and multi-unit trucks (MUT). These classes are defined using the [federal vehicle classification](https://www.fhwa.dot.gov/policyinformation/tmguide/tmg_2013/vehicle-types.cfm). @@ -35,9 +35,10 @@ CV - Classes 1-3 SUT - Classes 5-7 MUT - Classes 8-13 -The models are implementations of the [Quick Response -Freight Manual](https://trid.trb.org/view.aspx?id=859168) (QRFM) and are -re-estimated where possible using the Triangle's 2010 Commercial Vehicle survey. +The models use the [Quick Response +Freight Manual](https://trid.trb.org/view.aspx?id=859168) (QRFM) approach and +are re-estimated when possible using the Triangle's 2010 Commercial Vehicle +survey. ```{r, include=FALSE} veh_raw <- read_csv("data/input/_PRIVATE/survey_data/SurveyData_Caliper/2010 Commercial Vehicle Travel/1_Data - Original/vehicles_exported.csv") @@ -108,10 +109,11 @@ add_tazs <- add_veh_types %>% ## Trip Rates -Initial estimation attempts led to poor model results. A fundamental problem may -be regressing a 2010 survey against 2016 land use data. In addition, -stakeholders noted some concerns with the survey. Given these issues, initial -rates were borrowed from the QRFM and mapped to the TRMG2 employment types. +Initial estimation attempts led to counter-intuitive coefficeints and poor model +fit statistics. A fundamental problem may have been regressing a 2010 survey +against 2016 land use data. In addition, stakeholders noted some concerns with +the survey. Given these issues, initial rates were borrowed from the New River +Valley MPO travel model. ```{r, eval=FALSE} rate_est_tbl <- add_tazs %>% @@ -194,9 +196,8 @@ final_rates %>% ## Time of Day -The survey included trip departure time, which was used to categorize trips -by time period. The table below shows the estimated time of day factors used -by the model. +The 2010 survey included trip departure time, which was used to categorize trips +by time period. The table below shows the estimated time of day factors ([time of day definitions](time_of_day.html#final_defs)). ```{r} add_tod <- add_tazs %>% @@ -262,13 +263,14 @@ fix_peak %>% Directionality factors are not calculated for commercial vehicles and trucks. Instead, they are assumed to be evenly split by direction. -## Distribution +## Distribution {.tabset} -Once generated, the commercial vehicle and truck trips are distributed among -zones using a gravity model. Unfortunately, the CV survey did not contain -information on both ends of the trip, which meant it could not be used to -estimate model parameters. Model parameters were borrowed from the -travel model in Blacksburg, VA and are shown below. +Once generated, commercial vehicle and truck trips are distributed among +zones using a [gravity +model](https://tfresource.org/topics/Trip_distribution.html). Unfortunately, the +CV survey did not contain information on both ends of the trip, which meant it +could not be used to estimate model parameters. Model parameters were borrowed +from the NRVMPO travel model and are shown below. ```{r, include=FALSE} grav_params <- read_csv("../master/cv/cv_gravity_am.csv") @@ -281,18 +283,39 @@ grav_params %>% kable_styling(full_width = FALSE) ``` -The table below shows the resulting average trip lengths in the model. +The table below shows the resulting average commercial vehicle trip lengths in +the model. ```{r} tibble( Class = c("CV", "SUT", "MUT"), `Average Time (min)` = c(10, 12, 13.3), - `Averge Distance (mi)` = c(7.3, 9, 10.1) + `Averge Distance (mi)` = c(7.1, 8.9, 10.1) ) %>% kable() %>% kable_styling(full_width = FALSE) ``` +The charts below show the trip length frequency distribution graphs (in miles). + +### CV + +```{r, out.width="100%", fig.show='hold',fig.align='center'} +knitr::include_graphics("img/cv/cv.png") +``` + +### SUT + +```{r, out.width="100%", fig.show='hold',fig.align='center'} +knitr::include_graphics("img/cv/sut.png") +``` + +### MUT + +```{r, out.width="100%", fig.show='hold',fig.align='center'} +knitr::include_graphics("img/cv/mut.png") +``` + ```{r, eval=FALSE} write_csv(final_rates, "../master/cv/cv_generation.csv") ``` diff --git a/docs/commercial_vehicles.html b/docs/commercial_vehicles.html index cc725768..b73f715c 100644 --- a/docs/commercial_vehicles.html +++ b/docs/commercial_vehicles.html @@ -190,6 +190,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -242,7 +245,11 @@

    Commercial Vehicles and Trucks

  • Introduction
  • Trip Rates
  • Time of Day
  • -
  • Distribution
  • +
  • Distribution
  • @@ -258,16 +265,16 @@

    Commercial Vehicles and Trucks

    Introduction

    -

    Commercial vehicles and trucks can be a significant source of congestion on major facilities in an urban area, and in most instances, their travel behavior distinct from residents. To better capture the commercial vehicle element of traffic in the TRMG2, a separate set of commercial vehicle models were developed. These include trip generation, time of day, and distribution for three commercial vehicle types: commercial autos, vans, and pickups (CV); single unit trucks (SUT); and multi-unit trucks (MUT).

    +

    Commercial vehicles and trucks contribute significantly to congestion on major facilities in an urban area. In most instances, their travel behavior is distinct from residents of that urban area. To better capture commercial vehicle traffic in TRMG2, a separate set of commercial vehicle models were developed. These include trip generation, time of day, and distribution for three commercial vehicle types: commercial autos, vans, and pickups (CV), single unit trucks (SUT), and multi-unit trucks (MUT).

    These classes are defined using the federal vehicle classification.

    CV - Classes 1-3
    SUT - Classes 5-7
    MUT - Classes 8-13

    -

    The models are implementations of the Quick Response Freight Manual (QRFM) and are re-estimated where possible using the Triangle’s 2010 Commercial Vehicle survey.

    +

    The models use the Quick Response Freight Manual (QRFM) approach and are re-estimated when possible using the Triangle’s 2010 Commercial Vehicle survey.

    Trip Rates

    -

    Initial estimation attempts led to poor model results. A fundamental problem may be regressing a 2010 survey against 2016 land use data. In addition, stakeholders noted some concerns with the survey. Given these issues, initial rates were borrowed from the QRFM and mapped to the TRMG2 employment types.

    +

    Initial estimation attempts led to counter-intuitive coefficeints and poor model fit statistics. A fundamental problem may have been regressing a 2010 survey against 2016 land use data. In addition, stakeholders noted some concerns with the survey. Given these issues, initial rates were borrowed from the New River Valley MPO travel model.

    --0.0706 +-0.0852
    -0.1128 +0.0483
    -0.1454 +0.1047
    --0.4734 +0.0197
    @@ -480,7 +487,7 @@

    Trip Rates

    Time of Day

    -

    The survey included trip departure time, which was used to categorize trips by time period. The table below shows the estimated time of day factors used by the model.

    +

    The 2010 survey included trip departure time, which was used to categorize trips by time period. The table below shows the estimated time of day factors (time of day definitions).

    @@ -632,9 +639,9 @@

    Time of Day

    Directionality factors are not calculated for commercial vehicles and trucks. Instead, they are assumed to be evenly split by direction.

    -
    +

    Distribution

    -

    Once generated, the commercial vehicle and truck trips are distributed among zones using a gravity model. Unfortunately, the CV survey did not contain information on both ends of the trip, which meant it could not be used to estimate model parameters. Model parameters were borrowed from the travel model in Blacksburg, VA and are shown below.

    +

    Once generated, commercial vehicle and truck trips are distributed among zones using a gravity model. Unfortunately, the CV survey did not contain information on both ends of the trip, which meant it could not be used to estimate model parameters. Model parameters were borrowed from the NRVMPO travel model and are shown below.

    @@ -709,7 +716,7 @@

    Distribution

    -

    The table below shows the resulting average trip lengths in the model.

    +

    The table below shows the resulting average commercial vehicle trip lengths in the model.

    @@ -733,7 +740,7 @@

    Distribution

    10.0 @@ -744,7 +751,7 @@

    Distribution

    12.0 @@ -760,6 +767,19 @@

    Distribution

    -7.3 +7.1
    -9.0 +8.9
    +

    The charts below show the trip length frequency distribution graphs (in miles).

    +
    +

    CV

    +

    +
    +
    +

    SUT

    +

    +
    +
    +

    MUT

    +

    +
    diff --git a/docs/data/input/resident_productions/modeled_trips_uncalibrated.csv b/docs/data/input/resident_productions/modeled_trips_uncalibrated.csv index 064c585f..2a05aad4 100644 --- a/docs/data/input/resident_productions/modeled_trips_uncalibrated.csv +++ b/docs/data/input/resident_productions/modeled_trips_uncalibrated.csv @@ -1,6 +1,6 @@ segment,N_HB_K12_All,N_HB_OD_Long,N_HB_OD_Short,N_HB_OMED_All,N_HB_OME_All,W_HB_EK12_All,W_HB_O_All,W_HB_W_All -ihvi,57391.49,99520.12,63633.04,11208.62,107880.63,5785.51,24808.24,77525.72 -ihvs,267536.51,382519.53,233228.67,38203.91,387184.03,31829.93,140139.41,377344.79 -ilvi,62351,118400.08,75783.04,14807.76,140962.21,4833.9,21374.13,68703.69 -ilvs,164141.82,280523.6,166845.05,32935.52,317664.26,15409.96,95868.85,249942.44 -v0,15642.13,35325.7,19465.48,5588.16,44192.05,780.9,6501.41,15779.82 +ihvi,58060.36,104344.92,67662,12019.32,113223,5865.3,26636.95,82600.57 +ihvs,267893.05,375904.5,230216.29,37517.05,380471.25,31842.75,137962.67,371168.87 +ilvi,61438.89,121329.95,78005.28,15240.61,143169.48,4751.87,22288.42,71015.81 +ilvs,164754.23,276780.23,165369.72,32674.48,314584.27,15579.08,95360.5,248270.07 +v0,16155.92,37425.74,20949.35,6281.57,46821.7,802.68,6561.28,15957.73 diff --git a/docs/ieei.Rmd b/docs/ieei.Rmd index 75b445e1..9103757e 100644 --- a/docs/ieei.Rmd +++ b/docs/ieei.Rmd @@ -26,10 +26,27 @@ options(scipen = 999) ## Introduction -Travel from outside the Triangle model's geographic boundaries to locations inside the model's boundaries are referred to as "external-internal" or "EI" trips; movements from inside the boundary to outside are "internal-external" or "EI" trips; and movements through the region are referred to as "external-external" or "EE" trips. Together, these movements are referred to here as "IEEI" travel. This document discusses the development of an IEEI component of the regional model. +Travel from outside the Triangle model's geographic boundary to locations inside +the boundary are referred to as "external-internal" or "EI" trips. Movements +from inside the boundary to outside are "internal-external" or "EI" trips. +Movements through but not stopping within the region are referred to as +"external-external" or "EE" trips. Together, these travel patterns are referred +to here as "EE" and "IE/EI" travel. This document discusses the development of +these components of the regional model. ## Available Data Sources -The two best available data sources for movements through the modeling region are (i) StreetLight Data extracted by ITRE and (ii) a sub-area extraction from the North Carolina Statewide Travel Model (NCSTM). Our analysis suggests that, for our purposes, the NCSTM data is superior. The reason for this is that there *appears* to be an error with either the StreetLight Data or its extraction. The map below shows the "attractions" for the IE/EI trips from the StreetLight data, assuming the "E" end of the trip is the production end and the I end of the trip is the attraction end, and aggregated to a 21-district geographies defined by ITRE. The red dots show data extracted using the CAMPO zone system and the blue dots show data extracted using the Durham MPO zone system. The map shows an illogical outcomes of attractions clustured in select locations near the external stations. Given these odd results, we move forward with the NCSTM data, which, as demonstrated in the balance of the document, revealed logical results. +The two best available data sources for external-related travel are (i) +StreetLight Data provided by the client, and (ii) a sub-area extraction from the +North Carolina Statewide Travel Model (NCSTM v.4.3a). The analysis suggests that the NCSTM data is superior because there *appears* to be an error +with either the StreetLight Data or its extraction. The map below shows +"attractions" for the IE/EI trips from StreetLight. The "E" end of the trip is +asserted to be the production end and the "I" end of the trip is the attraction +end. Finally, the data was aggregated to a 21-district geographies defined by +ITRE. The red dots show data extracted using the CAMPO zone system and the blue +dots show data extracted using the DCHC MPO zone system. The map shows an +illogical outcome of attractions clustered in select locations near the external +stations. Nearly all the DCHC trip ends were in Chatham County while CAMPO trip +ends were predominantly in Franklin County. ### StreetLight Attractions ```{r, out.width="100%", fig.show='hold',fig.align='center'} @@ -37,18 +54,25 @@ knitr::include_graphics("img/ieei/sl_attractions.png") ``` ## EE -The first step in the analysis is to determine the share of traffic at each external station that can be attributed to EE movements. This will allow us to attribute the balance of the travel to internal-external (IE) and external-internal (EI) movements. The NCSTM data contains flow estimates for each of the model's `r number_of_external` EE station pairs. We use these flows to estimate the share of travel at each of the external stations using the ADT estimates for the external stations derived from the NCSTM. In addition, the share of commercial vehicles are segmented by single-unit trucks and multi-unit trucks. The table below summarizes these outcomes. +The first step is to determine the share of traffic at each external station +that can be attributed to EE travel. This allows the balance of trips to be +attributed to IE and EI travel. The NCSTM data contains flow estimates for each +of the model's `r number_of_external` external stations. These flows are used to +estimate the share of travel using the AWDT estimates derived from the NCSTM. In +addition, the share of commercial vehicles are segmented by single-unit trucks +and multi-unit trucks. The table below summarizes these outcomes. ### EE Traffic Shares ```{r} ext_shares_ncstm_df %>% + mutate(adt = format(adt, big.mark = ",", digits = 0, justify = "right")) %>% select(Station = ext_station, - `Avg Daily Traffic` = adt, + `Avg Weekday Traffic` = adt, `Pct EE Automobile` = pct_auto_ee, `Pct EE Commercial Vehicles` = pctcv, `Pct EE Single-unit Trucks` = pctcv_sut, `Pct EE Multi-unit Trucks` = pctcv_mut) %>% - kableExtra::kable(digits = 1) %>% + kableExtra::kable(digits = 0, aligh = "rr") %>% kableExtra::kable_styling(full_width = FALSE) ``` @@ -60,27 +84,19 @@ follow the same dual-peak pattern as commuters and occur more often during the midday in general. ## IE/EI -The second step in the development process is to see if we can find sufficient patterns in the NCSTM IE and EI data to estimate a regression model that can be generalized across geographies for use in regional model forecasting. We start with correlations between the "I" end of the IE/EI movements, which we are calling the production end, and the population and employment in the zone, which are shown in the table below. The correlations are positive for both population and employment. - -### IE/EI Attraction Correlations -```{r} -correlations_ncstm_d2_df %>% - mutate(term = case_when( - term == "trips" ~ "Productions", - term == "pop" ~ "Population", - term == "emp" ~ "Employment", - TRUE ~ "Missing" - )) %>% - select(Term = term, - Productions = trips) %>% - kableExtra::kable(digits = 3) %>% - kableExtra::kable_styling(full_width = FALSE) -``` - -The positive correletions move us to the next step of estimating attraction models for the internal end of the IE/EI movements, assigning the external station the production end. - -### IE/EI Attraction Model Estimation {.tabset .MODEL} -The NCSTM data was aggregated to the 43 district system to support model estimation. A handful of specifications were tested to determine a useful attraction model, including segmenting the attraction models for Freeway and non-Freeway external stations. The preferred model generated positive coefficients on the population and employment variables. It does not segment the stations by roadway type and asserts a Y-intercept of zero. +The number of IE/EI trips are controlled by the volumes at the external station +after external travel has been subtracted. These trips are distributed using a +[gravity model](https://tfresource.org/topics/Trip_distribution.html). For this +model form, attractions and gamma parameters must be estimated. + +### IE/EI Attraction Model Estimation +The NCSTM data was aggregated to the 43-district system to support model +estimation. A handful of specifications were tested to determine a useful +attraction model, including segmenting the attraction models for freeway and +non-freeway external stations. These more complicated models generated +coefficients with wrong signs. The final, simple model generated positive +coefficients on the population and employment variables. It does not segment the +stations by roadway type and asserts a Y-intercept of zero. ```{r} print_model <- function(model, includes_intercept) { @@ -112,27 +128,12 @@ print_model <- function(model, includes_intercept) { } ``` -#### Preferred ```{r, MODEL_PREFERRED} print_model(preferred_model, includes_intercept = FALSE) ``` -#### Freeway Model -```{r, MODEL_FREEWAY} -print_model(freeway_model, includes_intercept = TRUE) -``` - -#### Non-Freeway Model -```{r, MODEL_NON} -print_model(non_freeway_model, includes_intercept = TRUE) -``` - -#### Combined with Intercept -```{r, MODEL_INTER} -print_model(combined_model, includes_intercept = TRUE) -``` - -A scatter plot of the NCSTM and estimated attractions by the 43-district system is shown in the plot below. +A scatter plot of the NCSTM and estimated attractions by the 43-district system +is shown in the plot below. ### NCSTM and Estimated IE/EI Attractions ```{r, out.width="100%", fig.show='hold',fig.align='center'} @@ -140,16 +141,33 @@ knitr::include_graphics("img/ieei/estimated_attractions.png") ``` ### Distance Assessment -In the chart below, we plot the trip length frequency, with the trip length measured from the external station to the internal zone, for the NCSTM trips, segmenting trips that enter the region through external stations at Freeways from those at other locations. The plot shows a logical and expected outcome: trips entering the region at Freeway external stations travel longer than those entering at non-freeway locations. +The chart below plots the trip length frequency, with the trip length measured from the external station to the internal zone segmenting trips that enter the region at freeways from those at other locations. The plot shows a logical and expected outcome: trips entering the region at freeway external stations travel longer distances than those entering at non-freeway locations. #### IE/EI Trip Length Frequency ```{r, out.width="100%", fig.show='hold',fig.align='center'} knitr::include_graphics("img/ieei/ncstm_tlf.png") ``` -The above chart suggests the IE/EI distribution model should be segmented by station type, freeway or non-freeway. The external stations identified as freeways are as follows: `r freeway_station_vector`. +The above chart suggests the IE/EI distribution model should be segmented by freeway and non-freeway stations. The estimated gravity parameters are shown below. + +```{r, include=FALSE} +grav_df <- read_csv("../master/external/ieei_gravity.csv") +``` + +```{r} +grav_df %>% + group_by(filter) %>% + slice(1) %>% + ungroup() %>% + slice(1:2) %>% + mutate(Type = c("Freeway", "NonFreeway")) %>% + select(Type, a, b) %>% + kable() %>% + kable_styling(full_width = FALSE) +``` -Using the friction factors from the existing IE/EI model, the model estimated distances for freeways and non-freeways are plotted against the observed data in the two following figures. +The model estimated distances for freeways and non-freeways are plotted against +the observed data in the two following figures. #### IE/EI Observed and Estimated Trip Length Frequency for Freeway Stations ```{r, out.width="100%", fig.show='hold',fig.align='center'} @@ -174,6 +192,29 @@ ei_distance_means_df %>% ``` +### Final adjustments + +During validation, The IE/EI model was enhanced to allow additional control +of trip lengths. This is due to the existence of some town along the border. +Stations near those towns need even shorter trip lengths. Two additional +sets of gravity parameters were created and the terminology was changed as shown +below. + +```{r} +grav_df %>% + group_by(filter) %>% + slice(1) %>% + ungroup() %>% + mutate(Type = c("Long", "Medium", "Sort", "Very Short")) %>% + select(Type, a, b) %>% + kable() %>% + kable_styling(full_width = FALSE) +``` + +The first two options ("Long" and "Medium") are the same coefficients as +estimated. The final two coefficients were arrived at through iterative trials +during highway validation. + ### IE/EI Time of Day The data available was not adequate to calculate time of day shares for the diff --git a/docs/ieei.html b/docs/ieei.html index 76357a3a..79b6d467 100644 --- a/docs/ieei.html +++ b/docs/ieei.html @@ -7,7 +7,7 @@ - + Internal/External @@ -190,6 +190,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -248,13 +251,7 @@

    Internal/External

  • EE Time of Day
  • IE/EI
  • Caliper Corporation

    -

    March 01, 2022

    +

    April 07, 2022

    @@ -279,11 +277,11 @@

    Internal/External

    Introduction

    -

    Travel from outside the Triangle model’s geographic boundaries to locations inside the model’s boundaries are referred to as “external-internal” or “EI” trips; movements from inside the boundary to outside are “internal-external” or “EI” trips; and movements through the region are referred to as “external-external” or “EE” trips. Together, these movements are referred to here as “IEEI” travel. This document discusses the development of an IEEI component of the regional model.

    +

    Travel from outside the Triangle model’s geographic boundary to locations inside the boundary are referred to as “external-internal” or “EI” trips. Movements from inside the boundary to outside are “internal-external” or “EI” trips. Movements through but not stopping within the region are referred to as “external-external” or “EE” trips. Together, these travel patterns are referred to here as “EE” and “IE/EI” travel. This document discusses the development of these components of the regional model.

    Available Data Sources

    -

    The two best available data sources for movements through the modeling region are (i) StreetLight Data extracted by ITRE and (ii) a sub-area extraction from the North Carolina Statewide Travel Model (NCSTM). Our analysis suggests that, for our purposes, the NCSTM data is superior. The reason for this is that there appears to be an error with either the StreetLight Data or its extraction. The map below shows the “attractions” for the IE/EI trips from the StreetLight data, assuming the “E” end of the trip is the production end and the I end of the trip is the attraction end, and aggregated to a 21-district geographies defined by ITRE. The red dots show data extracted using the CAMPO zone system and the blue dots show data extracted using the Durham MPO zone system. The map shows an illogical outcomes of attractions clustured in select locations near the external stations. Given these odd results, we move forward with the NCSTM data, which, as demonstrated in the balance of the document, revealed logical results.

    +

    The two best available data sources for external-related travel are (i) StreetLight Data provided by the client, and (ii) a sub-area extraction from the North Carolina Statewide Travel Model (NCSTM v.4.3a). The analysis suggests that the NCSTM data is superior because there appears to be an error with either the StreetLight Data or its extraction. The map below shows “attractions” for the IE/EI trips from StreetLight. The “E” end of the trip is asserted to be the production end and the “I” end of the trip is the attraction end. Finally, the data was aggregated to a 21-district geographies defined by ITRE. The red dots show data extracted using the CAMPO zone system and the blue dots show data extracted using the DCHC MPO zone system. The map shows an illogical outcome of attractions clustered in select locations near the external stations. Nearly all the DCHC trip ends were in Chatham County while CAMPO trip ends were predominantly in Franklin County.

    StreetLight Attractions

    @@ -291,7 +289,7 @@

    StreetLight Attractions

    EE

    -

    The first step in the analysis is to determine the share of traffic at each external station that can be attributed to EE movements. This will allow us to attribute the balance of the travel to internal-external (IE) and external-internal (EI) movements. The NCSTM data contains flow estimates for each of the model’s 97 EE station pairs. We use these flows to estimate the share of travel at each of the external stations using the ADT estimates for the external stations derived from the NCSTM. In addition, the share of commercial vehicles are segmented by single-unit trucks and multi-unit trucks. The table below summarizes these outcomes.

    +

    The first step is to determine the share of traffic at each external station that can be attributed to EE travel. This allows the balance of trips to be attributed to IE and EI travel. The NCSTM data contains flow estimates for each of the model’s 97 external stations. These flows are used to estimate the share of travel using the AWDT estimates derived from the NCSTM. In addition, the share of commercial vehicles are segmented by single-unit trucks and multi-unit trucks. The table below summarizes these outcomes.

    EE Traffic Shares

    @@ -300,8 +298,8 @@

    EE Traffic Shares

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - @@ -1348,54 +1346,10 @@

    EE Time of Day

    IE/EI

    -

    The second step in the development process is to see if we can find sufficient patterns in the NCSTM IE and EI data to estimate a regression model that can be generalized across geographies for use in regional model forecasting. We start with correlations between the “I” end of the IE/EI movements, which we are calling the production end, and the population and employment in the zone, which are shown in the table below. The correlations are positive for both population and employment.

    -
    -

    IE/EI Attraction Correlations

    -
    Station -Avg Daily Traffic + +Avg Weekday Traffic Pct EE Automobile @@ -322,1020 +320,1020 @@

    EE Traffic Shares

    3235 -3523.9 + +3,524 -30.1 +30 -20.8 +21 -3.5 +3 -17.3 +17
    3236 -927.2 + +927 -24.2 +24 -12.8 +13 -4.9 +5 -7.8 +8
    3237 -2176.4 + +2,176 -22.4 +22 -8.6 +9 -2.7 +3 -5.8 +6
    3242 -1701.1 + +1,701 -24.0 +24 -13.0 +13 -4.0 +4 -9.1 +9
    3243 -1567.7 + +1,568 -38.4 +38 -8.7 +9 -3.5 +3 -5.2 +5
    3245 -3931.1 + +3,931 -15.3 +15 -2.0 +2 -1.8 +2 -0.2 +0
    3248 -1902.2 + +1,902 -26.8 +27 -27.6 +28 -3.8 +4 -23.7 +24
    3252 -25645.1 + +25,645 -53.5 +54 -13.8 +14 -3.0 +3 -10.8 +11
    3253 -2559.5 + +2,560 -9.0 +9 -16.1 +16 -4.9 +5 -11.2 +11
    3254 -3656.1 + +3,656 -7.9 +8 -9.4 +9 -2.5 +3 -6.9 +7
    3257 -7315.2 + +7,315 -8.7 +9 -14.9 +15 -1.8 +2 -13.1 +13
    3260 -4087.5 + +4,087 -44.8 +45 -8.1 +8 -3.4 +3 -4.7 +5
    3261 -1775.7 + +1,776 -51.7 +52 -9.9 +10 -3.7 +4 -6.2 +6
    3263 -2057.6 + +2,058 -9.0 +9 -10.4 +10 -3.9 +4 -6.4 +6
    3265 -1198.9 + +1,199 -56.5 +56 -11.5 +11 -1.9 +2 -9.5 +10
    3267 -1682.4 + +1,682 -18.5 +18 -12.5 +12 -5.5 +5 -7.0 +7
    3272 -1171.9 + +1,172 -4.1 +4 -11.5 +12 -6.8 +7 -4.7 +5
    3273 -15209.7 + +15,210 -31.0 +31 -14.9 +15 -2.4 +2 -12.5 +12
    3274 -1027.1 + +1,027 -2.6 +3 -7.9 +8 -4.6 +5 -3.3 +3
    3275 -2141.6 + +2,142 -69.8 +70 -5.0 +5 -2.9 +3 -2.1 +2
    3276 -106.6 + +107 -76.3 +76 -3.0 +3 -3.0 +3 -0.0 +0
    3277 -20162.0 + +20,162 -38.0 +38 -15.0 +15 -2.1 +2 -12.9 +13
    3280 -2328.7 + +2,329 -0.0 +0 -4.5 +4 -2.3 +2 -2.2 +2
    3281 -26150.0 + +26,150 -83.0 +83 -16.3 +16 -3.9 +4 -12.4 +12
    3282 -3483.4 + +3,483 -34.2 +34 -9.7 +10 -3.0 +3 -6.6 +7
    3283 -4173.8 + +4,174 -37.3 +37 -7.3 +7 -3.9 +4 -3.3 +3
    3289 -15915.0 + +15,915 -21.9 +22 -6.5 +6 -2.4 +2 -4.0 +4
    3291 -2201.1 + +2,201 -14.0 +14 -7.8 +8 -3.2 +3 -4.6 +5
    3295 -12093.1 + +12,093 -39.1 +39 -9.1 +9 -2.1 +2 -7.0 +7
    3296 -1917.4 + +1,917 -46.8 +47 -8.3 +8 -3.8 +4 -4.5 +5
    3297 -964.8 + +965 -10.1 +10 -7.4 +7 -2.0 +2 -5.4 +5
    3298 -37430.7 + +37,431 -66.3 +66 -16.4 +16 -3.4 +3 -13.1 +13
    3299 -3600.0 + +3,600 -16.1 +16 -6.3 +6 -5.7 +6 -0.6 +1
    3301 -2107.3 + +2,107 -0.0 +0 -1.5 +1 -1.1 +1 -0.3 +0
    3302 -2683.5 + +2,684 -0.0 +0 -3.4 +3 -2.2 +2 -1.2 +1
    3303 -6359.3 + +6,359 -36.6 +37 -7.4 +7 -3.8 +4 -3.6 +4
    3305 -14573.4 + +14,573 -19.5 +19 -6.7 +7 -2.6 +3 -4.0 +4
    3306 -2101.5 + +2,102 -1.4 +1 -9.7 +10 -4.1 +4 -5.6 +6
    3308 -17249.9 + +17,250 -12.0 +12 -11.0 +11 -2.0 +2 -9.1 +9
    3309 -3699.7 + +3,700 -24.7 +25 -10.0 +10 -2.4 +2 -7.6 +8
    3311 -1567.9 + +1,568 -2.6 +3 -6.4 +6 -2.4 +2 -4.0 +4
    3313 -10491.2 + +10,491 -15.1 +15 -12.9 +13 -1.8 +2 -11.1 +11
    3317 -1716.9 + +1,717 -65.3 +65 -20.5 +21 -4.6 +5 -15.9 +16
    3318 -1950.3 + +1,950 -1.6 +2 -5.9 +6 -4.3 +4 -1.6 +2
    3319 -545.3 + +545 -2.2 +2 -18.4 +18 -9.1 +9 -9.2 +9
    3321 -10281.3 + +10,281 -27.4 +27 -2.4 +2 -1.4 +1 -1.0 +1
    3322 -7153.6 + +7,154 -42.6 +43 -2.5 +2 -2.2 +2 -0.3 +0
    3323 -68297.2 + +68,297 -37.8 +38 -12.4 +12 -2.4 +2 -10.0 +10
    3326 -2710.5 + +2,711 -4.0 +4 -6.9 +7 -4.0 +4 -2.9 +3
    3328 -2870.9 + +2,871 -42.8 +43 -11.3 +11 -3.4 +3 -7.9 +8
    3329 -1352.2 + +1,352 -37.5 +38 -12.5 +13 -2.4 +2 -10.2 +10
    - - - - - - - - - - - - - - - - - - - - -
    -Term - -Productions -
    -Productions - -NA -
    -Population - -0.676 -
    -Employment - -0.187 -
    -

    The positive correletions move us to the next step of estimating attraction models for the internal end of the IE/EI movements, assigning the external station the production end.

    -
    -
    +

    The number of IE/EI trips are controlled by the volumes at the external station after external travel has been subtracted. These trips are distributed using a gravity model. For this model form, attractions and gamma parameters must be estimated.

    +

    IE/EI Attraction Model Estimation

    -

    The NCSTM data was aggregated to the 43 district system to support model estimation. A handful of specifications were tested to determine a useful attraction model, including segmenting the attraction models for Freeway and non-Freeway external stations. The preferred model generated positive coefficients on the population and employment variables. It does not segment the stations by roadway type and asserts a Y-intercept of zero.

    -
    -

    Preferred

    +

    The NCSTM data was aggregated to the 43-district system to support model estimation. A handful of specifications were tested to determine a useful attraction model, including segmenting the attraction models for freeway and non-freeway external stations. These more complicated models generated coefficients with wrong signs. The final, simple model generated positive coefficients on the population and employment variables. It does not segment the stations by roadway type and asserts a Y-intercept of zero.

    @@ -1419,13 +1373,13 @@

    Preferred

    Employment @@ -1433,13 +1387,13 @@

    Preferred

    Population @@ -1447,7 +1401,7 @@

    Preferred

    Adjusted R-squared
    -0.0059 +0.0045 -0.0648 +0.0492 -0.9487 +0.9610
    -0.1551 +0.1559 -2.8129 +2.8185 -0.0076 +0.0075
    -0.5013 +0.5008 NA @@ -1458,332 +1412,199 @@

    Preferred

    +

    A scatter plot of the NCSTM and estimated attractions by the 43-district system is shown in the plot below.

    -
    -

    Freeway Model

    +
    +

    NCSTM and Estimated IE/EI Attractions

    +

    +
    +
    +

    Distance Assessment

    +

    The chart below plots the trip length frequency, with the trip length measured from the external station to the internal zone segmenting trips that enter the region at freeways from those at other locations. The plot shows a logical and expected outcome: trips entering the region at freeway external stations travel longer distances than those entering at non-freeway locations.

    +
    +

    IE/EI Trip Length Frequency

    +

    +

    The above chart suggests the IE/EI distribution model should be segmented by freeway and non-freeway stations. The estimated gravity parameters are shown below.

    - - - - - - - - - - - - - - -
    -Variable +Type -Estimate +a -t-statistic - -p value +b
    -intercept - -1950.0099 - -2.1975 - -0.0342 -
    -Employment - -0.0657 - -1.8059 - -0.0789 -
    -Population - -0.0438 +Freeway -1.7447 +5 -0.0891 +0.25
    -Adjusted R-squared - -0.4699 +NonFreeway -NA +5 -NA +0.60
    +

    The model estimated distances for freeways and non-freeways are plotted against the observed data in the two following figures.

    +
    +
    +

    IE/EI Observed and Estimated Trip Length Frequency for Freeway Stations

    +

    +
    +
    +

    IE/EI Observed and Estimated Trip Length Frequency for Non-Freeway Stations

    +

    +

    A table summarizing the mean trip lengths is presented below.

    -
    -

    Non-Freeway Model

    +
    +
    +

    IE/EI Average Trip Lengths

    - - - - - - - - - -
    -Variable - -Estimate +Category -t-statistic + +Source -p value +Mean Distance (miles)
    -intercept - -4411.9350 +Freeway -3.1756 + +Estimated -0.0030 +21.6
    -Employment - --0.0321 +Freeway --0.5641 + +Observed -0.5760 +26.0
    -Population - -0.0231 +Non-freeway -0.5867 + +Estimated -0.5609 +14.5
    -Adjusted R-squared - --0.0427 +Non-freeway -NA + +Observed -NA +15.9
    -
    -

    Combined with Intercept

    +
    +

    Final adjustments

    +

    During validation, The IE/EI model was enhanced to allow additional control of trip lengths. This is due to the existence of some town along the border. Stations near those towns need even shorter trip lengths. Two additional sets of gravity parameters were created and the terminology was changed as shown below.

    - - - - - - - - - - - - - - - - -
    -Variable +Type -Estimate +a -t-statistic - -p value +b
    -intercept - -6361.9449 - -3.1567 - -0.0031 -
    -Employment - -0.0335 - -0.4063 - -0.6868 -
    -Population - -0.0669 +Long -1.1727 +5 -0.2482 +0.25
    -Adjusted R-squared - -0.1222 +Medium -NA +5 -NA +0.60
    -

    A scatter plot of the NCSTM and estimated attractions by the 43-district system is shown in the plot below.

    -
    -
    -
    -

    NCSTM and Estimated IE/EI Attractions

    -

    -
    -
    -

    Distance Assessment

    -

    In the chart below, we plot the trip length frequency, with the trip length measured from the external station to the internal zone, for the NCSTM trips, segmenting trips that enter the region through external stations at Freeways from those at other locations. The plot shows a logical and expected outcome: trips entering the region at Freeway external stations travel longer than those entering at non-freeway locations.

    -
    -

    IE/EI Trip Length Frequency

    -

    -

    The above chart suggests the IE/EI distribution model should be segmented by station type, freeway or non-freeway. The external stations identified as freeways are as follows: 3298, 3252, 3305, 3277, 3308, 3281, 3273, 3313, 3323.

    -

    Using the friction factors from the existing IE/EI model, the model estimated distances for freeways and non-freeways are plotted against the observed data in the two following figures.

    -
    -
    -

    IE/EI Observed and Estimated Trip Length Frequency for Freeway Stations

    -

    -
    -
    -

    IE/EI Observed and Estimated Trip Length Frequency for Non-Freeway Stations

    -

    -

    A table summarizing the mean trip lengths is presented below.

    -
    -
    -
    -

    IE/EI Average Trip Lengths

    - - - - - - - - - - - - - - - - - - -
    -Category - -Source - -Mean Distance (miles) -
    -Freeway - -Estimated +Sort -21.6 -
    -Freeway - -Observed +5 -26.0 +1.30
    -Non-freeway - -Estimated +Very Short -14.5 -
    -Non-freeway - -Observed +5 -15.9 +1.60
    +

    The first two options (“Long” and “Medium”) are the same coefficients as estimated. The final two coefficients were arrived at through iterative trials during highway validation.

    IE/EI Time of Day

    diff --git a/docs/img/cv/cv.png b/docs/img/cv/cv.png new file mode 100644 index 00000000..353a45a2 Binary files /dev/null and b/docs/img/cv/cv.png differ diff --git a/docs/img/cv/mut.png b/docs/img/cv/mut.png new file mode 100644 index 00000000..e144444c Binary files /dev/null and b/docs/img/cv/mut.png differ diff --git a/docs/img/cv/sut.png b/docs/img/cv/sut.png new file mode 100644 index 00000000..4c7a2cca Binary files /dev/null and b/docs/img/cv/sut.png differ diff --git a/docs/img/mc/asc_calibration.png b/docs/img/mc/asc_calibration.png new file mode 100644 index 00000000..76d1011f Binary files /dev/null and b/docs/img/mc/asc_calibration.png differ diff --git a/docs/img/mc/fta_path_constants.png b/docs/img/mc/fta_path_constants.png new file mode 100644 index 00000000..7967246e Binary files /dev/null and b/docs/img/mc/fta_path_constants.png differ diff --git a/docs/img/mc/fta_visibility.png b/docs/img/mc/fta_visibility.png new file mode 100644 index 00000000..3bbdb460 Binary files /dev/null and b/docs/img/mc/fta_visibility.png differ diff --git a/docs/img/mc/full_tree.png b/docs/img/mc/full_tree.png new file mode 100644 index 00000000..04ca8f09 Binary files /dev/null and b/docs/img/mc/full_tree.png differ diff --git a/docs/img/mc/lr_adjustment.png b/docs/img/mc/lr_adjustment.png new file mode 100644 index 00000000..c5d40186 Binary files /dev/null and b/docs/img/mc/lr_adjustment.png differ diff --git a/docs/img/mc/n_hb_k12_1.png b/docs/img/mc/n_hb_k12_1.png new file mode 100644 index 00000000..c3e2ca68 Binary files /dev/null and b/docs/img/mc/n_hb_k12_1.png differ diff --git a/docs/img/mc/n_hb_k12_2.png b/docs/img/mc/n_hb_k12_2.png new file mode 100644 index 00000000..9d088ff4 Binary files /dev/null and b/docs/img/mc/n_hb_k12_2.png differ diff --git a/docs/img/mc/n_hb_od_long1.png b/docs/img/mc/n_hb_od_long1.png new file mode 100644 index 00000000..4e839518 Binary files /dev/null and b/docs/img/mc/n_hb_od_long1.png differ diff --git a/docs/img/mc/n_hb_od_long2.png b/docs/img/mc/n_hb_od_long2.png new file mode 100644 index 00000000..373cd971 Binary files /dev/null and b/docs/img/mc/n_hb_od_long2.png differ diff --git a/docs/img/mc/n_hb_od_long3.png b/docs/img/mc/n_hb_od_long3.png new file mode 100644 index 00000000..531f5ac1 Binary files /dev/null and b/docs/img/mc/n_hb_od_long3.png differ diff --git a/docs/img/mc/n_hb_od_long4.png b/docs/img/mc/n_hb_od_long4.png new file mode 100644 index 00000000..21510408 Binary files /dev/null and b/docs/img/mc/n_hb_od_long4.png differ diff --git a/docs/img/mc/n_hb_od_short1.png b/docs/img/mc/n_hb_od_short1.png new file mode 100644 index 00000000..dfd4822e Binary files /dev/null and b/docs/img/mc/n_hb_od_short1.png differ diff --git a/docs/img/mc/n_hb_od_short2.png b/docs/img/mc/n_hb_od_short2.png new file mode 100644 index 00000000..3a2fe43b Binary files /dev/null and b/docs/img/mc/n_hb_od_short2.png differ diff --git a/docs/img/mc/n_hb_od_short3.png b/docs/img/mc/n_hb_od_short3.png new file mode 100644 index 00000000..286284b0 Binary files /dev/null and b/docs/img/mc/n_hb_od_short3.png differ diff --git a/docs/img/mc/n_hb_od_short4.png b/docs/img/mc/n_hb_od_short4.png new file mode 100644 index 00000000..6976c448 Binary files /dev/null and b/docs/img/mc/n_hb_od_short4.png differ diff --git a/docs/img/mc/n_hb_ome1.png b/docs/img/mc/n_hb_ome1.png new file mode 100644 index 00000000..ce4835cd Binary files /dev/null and b/docs/img/mc/n_hb_ome1.png differ diff --git a/docs/img/mc/n_hb_ome2.png b/docs/img/mc/n_hb_ome2.png new file mode 100644 index 00000000..6c5039d9 Binary files /dev/null and b/docs/img/mc/n_hb_ome2.png differ diff --git a/docs/img/mc/n_hb_ome3.png b/docs/img/mc/n_hb_ome3.png new file mode 100644 index 00000000..7d4371c7 Binary files /dev/null and b/docs/img/mc/n_hb_ome3.png differ diff --git a/docs/img/mc/n_hb_ome4.png b/docs/img/mc/n_hb_ome4.png new file mode 100644 index 00000000..6c20681e Binary files /dev/null and b/docs/img/mc/n_hb_ome4.png differ diff --git a/docs/img/mc/n_hb_omed1.png b/docs/img/mc/n_hb_omed1.png new file mode 100644 index 00000000..428b4447 Binary files /dev/null and b/docs/img/mc/n_hb_omed1.png differ diff --git a/docs/img/mc/n_hb_omed2.png b/docs/img/mc/n_hb_omed2.png new file mode 100644 index 00000000..ffcca854 Binary files /dev/null and b/docs/img/mc/n_hb_omed2.png differ diff --git a/docs/img/mc/n_hb_omed3.png b/docs/img/mc/n_hb_omed3.png new file mode 100644 index 00000000..eb98c623 Binary files /dev/null and b/docs/img/mc/n_hb_omed3.png differ diff --git a/docs/img/mc/n_hb_omed4.png b/docs/img/mc/n_hb_omed4.png new file mode 100644 index 00000000..3b48c288 Binary files /dev/null and b/docs/img/mc/n_hb_omed4.png differ diff --git a/docs/img/mc/nhb_full_nest.png b/docs/img/mc/nhb_full_nest.png new file mode 100644 index 00000000..e783e39d Binary files /dev/null and b/docs/img/mc/nhb_full_nest.png differ diff --git a/docs/img/mc/transit_sub_other1.png b/docs/img/mc/transit_sub_other1.png new file mode 100644 index 00000000..dfd9354d Binary files /dev/null and b/docs/img/mc/transit_sub_other1.png differ diff --git a/docs/img/mc/transit_sub_other2.png b/docs/img/mc/transit_sub_other2.png new file mode 100644 index 00000000..11687307 Binary files /dev/null and b/docs/img/mc/transit_sub_other2.png differ diff --git a/docs/img/mc/transit_sub_other3.png b/docs/img/mc/transit_sub_other3.png new file mode 100644 index 00000000..a9313dde Binary files /dev/null and b/docs/img/mc/transit_sub_other3.png differ diff --git a/docs/img/mc/transit_sub_shop1.png b/docs/img/mc/transit_sub_shop1.png new file mode 100644 index 00000000..2cc84154 Binary files /dev/null and b/docs/img/mc/transit_sub_shop1.png differ diff --git a/docs/img/mc/transit_sub_shop2.png b/docs/img/mc/transit_sub_shop2.png new file mode 100644 index 00000000..7985b37b Binary files /dev/null and b/docs/img/mc/transit_sub_shop2.png differ diff --git a/docs/img/mc/transit_sub_work.png b/docs/img/mc/transit_sub_work.png new file mode 100644 index 00000000..57de8617 Binary files /dev/null and b/docs/img/mc/transit_sub_work.png differ diff --git a/docs/img/mc/transit_sub_work2.png b/docs/img/mc/transit_sub_work2.png new file mode 100644 index 00000000..1f6aa240 Binary files /dev/null and b/docs/img/mc/transit_sub_work2.png differ diff --git a/docs/img/mc/transit_sub_work3.png b/docs/img/mc/transit_sub_work3.png new file mode 100644 index 00000000..6b3ff75d Binary files /dev/null and b/docs/img/mc/transit_sub_work3.png differ diff --git a/docs/img/mc/transit_sub_work4.png b/docs/img/mc/transit_sub_work4.png new file mode 100644 index 00000000..06b93249 Binary files /dev/null and b/docs/img/mc/transit_sub_work4.png differ diff --git a/docs/img/mc/w_hb_o1.png b/docs/img/mc/w_hb_o1.png new file mode 100644 index 00000000..66fe3208 Binary files /dev/null and b/docs/img/mc/w_hb_o1.png differ diff --git a/docs/img/mc/w_hb_o2.png b/docs/img/mc/w_hb_o2.png new file mode 100644 index 00000000..dfc924c9 Binary files /dev/null and b/docs/img/mc/w_hb_o2.png differ diff --git a/docs/img/mc/w_hb_o3.png b/docs/img/mc/w_hb_o3.png new file mode 100644 index 00000000..934e596f Binary files /dev/null and b/docs/img/mc/w_hb_o3.png differ diff --git a/docs/img/mc/w_hb_o4.png b/docs/img/mc/w_hb_o4.png new file mode 100644 index 00000000..ce2e5bf3 Binary files /dev/null and b/docs/img/mc/w_hb_o4.png differ diff --git a/docs/img/mc/w_hb_o5.png b/docs/img/mc/w_hb_o5.png new file mode 100644 index 00000000..616eba4b Binary files /dev/null and b/docs/img/mc/w_hb_o5.png differ diff --git a/docs/img/mc/w_hb_w1.png b/docs/img/mc/w_hb_w1.png new file mode 100644 index 00000000..7fc79e93 Binary files /dev/null and b/docs/img/mc/w_hb_w1.png differ diff --git a/docs/img/mc/w_hb_w2.png b/docs/img/mc/w_hb_w2.png new file mode 100644 index 00000000..b0cd4826 Binary files /dev/null and b/docs/img/mc/w_hb_w2.png differ diff --git a/docs/img/mc/w_hb_w3.png b/docs/img/mc/w_hb_w3.png new file mode 100644 index 00000000..9a80ad75 Binary files /dev/null and b/docs/img/mc/w_hb_w3.png differ diff --git a/docs/img/mc/w_hb_w4.png b/docs/img/mc/w_hb_w4.png new file mode 100644 index 00000000..9e1f7a61 Binary files /dev/null and b/docs/img/mc/w_hb_w4.png differ diff --git a/docs/index.html b/docs/index.html index 7699a6ab..92333503 100644 --- a/docs/index.html +++ b/docs/index.html @@ -7,7 +7,7 @@ - + Triangle Regional Model G2 @@ -188,6 +188,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -240,7 +243,7 @@

    Triangle Regional Model G2

    Caliper Corporation

    -

    2022-02-22

    +

    2022-03-30

    diff --git a/docs/obs_processing.Rmd b/docs/obs_processing.Rmd index cc9bc7b2..e496669b 100644 --- a/docs/obs_processing.Rmd +++ b/docs/obs_processing.Rmd @@ -22,23 +22,24 @@ library(sf) ## Introduction -On-board surveys ask transit riders a series of question while they are on the -bus in order to obtain detailed information about demographics, trip purpose, -modes used, and other variables. These targeted surveys are critical for -understanding transit users in the region. Due to the small size of the transit -market in the Triangle and the small household survey sample size, the household -survey does not contain enough transit trips to estimate the behavior of these -travelers. - -In the Triangle, a survey of GoTriangle and DCHC transit was conducted in 2014 -(excluding Duke Transit). CAMPO agencies were surveyed in 2015 along with -new GoTriangle routes. These are used to gain insight into the transit market. +Due to the small size of the transit market in the Triangle and the small +household survey sample size, the household survey does not contain enough +transit observations to estimate behavioral models for transit users. On-board +surveys are critical to fill this gap. They are intercept surveys, which means +they are conducted on the bus itself. They ask transit riders a series of +questions in order to obtain detailed information about demographics, trip +purpose, modes used, and other variables. + +In the Triangle, a survey of 9,231 GoTriangle and DCHC area transit riders was +conducted in 2014 (excluding Duke Transit). 5,390 CAMPO area transit riders were +surveyed in 2015 along with new GoTriangle routes. These are used to gain +insight into the transit market. ## Combining surveys Unlike the household surveys, the transit surveys were collected by different -companies. Combining the surveys required translating the different field codes -from each into a combined set of values. The surveys collected the same basic +companies. Combining the surveys required translating different field codes +into a combined set of values. The surveys collected the same basic information, which made this process easier. Additionally, because different routes were collected in each survey, no routes were double counted and no adjustments to the sample weights were needed. @@ -265,9 +266,9 @@ combined <- bind_rows(goraleigh_2015, gotriangle_2014) %>% ## Geocoding As with the household survey, the transit trip latitude/longitude values were -translated into TRMG2 TAZs. For a transit trip, this includes the home, origin, -destination, boarding, and alighting locations. A sample of this is shown in -the table below: +translated into TRMG2 TAZs. For a transit trip, this includes the home, origin +(which could be somewhere other than home), destination, boarding, and alighting +locations. A sample of this is shown in the table below: ```{r, include=FALSE} tazs <- st_read("data/input/tazs/master_tazs.shp") %>% @@ -410,12 +411,15 @@ add_mode <- add_trip_purp %>% ## Reasonableness checks {.tabset} -A high level review of the combined survey was performed to make sure processing -had not introduced errors. Some of those checks are shown in the charts below. -All dimensions checked looked reasonable. +A high level review of the combined survey was performed to make sure field +translations and survey combination had not introduced errors. Some of those +checks are shown in the charts below. All dimensions checked looked reasonable. ### Agency +The distribution by agency lines up fairly well with the size of the agencies +themselves. + ```{r} temp <- add_mode %>% mutate(agency = ifelse(agency == "TTA", "GoTriangle", agency)) %>% @@ -439,6 +443,11 @@ ggplot(temp, aes(x = reorder(agency, -pct), y = pct)) + ### Time of Day +The time period definitions are the same as those +defined in the [time of day](time_of_day.html) documentation. The AM period, for +example, is only 2 hours. This explains the large number of transit observations +categorized as midday. + ```{r} temp <- add_mode %>% group_by(tod) %>% @@ -462,6 +471,9 @@ ggplot(temp, aes(x = tod, y = pct)) + ### Access Mode +Walk access is the dominant mode in the survey, which matches transit behavior +in the region. + ```{r} temp <- add_mode %>% group_by(access_mode) %>% @@ -482,6 +494,9 @@ ggplot(temp, aes(x = reorder(access_mode, -pct), y = pct)) + ### Fare Type +Free routes are the most common, which is a reflection of the CHT system which +offers many routes with no fare. + ```{r} temp <- add_mode %>% group_by(fare_type) %>% @@ -504,11 +519,19 @@ ggplot(temp, aes(x = reorder(fare_type, -pct), y = pct)) + ## Survey Assignment -For an additional check on both the on-board survey processing and the model's -route network, the on-board survey was assigned. This involved processing the -survey into the appropriate matrices by time of day, transit mode, and accesss -mode. The model assignment macros in TransCAD were used to assign these to the -route system, and the results were compared to reported ridership by route. +Caliper assigned the combined transit survey to the 2016 transit route networks +used by the model. Neither survey was collected in 2016, so route offerings are +likely different. However, assigning the survey can help identify errors in the +networks. These can include missing access links or route mis-coding, but also +gives insights into things transfers. If the model is not penalizing transfers +enough, this early assignment will show more transfers than in the survey. + +The assignment process involved processing the survey into the appropriate +matrices by time of day, transit mode, and accesss mode. The model's assignment +macros in TransCAD were used to assign the matrices to the route system, and the +results were compared to reported ridership by route. One of the conclusions +was in fact that transfers needed to be penalized heavier in the transit network +settings. In the survey, no one transferred more than once. ```{r, eval=FALSE} # This code chunk was used to create transit matrices for a test assignment. @@ -604,16 +627,14 @@ agency_comp %>% These differences are large, but manual path checking did not uncover any errors. This test assignment was performed on a network with a rough initial -guess of travel times, which likely contributed to these differences. Further +guess of travel times, which contributed to these differences. Further differences may be because the surveys were from 2014 and 2015 while the transit network is 2016. Any changes in route alignments or offerings would lead to higher discrepancies. -More attention will be paid to agency- and route-level ridership comparisons -during model validation. It may be that fares or other route attributes received -from stakeholders need to be adjusted. One positive result of this early test -was that all routes received ridership, which indicates no critical -accessibility flaws in the network. +During model development, much more accurate roadway times were calculated from +actual model assignments. Further, the transit network settings were also dialed +in to achieve final results that look much more reasonable. ```{r, eval=FALSE} write_csv(add_mode, "data/output/_PRIVATE/obs_processing/obs_processed.csv", na = "") diff --git a/docs/obs_processing.html b/docs/obs_processing.html index 676575ad..a5e97f6e 100644 --- a/docs/obs_processing.html +++ b/docs/obs_processing.html @@ -190,6 +190,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -205,6 +208,9 @@
  • Accessibility
  • +
  • + University +
  • Airport
  • @@ -261,16 +267,16 @@

    On-board Survey Processing

    Introduction

    -

    On-board surveys ask transit riders a series of question while they are on the bus in order to obtain detailed information about demographics, trip purpose, modes used, and other variables. These targeted surveys are critical for understanding transit users in the region. Due to the small size of the transit market in the Triangle and the small household survey sample size, the household survey does not contain enough transit trips to estimate the behavior of these travelers.

    -

    In the Triangle, a survey of GoTriangle and DCHC transit was conducted in 2014 (excluding Duke Transit). CAMPO agencies were surveyed in 2015 along with new GoTriangle routes. These are used to gain insight into the transit market.

    +

    Due to the small size of the transit market in the Triangle and the small household survey sample size, the household survey does not contain enough transit observations to estimate behavioral models for transit users. On-board surveys are critical to fill this gap. They are intercept surveys, which means they are conducted on the bus itself. They ask transit riders a series of questions in order to obtain detailed information about demographics, trip purpose, modes used, and other variables.

    +

    In the Triangle, a survey of 9,231 GoTriangle and DCHC area transit riders was conducted in 2014 (excluding Duke Transit). 5,390 CAMPO area transit riders were surveyed in 2015 along with new GoTriangle routes. These are used to gain insight into the transit market.

    Combining surveys

    -

    Unlike the household surveys, the transit surveys were collected by different companies. Combining the surveys required translating the different field codes from each into a combined set of values. The surveys collected the same basic information, which made this process easier. Additionally, because different routes were collected in each survey, no routes were double counted and no adjustments to the sample weights were needed.

    +

    Unlike the household surveys, the transit surveys were collected by different companies. Combining the surveys required translating different field codes into a combined set of values. The surveys collected the same basic information, which made this process easier. Additionally, because different routes were collected in each survey, no routes were double counted and no adjustments to the sample weights were needed.

    Geocoding

    -

    As with the household survey, the transit trip latitude/longitude values were translated into TRMG2 TAZs. For a transit trip, this includes the home, origin, destination, boarding, and alighting locations. A sample of this is shown in the table below:

    +

    As with the household survey, the transit trip latitude/longitude values were translated into TRMG2 TAZs. For a transit trip, this includes the home, origin (which could be somewhere other than home), destination, boarding, and alighting locations. A sample of this is shown in the table below:

    @@ -441,27 +447,32 @@

    Geocoding

    Reasonableness checks

    -

    A high level review of the combined survey was performed to make sure processing had not introduced errors. Some of those checks are shown in the charts below. All dimensions checked looked reasonable.

    +

    A high level review of the combined survey was performed to make sure field translations and survey combination had not introduced errors. Some of those checks are shown in the charts below. All dimensions checked looked reasonable.

    Agency

    +

    The distribution by agency lines up fairly well with the size of the agencies themselves.

    Time of Day

    +

    The time period definitions are the same as those defined in the time of day documentation. The AM period, for example, is only 2 hours. This explains the large number of transit observations categorized as midday.

    Access Mode

    +

    Walk access is the dominant mode in the survey, which matches transit behavior in the region.

    Fare Type

    +

    Free routes are the most common, which is a reflection of the CHT system which offers many routes with no fare.

    Survey Assignment

    -

    For an additional check on both the on-board survey processing and the model’s route network, the on-board survey was assigned. This involved processing the survey into the appropriate matrices by time of day, transit mode, and accesss mode. The model assignment macros in TransCAD were used to assign these to the route system, and the results were compared to reported ridership by route.

    +

    Caliper assigned the combined transit survey to the 2016 transit route networks used by the model. Neither survey was collected in 2016, so route offerings are likely different. However, assigning the survey can help identify errors in the networks. These can include missing access links or route mis-coding, but also gives insights into things transfers. If the model is not penalizing transfers enough, this early assignment will show more transfers than in the survey.

    +

    The assignment process involved processing the survey into the appropriate matrices by time of day, transit mode, and accesss mode. The model’s assignment macros in TransCAD were used to assign the matrices to the route system, and the results were compared to reported ridership by route. One of the conclusions was in fact that transfers needed to be penalized heavier in the transit network settings. In the survey, no one transferred more than once.

    The percent root mean square error (%RMSE) for route-level boardings was 96.8%. The agency-level %RMSE was 52.63%. The table below shows the agency-level comparison.

    @@ -609,8 +620,8 @@

    Survey Assignment

    -

    These differences are large, but manual path checking did not uncover any errors. This test assignment was performed on a network with a rough initial guess of travel times, which likely contributed to these differences. Further differences may be because the surveys were from 2014 and 2015 while the transit network is 2016. Any changes in route alignments or offerings would lead to higher discrepancies.

    -

    More attention will be paid to agency- and route-level ridership comparisons during model validation. It may be that fares or other route attributes received from stakeholders need to be adjusted. One positive result of this early test was that all routes received ridership, which indicates no critical accessibility flaws in the network.

    +

    These differences are large, but manual path checking did not uncover any errors. This test assignment was performed on a network with a rough initial guess of travel times, which contributed to these differences. Further differences may be because the surveys were from 2014 and 2015 while the transit network is 2016. Any changes in route alignments or offerings would lead to higher discrepancies.

    +

    During model development, much more accurate roadway times were calculated from actual model assignments. Further, the transit network settings were also dialed in to achieve final results that look much more reasonable.

    diff --git a/docs/parking.Rmd b/docs/parking.Rmd index 269f2d5e..b0c514c7 100644 --- a/docs/parking.Rmd +++ b/docs/parking.Rmd @@ -20,72 +20,67 @@ library(kableExtra) The parking model component of the TRMG2 determines where auto travelers will park and either walk or take transit to their ultimate destination. The model -is only applied to CBD and campus areas where parking is generally limited and -paid for. +is only applied to CBD and university areas where parking is generally limited +and requires payment. -In the TRMG2, the results of the parking model influence both mode and -destination choices, which means travelers react realistically to changes in -both parking availability and price. +In TRMG2, results of the parking model influence both mode choice and +destination choice, which means travelers react realistically to changes in +parking availability and price. ## Design The parking model is applied at the end of the demand model stream, just before -assignment. Trip matrices from the preceding demand models are combined across -purposes maintaining only segmentation between trips on work tours and trips on -non-work tours (and maintaining SOV and HOV forassignment classes – but applying -the same models to them). +traffic assignment. Trip matrices from the preceding demand models are combined +across purposes maintaining segmentation between trips on work tours and +non-work tours (SOV and HOV are also maintained for assignment classes, but the +same models are applied to them). -The model is adapted from the one estimated as part of the 2016 Triangle Region -Parking Behavior Study but reformulated for application in the regional model. -The estimated model did not include the lowest level choices of parking zone, -which Caliper has added. In addition, the TRMG2 model only includes "Park and -Walk" and "Auto Intercept" options ("Drive and Park" and "Drive, Park & Shuttle" -in the 2016 study). The regional model handles transit upstream in the mode -choice model. However, for the auto intercept mode, the modeled transit system -is used directly for path building. This ensures travelers see accurate options -rather than asserted average headways and fares. +The TRMG2 parking model is adapted from the one estimated for the 2016 Triangle +Region Parking Behavior Study conducted by RSG. It has been reformulated for +application in the regional model. Several differences exist between the model +estimated by RSG and the TRMG2 model. Cailper added the lowest level choices of +parking zones. In addition, the TRMG2 model only includes "Park and Walk" and +"Park and Transit" options. For "Park and Transit", the G2 uses the actual +transit network for path building. This ensures travelers see accurate options +and make appropriate decisions. -This design is a departure from the previous generation of the TRM in grouping -the Auto Intercept mode as part of the parking choices at the end of the model -stream rather than as part of the main mode choice model. The 2016 study -supports this structure as evidenced by nesting parameters less than one for its -Auto nest which included both the "Drive and Park" and the "Drive, Park & -Shuttle" modes. This structure and original nesting coefficient in the 2016 -study imply that travelers view the auto intercept mode as a parking strategy -and not a main mode choice. +The G2 design is a departure from the previous generation of the TRM, which +included "Park and Walk" (called "Auto Intercept") as a primary mode in the mode +choice model. G2 effectively treats it as a subchoice under the auto mode. The +structure and nesting coefficient in the 2016 RSG study imply that travelers +view the auto intercept mode as a parking strategy and not a main mode choice. ```{r, out.width="75%", fig.show='hold',fig.align='center'} knitr::include_graphics("img/parking/choice_tree.png") ``` -The applied parking model in the TRM is a two-level, nested logit combined -parking mode and destination choice model. Various coefficient values and -implied values of time differ from those in the 2016 report for a number of -reasons including that the RP to SP scaling has been applied here, the model -has been calibrated, and because the values of time reported in the 2016 report -were generally scaled in comparison to willingness to pay transit fare rather -than parking costs. +The parking model in TRMG2 is a two-level, nested logit model that combines +parking mode and destination choice. Various coefficient values and implied +values of time differ from those in 2016 for a number of reasons. This includes +Caliper's scaling from Stated Preference to Revealed Preference, calibration of +the model, and because the values of time reported in the 2016 report were +generally scaled to transit fares rather than parking costs. The model was calibrated to the 2016 survey in two steps. The model's AM period -was calibrated since the survey only asked about the first trip of the day -downtown / to campus and was strongly skewed towards AM trips. First, the lower +was calibrated since the survey only asked about the first trip of the day to +downtown / campus and was strongly skewed towards AM trips. First, the lower level, parking destination choice was calibrated by scaling the non-size terms -to match the observed average walk time from the survey for the park and walk +to match the observed average walk time from the survey for the park-and-walk mode which was 7.8 minutes for CBD areas and 11.25 minutes for campus areas. -The park and shuttle coefficients were scaled consistent with park and walk -rather than calibrated separately both in order to biasing the mode choice and +The park-and-shuttle coefficients were scaled consistent with park and walk +rather than calibrated separately. This was both in order to bias the mode choice and because the 2016 survey would have required significant additional processing -to obtain the time from parking to destination for the park and shuttle mode. -Second, the upperl level parking mode choice was calibrated by adjusting the +to obtain the time from parking to destination for the park-and-shuttle mode. +Second, the upper level parking mode choice was calibrated by adjusting the alternative specific bias constant for park and shuttle to match observed shares for the mode which were 6.9% for CBD areas and 14.6% for campus areas. A parameter was also added to allow the model to better reflect satellite -parking lots like the Friday Center at UNC. In the absence of actual counts -of parking at the Friday Center, the parameter was simply asserted to produce -a roughly reasonable amount of parking of about 250. +parking lots like the Friday Center at UNC. In the absence of actual parking counts +at the Friday Center, the parameter was simply asserted to produce +a reasonable amount of about 250 parked vehicles. -```{r, out.width="90%", fig.show='hold',fig.align='center'} +```{r, out.width="80%", fig.show='hold',fig.align='center'} knitr::include_graphics("img/parking/coeffs.png") ``` diff --git a/docs/parking.html b/docs/parking.html index b6f8a497..bb8cd967 100644 --- a/docs/parking.html +++ b/docs/parking.html @@ -188,6 +188,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -203,6 +206,9 @@
  • Accessibility
  • +
  • + University +
  • Airport
  • @@ -251,19 +257,19 @@

    Parking Model

    Introduction

    -

    The parking model component of the TRMG2 determines where auto travelers will park and either walk or take transit to their ultimate destination. The model is only applied to CBD and campus areas where parking is generally limited and paid for.

    -

    In the TRMG2, the results of the parking model influence both mode and destination choices, which means travelers react realistically to changes in both parking availability and price.

    +

    The parking model component of the TRMG2 determines where auto travelers will park and either walk or take transit to their ultimate destination. The model is only applied to CBD and university areas where parking is generally limited and requires payment.

    +

    In TRMG2, results of the parking model influence both mode choice and destination choice, which means travelers react realistically to changes in parking availability and price.

    Design

    -

    The parking model is applied at the end of the demand model stream, just before assignment. Trip matrices from the preceding demand models are combined across purposes maintaining only segmentation between trips on work tours and trips on non-work tours (and maintaining SOV and HOV forassignment classes – but applying the same models to them).

    -

    The model is adapted from the one estimated as part of the 2016 Triangle Region Parking Behavior Study but reformulated for application in the regional model. The estimated model did not include the lowest level choices of parking zone, which Caliper has added. In addition, the TRMG2 model only includes “Park and Walk” and “Auto Intercept” options (“Drive and Park” and “Drive, Park & Shuttle” in the 2016 study). The regional model handles transit upstream in the mode choice model. However, for the auto intercept mode, the modeled transit system is used directly for path building. This ensures travelers see accurate options rather than asserted average headways and fares.

    -

    This design is a departure from the previous generation of the TRM in grouping the Auto Intercept mode as part of the parking choices at the end of the model stream rather than as part of the main mode choice model. The 2016 study supports this structure as evidenced by nesting parameters less than one for its Auto nest which included both the “Drive and Park” and the “Drive, Park & Shuttle” modes. This structure and original nesting coefficient in the 2016 study imply that travelers view the auto intercept mode as a parking strategy and not a main mode choice.

    +

    The parking model is applied at the end of the demand model stream, just before traffic assignment. Trip matrices from the preceding demand models are combined across purposes maintaining segmentation between trips on work tours and non-work tours (SOV and HOV are also maintained for assignment classes, but the same models are applied to them).

    +

    The TRMG2 parking model is adapted from the one estimated for the 2016 Triangle Region Parking Behavior Study conducted by RSG. It has been reformulated for application in the regional model. Several differences exist between the model estimated by RSG and the TRMG2 model. Cailper added the lowest level choices of parking zones. In addition, the TRMG2 model only includes “Park and Walk” and “Park and Transit” options. For “Park and Transit”, the G2 uses the actual transit network for path building. This ensures travelers see accurate options and make appropriate decisions.

    +

    The G2 design is a departure from the previous generation of the TRM, which included “Park and Walk” (called “Auto Intercept”) as a primary mode in the mode choice model. G2 effectively treats it as a subchoice under the auto mode. The structure and nesting coefficient in the 2016 RSG study imply that travelers view the auto intercept mode as a parking strategy and not a main mode choice.

    -

    The applied parking model in the TRM is a two-level, nested logit combined parking mode and destination choice model. Various coefficient values and implied values of time differ from those in the 2016 report for a number of reasons including that the RP to SP scaling has been applied here, the model has been calibrated, and because the values of time reported in the 2016 report were generally scaled in comparison to willingness to pay transit fare rather than parking costs.

    -

    The model was calibrated to the 2016 survey in two steps. The model’s AM period was calibrated since the survey only asked about the first trip of the day downtown / to campus and was strongly skewed towards AM trips. First, the lower level, parking destination choice was calibrated by scaling the non-size terms to match the observed average walk time from the survey for the park and walk mode which was 7.8 minutes for CBD areas and 11.25 minutes for campus areas. The park and shuttle coefficients were scaled consistent with park and walk rather than calibrated separately both in order to biasing the mode choice and because the 2016 survey would have required significant additional processing to obtain the time from parking to destination for the park and shuttle mode. Second, the upperl level parking mode choice was calibrated by adjusting the alternative specific bias constant for park and shuttle to match observed shares for the mode which were 6.9% for CBD areas and 14.6% for campus areas.

    -

    A parameter was also added to allow the model to better reflect satellite parking lots like the Friday Center at UNC. In the absence of actual counts of parking at the Friday Center, the parameter was simply asserted to produce a roughly reasonable amount of parking of about 250.

    -

    +

    The parking model in TRMG2 is a two-level, nested logit model that combines parking mode and destination choice. Various coefficient values and implied values of time differ from those in 2016 for a number of reasons. This includes Caliper’s scaling from Stated Preference to Revealed Preference, calibration of the model, and because the values of time reported in the 2016 report were generally scaled to transit fares rather than parking costs.

    +

    The model was calibrated to the 2016 survey in two steps. The model’s AM period was calibrated since the survey only asked about the first trip of the day to downtown / campus and was strongly skewed towards AM trips. First, the lower level, parking destination choice was calibrated by scaling the non-size terms to match the observed average walk time from the survey for the park-and-walk mode which was 7.8 minutes for CBD areas and 11.25 minutes for campus areas. The park-and-shuttle coefficients were scaled consistent with park and walk rather than calibrated separately. This was both in order to bias the mode choice and because the 2016 survey would have required significant additional processing to obtain the time from parking to destination for the park-and-shuttle mode. Second, the upper level parking mode choice was calibrated by adjusting the alternative specific bias constant for park and shuttle to match observed shares for the mode which were 6.9% for CBD areas and 14.6% for campus areas.

    +

    A parameter was also added to allow the model to better reflect satellite parking lots like the Friday Center at UNC. In the absence of actual parking counts at the Friday Center, the parameter was simply asserted to produce a reasonable amount of about 250 parked vehicles.

    +

    diff --git a/docs/resident_dest_choice.Rmd b/docs/resident_dest_choice.Rmd index 182f4c23..22426e52 100644 --- a/docs/resident_dest_choice.Rmd +++ b/docs/resident_dest_choice.Rmd @@ -33,7 +33,7 @@ General modeling aspects: - Destinations are predicted at the zone (TAZ) level. - Based on an analysis of the region, the zones were logically grouped into the following clusters: -```{r, out.width= "100%", out.height="100%"} +```{r, out.width= "100%", out.height="80%"} knitr::include_graphics("img/dc/clusters.png") ``` @@ -114,25 +114,71 @@ The nested specification outperformed all other approaches. ### Utility Specification -The utility specification for the DC models consisted of the following set of variables. Not all these variables are present in every specification since variables are retained depending on the significance of their estimated coefficients or owing to strong apriori assumptions. +The utility specification for the DC models consisted of the following set of +variables. Not all these variables are present in every specification since +variables are retained depending on the significance of their estimated +coefficients or owing to strong apriori assumptions. + +#### Size term + +This is the natural logarithm of the attractions at a destination zone, which +itself is a linear combination of employment variables and their coefficients. +These coefficients are simultaneously estimated with other parameters. Since it +is required that these coefficients be positive, the utility formulation +specifies coefficients that are then exponentiated. Successful estimation +requires that one of these attraction variable coefficients is fixed. The +estimation also provides a coefficient for the size variable, which +theoretically should be between 0 and 1. + +#### Mode Choice Logsums + +Using mode choice root logsums typically presents a problem in destination +choice. The root logsum value is dominated by the auto nest which washes out +effects from non-dominant modes. For example, drastically improving the +transit accessibility to a particular zone will not affect the probability of +choosing that zone if the transit logsums are dwarfed by the auto logsums. To +circumvent this issue, separate auto, transit, and non-household auto logsums +from the respective nests are used in the TRM specifications. The logsums are +segmented by vehicle sufficiency market segments for added explanatory power. +Work trip logsums are further distinguished by high- and low-income segments. +Finally, note that there is no auto logsum component for the zero-vehicle +households market segment. + +#### Others + +- **4D measures**: Several 4D measures such as transit accessibility, hospital +accessibility, walk accessibility, employment densities are considered. +- **Time Coefficient(s)**: The auto times corresponding to the latest model +skims are attached depending upon the time of day the respondent made the trip. +A time coefficient is estimated in each specification. +- **Intra-Cluster effects**: Intra-Cluster coefficients are estimated if the +home and the chosen zone are within the same cluster (nest). +- **Intra-Zonal effect**: A coefficient is estimated to boost the utility of +choosing the same zone as the origin. +- **Cluster Nest coefficients**: A nest coefficient for each of the 12 clusters +is estimated. +- **Cluster ASCs**: Likewise, ASCs are estimated for each cluster. + +The rich utility specification allows the model to capture the decision-making +process of choosing destination zones and capturing cluster-to-cluster flows. -- Size term: This is the natural logarithm of the attractions at a destination zone, which itself is a linear combination of employment variables and their coefficients. These coefficients are simultaneously estimated with other parameters. Since it is required that these coefficients be positive, the utility formulation specifies coefficients that are then exponentiated. Successful estimation requires that one of these attraction variable coefficients is fixed. The estimation also provides a coefficient for the size variable, which theoretically should be between 0 and 1. -- Mode Choice Logsums: Using mode choice root logsums typically presents a problem in destination choice. The root logsum value is dominated by the auto nest which washes out any effects from the non-dominant modes. For example, drastically improving the transit accessibility to a particular zone will not affect the probability of choosing that zone, if the transit logsums are dwarfed by the auto logsums. To circumvent this issue, separate auto, transit, and non-household auto logsums from the respective nests are used in the TRM specifications. The logsums are segmented by vehicle sufficiency market segments for added explanatory power. The work logsums are further distinguished by high- and low-income segments. Finally, note that there is no auto logsum component for the zero-vehicle households market segment. -- 4D measures: Several 4D measures such as transit accessibility, hospital accessibility, walk accessibility, employment densities are considered. -- Time Coefficient(s): The auto times corresponding to the latest model skims are attached depending upon the time of day the respondent made the trip. A time coefficient is estimated in each specification. -- Intra-Cluster effects: Intra-Cluster coefficients are estimated if the home and the chosen zone are within the same cluster (nest). -- Intra-Zonal effect: A coefficient is estimated to boost the utility of choosing the same zone as the origin. -- Cluster Nest coefficients: A nest coefficient for each of the 12 clusters is estimated. -- Cluster ASCs: Likewise, ASCs are estimated for each cluster. +### General Estimation Observations -The rich utility specification allows the model to capture the decision-making process of choosing destination zones and capturing cluster-to-cluster flows. +ASCs were estimated for all but one of the clusters, but only the significant +ASCs were kept for each model purpose. The ASCs with poor significance generally +had values close to zero. Further, these ASCs are very small in magnitude, +confirming that the model’s explanatory power is derived from other variables +with the ability to capture location choice behavior. -### General Estimation Observations +Intrazonal effects were strong for a few of the purposes, but intra-cluster +effects were very strong implying that there is a tendency to choose locations +that belong in the same cluster as the home zone. -- ASCs were estimated for all but one of the clusters. Only the significant ASCs were kept for each model purpose. The ASCs with poor significance generally had values close to zero. Further, these ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior. -- Intrazonal effects are strong for a few of the purposes -- Intra-Cluster effects are strong implying that there is a tendency to choose locations that belong in the same cluster as the home zone. -- Using mode choice logsums for the auto model (and for vehicle sufficient segments) caused unrealistic signs as well as much poorer model fit. This can be explained by the fact that these logsums are primarily driven by auto skims, which are already a part of the utility equation. Hence most specifications do not have an auto logsum. +Mode choice logsums for the auto model (and for vehicle sufficient +segments) caused unrealistic signs as well as much poorer model fit. This can be +explained by the fact that these logsums are primarily driven by auto skims, +which are already a part of the utility equation. Hence most specifications do +not have an auto logsum. ### W_HB_W @@ -155,20 +201,39 @@ knitr::include_graphics("img/dc/w_hb_w_all3.png") knitr::include_graphics("img/dc/w_hb_w_all4.png") ``` -#### Notes +ASCs were estimated for all but one of the clusters. Several had values close to +zero with poor significance and were dropped. The ASCs are very small in +magnitude, confirming that the model’s explanatory power is derived from other +variables and is sensitive to changes in model inputs. + +Intrazonal and home cluster effects are strong. Residents of Cary, Durham, +Chapel Hill, Garner often live and work in the same cluster. Similar effects are +also true for the peripheral clusters toward the northeast and south. + +Root mode choice logsums are significant for workers from zero-auto households, +capturing a coupling with access to public transit modes. Using root mode choice +logsums for the other auto sufficiency markets caused unrealistic signs as well +as much poorer model fit. This can be explained by the fact that these logsums +are primarily driven by auto skims, which are already a part of the utility +equation (the "Time" term). -- ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior. -- Intrazonal effects are strong. -- Home cluster effects are also strong, with residents of Cary, Durham, Chapel Hill, Garner often living and working in the same cluster. Similar effects are also true for the peripheral clusters toward the northeast and south. -- Mode choice logsums are significant for workers from zero-auto households, capturing a coupling with access to public transit modes. -- Using mode choice logsums for the other auto sufficiency markets (with or without the presence of congested time) caused unrealistic signs as well as much poorer model fit. This can be explained by the fact that these logsums are primarily driven by auto skims, which are already a part of the utility equation. -- Hospitals being a major source of employment in the region, accessibility to hospital jobs helps drive work destination attractiveness. Transit and walk accessibilities are also drivers of a zone’s attractiveness for work tours. -- The cluster nest coefficients are mostly significantly different from 1, so the nested structure is justified. Research Triangle Park is a special cluster (very low residential land use, very high technology employment, etc.) and has an MNL-type sub-nest. -- The size variables are stratified by income. The percentage of low and high paying jobs in each zone was used to split the total attraction into income-specific attractions. +Hospitals are major sources of employment in the region, and accessibility to +hospital jobs helps drive work destination attractiveness. Transit and walk +accessibilities are also drivers of a zone’s attractiveness for work tours. + +Most of the cluster nest coefficients are significantly different from 1, which +means the nested structure is justified. Research Triangle Park is a special +cluster (very low residential land use, very high technology employment, etc.). +It's coefficient is effectively 1, which implies an MNL-type sub-nest. + +The size variables are stratified by income using the percentage of low and high +paying jobs in each zone. This is how the model pairs high-income workers (e.g. +in the Regency area of Cary) with high-paying jobs in RTP. This stratification +is critical for accurate work flows in region. #### Double constraint and attraction model -All trip types other than work are singly-constrained. This means that the row +All trip types (other than work) are singly-constrained. This means that the row sums of the resulting trip table will match starting productions, but column sums will not necessarily be proportional to the amount of employment in each zone. As an example, for two zones with the same employment, the more accessible @@ -308,22 +373,21 @@ broom::tidy(model) %>% ``` During model application, predicted attractions are always scaled to match -predicted productions to match double constraint. As a consequence, the +predicted productions before applying double constraint. As a consequence, the attraction model should predict total attractions that is close to predicted productions. This is referred to as PA balance ("production/attraction balance"). +If it does not, adding employment to a zone will have unexpectedly high (or low) +impacts on total zonal attractions. The coefficients in the table above were multiplied by 2.45 during model -calibration to achieve the appropriate PA balance of 1.0. If the ratio is too -high or too low, adding additional employment to a zone will have unexpectedly -high or low impact on trips attracted (given the scaling that must be done -to attractions). +calibration to achieve the appropriate PA balance of 1.0. ### W_HB_O -W_HB_O corresponds to trips that are part of a work tour, have one end at home, +W_HB_O trips are part of a work tour, have one end at home, and the other end at a non-work location. This segment captures trips that directly connect the home to an intermediate stop on the way to or from work and -translates to the HBO trip purpose in traditional models. Destination choice in +is part of the HBO trip purpose in traditional models. Destination choice in this context relates to a short-term decision of intermediate stop location. #### Estimated coefficients and t statistics @@ -337,23 +401,42 @@ knitr::include_graphics("img/dc/w_hb_o_all2.png") knitr::include_graphics("img/dc/w_hb_o_all3.png") ``` -#### Notes +ASCs were estimated for all but one of the clusters, but only the significant +ones were retained. These ASCs are very small in magnitude, confirming that the +model’s explanatory power is derived from other variables and is sensitive to +changes in model inputs. + +The model has a sophisticated approach to travel time. Travel time is more +important up to the first 30 minutes. Excess time beyond this threshold is gets +a slight discount. This reflects a difference in the behavior for these longer +trips, which are dominated more by what is at the destination rather than the +time to get there. -- ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior. -- Intrazonal effects are strong. -- Travel time is more important up to the first 30 minutes. Excess time beyond this threshold is perceived as being less onerous. -- Home cluster effects are also strong, with residents of Cary, Garner, Raleigh, Southern Durham and the southeastern periphery often stopping in the same cluster for their intermediate stops. -- The public transit mode choice logsum is significant. -- The number of households is a major factor in the calculation of the size variable, reflecting the inclusion of personal visits in this trip purpose. Logically, the retail and service employment also help drive trips to specific destinations. Income-based stratifications of employment did not make a difference. -- The cluster nest coefficients are mostly significantly different from 1, so the nested structure is justified. Research Triangle Park is a special cluster (very low residential land use, very high technology employment, etc.) and has a nearly MNL-type sub-nest. +Home cluster effects are strong for residents of Cary, Garner, Raleigh, Southern +Durham and the southeastern periphery. People in these clusters often stop in +the same cluster for their intermediate activities. + +The public transit mode choice logsum is significant for certain segments like +zero-vehicle households. Non-household auto logsums (primarily taxis and TNCs) +are also significant. + +The number of households is a major factor in the calculation of the size +variable, reflecting the inclusion of personal visits in this trip purpose. +Logically, the retail and service employment also help drive trips to specific +destinations. Income-based stratification of employment did not make a +difference in model performance. + +Most cluster nest coefficients are significantly different from 1, so the +nested structure is justified. As with W_HB_W, RTP is an exception to this. ### W_HB_EK12 W_HB_EK12 corresponds to trips that are part of a work tour, have one end at home, and the other end at a school. This segment captures trips that directly connect the home to school on the way to or from work and translates to the HB -School trip purpose in traditional models. It includes worker dropping -off/picking up children on the way to/from work, and workers who attend school. +School trip purpose in traditional models. It includes workers dropping +off/picking up children on the way to/from work, and young workers who attend +school. #### Estimated coefficients and t statistics @@ -365,11 +448,10 @@ knitr::include_graphics("img/dc/w_hb_ek12_all1.png") knitr::include_graphics("img/dc/w_hb_ek12_all2.png") ``` -#### Notes - -- School location choice is driven by travel time. -- Attractions are driven by school enrollment. -- Some of the cluster nest coefficients are significantly different from 1, so the nested structure is justified. +School location choice is driven by travel time and school enrollment. Some of +the cluster nest coefficients are significantly different from 1, so the nested +structure is justified for those clusters. The remaining zones are treated as +top-level choices (MNL). ### N_HB_OME @@ -389,16 +471,13 @@ knitr::include_graphics("img/dc/n_hb_ome_all2.png") knitr::include_graphics("img/dc/n_hb_ome_all3.png") ``` -#### Notes +As with the previous models, cluster-based ASCs are small and only applied to +a few clusters, which implies a model that is appropriately sensitive. Travel +time is again more important up to the first 30 minutes. Excess time beyond this +threshold is perceived as being less onerous. -- ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior. -- Intrazonal effects are strong. -- Travel time is more important up to the first 30 minutes. Excess time beyond this threshold is perceived as being less onerous. -- Auto and transit logsums (by the relevant auto sufficiency category) play a role in the perception of zones’ attractiveness. -- Home cluster effects are strong, with residents often choosing to shop in the same cluster as their residence. -- Attractions are driven by office, retail and service employment, which is expected for a trip purpose that involves shopping. - -The cluster nest coefficients are mostly significantly different from 1, so the nested structure is justified. +Attractions are driven by retail and service employment, which is appropriate +for shopping/maintenance trips. ### N_HB_OMED @@ -417,17 +496,16 @@ knitr::include_graphics("img/dc/n_hb_omed_all2.png") knitr::include_graphics("img/dc/n_hb_omed_all3.png") ``` -#### Notes +A zone's accessibility to a hospital is a strong predictor in this model. This +captures the important clustering of doctor's offices, clinics, and pharmacies +around major hospitals. -- ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior. -- Intrazonal effects are strong. -- Travel time is more important up to the first 30 minutes. Excess time beyond this threshold is perceived as being less onerous. -- Auto logsums (by the relevant auto sufficiency category) play a role in the perception of zones’ attractiveness. -- Accessibility to hospitals plays a positive role in boosting a zone’s attractiveness, which is apt for this trip purpose. -- Home cluster effects are also strong, particularly with residents of Garner, Raleigh, Southern Durham and the north-eastern. -- Attractions are driven by retail and service employment, with an added boost from service employment related to the presence of hospitals. +Travel time above 30 minutes has almost no impact on desintation choice. This is +an encouraging result. People traveling long distances to see a specialist are +not impacted by travel time in a major way. -Most of the cluster nest coefficients are significantly different from 1, so the nested structure is justified. +Attractions are driven by retail and service employment, with an added boost +from service employment related to the presence of hospitals. ### N_HB_K12 @@ -445,19 +523,19 @@ knitr::include_graphics("img/dc/n_hb_k12_all1.png") knitr::include_graphics("img/dc/n_hb_k12_all2.png") ``` -#### Notes - -- ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior. -- Intra-zonal and intra-cluster effects are strong. -- Attractions are driven by school enrollment. -Some of the cluster nest coefficients are significantly different from 1, so the nested structure is justified. Others default to an MNL-like behavior. +This model has a strong fit and is primarily based on K12 enrollment and travel +time (as expected). Additionally, the negative coefficient on the Durham and +Northeast clusters means that (all else equal) zones in that cluster are less +likely to be chosen. The positive intrazonal coefficient is expected given +the tendency for people to be assigned to schools near their home. ### N_HB_OD_Long N_HB_ODLong corresponds to trips that are not part of a work tour, have one end at home, and the other end at an “other” activity that is at least 30 minutes -long. This segment would have been rolled into the HBO (Home-Based Other) trip -purpose in traditional models. +long. Visiting a friend is one example of this trip type. This segment would +have been rolled into the HBO (Home-Based Other) trip purpose in traditional +models. #### Estimated coefficients and t statistics @@ -470,19 +548,17 @@ knitr::include_graphics("img/dc/n_hb_od_long2.png") knitr::include_graphics("img/dc/n_hb_od_long3.png") ``` -#### Notes - -- Longer drive times penalize a destination’s appeal, though times over 30 minutes are not perceived to be as onerous. -- Hospital accessibility impacts destination attractiveness in a positive manner, likely accounting for longer medical appointments. -- Walk access is viewed positively, especially in h -- Most home cluster effects are also strong. -- Attractions are driven by employment and school enrollment totals as well as the residential population, the latter due to longer social visits. - -The cluster nest coefficients are all significantly different from 1, so the nested structure is justified. +The penalty on travel time above 30 minutes is reduced as seen in multiple trip +trypes. The positive coefficient on walk accessibility means that these trips +are more attracted to zones in area that are more dense. Home cluster effects +are strong, meaning that most of these trips stay within the home cluster. +Employment, school enrollment, and residential population all are significant +components of the size term. This reflects the catch-all nature of this trip +type. ### N_HB_OD_Short -N_HB_ODShort corresponds to trips that are not part of a work tour, have one end +N_HB_OD_Short corresponds to trips that are not part of a work tour, have one end at home, and the other end at an “other” activity shorter than 30 minutes in duration. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models. @@ -498,22 +574,22 @@ knitr::include_graphics("img/dc/n_hb_od_short2.png") knitr::include_graphics("img/dc/n_hb_od_short3.png") ``` -#### Notes -- Drive time attenuates the attractiveness of farther destinations, though times beyond 20 minutes are perceived as less onerous than the first 20. -- Home cluster effects are also strong across all clusters, indicating a strong preference for proximate destinations for relatively short non-mandatory activities. -- Additionally, intra-zonal destinations are accorded higher preference. -- Mode choice logsums related to public transit and non-household auto modes (such as ride-hailing) play a positive role in elevating the desirability of destinations. -- Attractions are driven by employment and school enrollment totals as well as each zone’s residential population, an indication of the propensity to participate in social visits. +The attenuation on for drive time above 20 minutes is not as strong as other +trip types. This indicates that these trips are likely just trying to reach the +nearest attraction that satisfies the trip purpose. Similarly, home cluster +effects and the intrazonal term are strong meaning that people stay nearby. -All cluster nest coefficients are significantly different from 1, so the nested structure is justified. +Logsums for household auto, non-household auto, and transit all play an important +role in determining where someone travels. Similar to the N_HB_OD_Long trip, +the size term is made up of employment, enrollment, and household population. ## Non-Home-Based trips -The non-home-based trip purposes are multinomial logit (MNL) specifications. -Though the cluster-based nested structure of the home-based purposes was not -adopted, cluster-level alternative-specific constants (ASCs) are included to -help capture cluster-level preferences. +The non-home-based trip purposes are multinomial logit (MNL) specifications. The +cluster-based nesting structure did not improve model results and was not +adopted for these purposes. Even so, the cluster-level alternative-specific +constants (ASCs) are included to help capture cluster-level preferences. The non-home-based trip purposes consist of: @@ -525,9 +601,9 @@ The non-home-based trip purposes consist of: - N_NH_K12 - N_NH_OME - N_NH_O - -Given the strong dependence of destination choice on the prior home-based trip’s -travel mode for non-home-based trips, separate destination choice models have + +Recall that non-home-based trips are generated by mode based on the results of +the home-based models. Given this, separate destination choice models have been estimated by mode. For simplicity and statistical efficiency of model estimation, the above trip purposes were combined into the following four categories that combine trip purpose with travel mode: @@ -540,10 +616,9 @@ categories that combine trip purpose with travel mode: ### NHB Work Auto This segment corresponds to trips that are part of a work tour, do not have -either end at home, have at least one end at school, work, work-related or -“other” activities, and are associated with a drive mode. This segment would -have been rolled into the HBO (Home-Based Other) trip purpose in traditional -models. +either end at home, and are associated with the auto mode. This segment would +have been rolled into the NHBW (Non-Home-Based Work) trip purpose in +traditional models. #### Estimated coefficients and t statistics @@ -555,18 +630,17 @@ knitr::include_graphics("img/dc/w_nhb_auto1.png") knitr::include_graphics("img/dc/w_nhb_auto2.png") ``` -#### Notes - -- Drive time has the expected negative sensitivity, with times above 30 minutes penalized to a lesser extent than the first 30 minutes. -- The generic intra-cluster effect is strong. -- Attractions are driven by a variety of employment totals, as well as school enrollment. +Drive time has the expected negative sensitivity, with times above 30 minutes +penalized to a lesser extent than the first 30 minutes. The generic +intra-cluster effect is strong, which is expected given shorter trip lengths of +NHB trips. Attractions are driven by a variety of employment as well as +school enrollment. ### NHB NonWork Auto This segment corresponds to trips that are part of a non-work tour, do not have -either end at home, and have at least one end at school or “other” activities. -This segment would have been rolled into the HBO (Home-Based Other) trip purpose -in traditional models. +either end at home, and use the auto mode. This segment would have been rolled +into the NHBO (Non-Home-Based Other) trip purpose in traditional models. #### Estimated coefficients and t statistics @@ -578,18 +652,13 @@ knitr::include_graphics("img/dc/n_nhb_auto1.png") knitr::include_graphics("img/dc/n_nhb_auto2.png") ``` -#### Notes - -- Drive time has the expected negative sensitivity, with times above 30 minutes penalized to a lesser extent than the first 30 minutes. -- Intra-cluster and intra-zonal effects, both generic, are strong. -- Attractions are driven by school enrollment (perhaps related to after-school activities) and various employment totals. -- Higher values of the parking logsum lead to more desirable destination owing to enhanced ease of parking. +The non-work auto trip model looks similar to work, but the inclusion of parking +logsums captures a major difference between the two trip types: work trips are +not impacted by parking variables while non-work trips are. ### NHB Transit -This segment corresponds to trips that are part of a non-work tour, and do not -have either end at home. This segment would have been rolled into the HBO -(Home-Based Other) trip purpose in traditional models. +All NHB transit trips are handled with a single destination choice model. #### Estimated coefficients and t statistics @@ -601,18 +670,14 @@ knitr::include_graphics("img/dc/nhb_transit1.png") knitr::include_graphics("img/dc/nhb_transit2.png") ``` -#### Notes - -- Both in-vehicle time and the number of transfers are significant predictors in the attractiveness of destinations accessible by public transit modes. -- Transit accessibility positively impacts destination choice. -- Attractions are driven by office, retail and service employment totals. +Both the total time of the trip and the number of transfers are significant +predictors in the attractiveness of destinations accessible by public transit +modes. Transit accessibility of the destination also positively impacts +destination choice, which matches expectations. ### NHB NonMotorized -This segment corresponds to trips that are part of a non-work tour, do not have -either end at home, and are associated with a non-motorized (e.g. bike, walk) -mode. This segment would have been rolled into the HBO (Home-Based Other) trip -purpose in traditional models. +All NHB non-motorized trips are handled with a single destination choice model. #### Estimated coefficients and t statistics @@ -624,31 +689,65 @@ knitr::include_graphics("img/dc/nhb_nonmoto1.png") knitr::include_graphics("img/dc/nhb_nonmoto2.png") ``` -#### Notes +As expected, walk distance poses a highly significant deterrent to destinations +that are farther away from the origin. An extra penalty is imposed on +destinations that are more than a mile away. This captures the sharp drop off in +trip lengths beyond one mile in the survey. -- As expected, walk distance poses a highly significant deterrent to destinations that are farther away from the origin. A high penalty is imposed on destinations that are more than a mile away. -- Intra-zonal and intra-cluster effects are also strong, favoring short-range trips by non-motorized modes. -- Walk accessibility is key to choosing destinations. -- Attractions are driven by office, retail and service employment totals. +Intra-zonal and intra-cluster effects are strong, which also reflects the +short-range nature of non-motorized trips. Finally, the walk accessibility of +the destination is important for zero vehicle households. This captures an +important reality: these households cannot use autos to make home-based trips. ## DC Model Adjustments -During the model calibration stage, further updates to the DC models were performed to match the model cluster to cluster patterns to the weighted patterns from the survey. These adjustments can be thought of as the adjustment to the ASCs of a mode choice model to match aggregated shares. - -Given the nested destination choice approach and the assumption behind the structure, the model lends itself very well to such post process adjustments. The following parameters were adjusted to try and match the survey patterns. It is worth mentioning that these post process adjustments turned out to be rather small, thereby inspiring additional confidence in the estimation results: - -- Cluster ASC values: For a given purpose, the percent of trips attracted to each cluster can be determined (both from the model and the survey). The cluster ASC values can be adjusted to match the aggregated survey percentages. This additional adjustment for each cluster is the natural logarithm of the ratio of the target percentage to the model percentage. -- Intra-Cluster Constants: The model has provisions for intra cluster dummies. These dummies can likewise be adjusted to match the normalized intra-cluster totals from the survey. -- Intra-Zonal Dummy: Finally, the specific intra zonal dummy was adjusted for a few purposes to match the intra-zonal percentages. +During the model calibration stage, further updates to the DC models were +performed to match the model cluster to cluster patterns to the weighted +patterns from the survey. These adjustments can be thought of as the adjustment +to the ASCs of a mode choice model to match aggregated shares. + +Given the nested destination choice approach and the assumption behind the +structure, the model lends itself very well to such post process adjustments. +The following parameters were adjusted to try and match the survey patterns. It +is worth mentioning that these post process adjustments turned out to be rather +small, thereby inspiring additional confidence in the estimation results: + +- **Cluster ASC values**: For a given purpose, the percent of trips attracted to +each cluster can be determined (both from the model and the survey). The cluster +ASC values can be adjusted to match the aggregated survey percentages. This +additional adjustment for each cluster is the natural logarithm of the ratio of +the target percentage to the model percentage. +- **Intra-Cluster Constants**: The model has provisions for intra cluster +dummies. These dummies can likewise be adjusted to match the normalized +intra-cluster totals from the survey. +- **Intra-Zonal Dummy**: Finally, the specific intra zonal dummy was adjusted +for a few purposes to match the intra-zonal percentages. + +While this model form has additional levers to control behavior, the adjustments +made were small and do not meaningfully reduce model sensitivity. ## DC Application -This section has a short note on the application of the destination choice models in the TRM model framework. In the TRM model, the destination choice is the higher-level choice which is then followed by the mode choice for a given purpose, market segment and period. However, the utility equation for the DC models contains mode choice logsums. Therefore, a three-step process is followed for each combination of purpose, segment and time period: - -- The mode choice model is run for the given purpose, segment and time period, taking care to use the appropriate time period skim matrices. The output logsum matrix (by mode, segment and period) and the output mode choice probability matrix are stored. -- The DC nested choice is run, which produces a zone-zone probability matrix, such that for each row (zone), the destination probabilities sum to 1. The determination of this probability matrix itself is the result of the nested DC, implemented as a two-stage destination choice model with a zone level model and cluster level model informed by zone-zone DC logsums. -- The trip productions are applied to each row to produce a PA matrix, which are then further multiplied by the stored mode choice probabilities to produce PA matrices by mode, purpose, segment and period. -The matrices are then combined as required. +This section is a short note on the application of the destination choice +models in the TRM model framework. In the TRM model, the destination choice is +the higher-level choice which is then followed by the mode choice for a given +purpose, market segment and period. However, the utility equation for the DC +models contains mode choice logsums. Therefore, a three-step process is followed +for each combination of purpose, segment and time period: + +- The mode choice model is run for the given purpose, segment and time period, +taking care to use the appropriate time period skim matrices. The output logsum +matrix (by mode, segment and period) and the output mode choice probability +matrix are stored. +- The DC nested choice is run, which produces a zone-zone probability matrix, +such that for each row (zone), the destination probabilities sum to 1. The +determination of this probability matrix itself is the result of the nested DC, +implemented as a two-stage destination choice model with a zone level model and +cluster level model informed by zone-zone DC logsums. +- The trip productions are applied to each row to produce a PA matrix, which are +then further multiplied by the stored mode choice probabilities to produce PA +matrices by mode, purpose, segment and period. The matrices are then combined as +required. ## References diff --git a/docs/resident_dest_choice.html b/docs/resident_dest_choice.html index 48cbc19c..023f0471 100644 --- a/docs/resident_dest_choice.html +++ b/docs/resident_dest_choice.html @@ -190,6 +190,9 @@
  • Destination Choice
  • +
  • + Mode Choice +
  • Non-homebased
  • @@ -244,58 +247,50 @@

    Resident Destination Choice

  • Distance terms
  • Home-Based Trips
  • Non-Home-Based trips
  • DC Model Adjustments
  • @@ -322,7 +317,7 @@

    Intro

  • Destinations are predicted at the zone (TAZ) level.
  • Based on an analysis of the region, the zones were logically grouped into the following clusters:
  • -

    +

    • Nested Logit (NL) models were estimated for the home-based trip purposes while multinomial logit (MNL) models were estimated for non-home-based trip purposes. NL models were attempted for the latter set of purposes but did not provide justifiably better models.
    • All models were estimated using Larch (Newman, 2021).
    • @@ -359,26 +354,32 @@

      Home-Based Trips

      Utility Specification

      The utility specification for the DC models consisted of the following set of variables. Not all these variables are present in every specification since variables are retained depending on the significance of their estimated coefficients or owing to strong apriori assumptions.

      +
      +

      Size term

      +

      This is the natural logarithm of the attractions at a destination zone, which itself is a linear combination of employment variables and their coefficients. These coefficients are simultaneously estimated with other parameters. Since it is required that these coefficients be positive, the utility formulation specifies coefficients that are then exponentiated. Successful estimation requires that one of these attraction variable coefficients is fixed. The estimation also provides a coefficient for the size variable, which theoretically should be between 0 and 1.

      +
      +
      +

      Mode Choice Logsums

      +

      Using mode choice root logsums typically presents a problem in destination choice. The root logsum value is dominated by the auto nest which washes out effects from non-dominant modes. For example, drastically improving the transit accessibility to a particular zone will not affect the probability of choosing that zone if the transit logsums are dwarfed by the auto logsums. To circumvent this issue, separate auto, transit, and non-household auto logsums from the respective nests are used in the TRM specifications. The logsums are segmented by vehicle sufficiency market segments for added explanatory power. Work trip logsums are further distinguished by high- and low-income segments. Finally, note that there is no auto logsum component for the zero-vehicle households market segment.

      +
      +
      +

      Others

        -
      • Size term: This is the natural logarithm of the attractions at a destination zone, which itself is a linear combination of employment variables and their coefficients. These coefficients are simultaneously estimated with other parameters. Since it is required that these coefficients be positive, the utility formulation specifies coefficients that are then exponentiated. Successful estimation requires that one of these attraction variable coefficients is fixed. The estimation also provides a coefficient for the size variable, which theoretically should be between 0 and 1.
      • -
      • Mode Choice Logsums: Using mode choice root logsums typically presents a problem in destination choice. The root logsum value is dominated by the auto nest which washes out any effects from the non-dominant modes. For example, drastically improving the transit accessibility to a particular zone will not affect the probability of choosing that zone, if the transit logsums are dwarfed by the auto logsums. To circumvent this issue, separate auto, transit, and non-household auto logsums from the respective nests are used in the TRM specifications. The logsums are segmented by vehicle sufficiency market segments for added explanatory power. The work logsums are further distinguished by high- and low-income segments. Finally, note that there is no auto logsum component for the zero-vehicle households market segment.
      • -
      • 4D measures: Several 4D measures such as transit accessibility, hospital accessibility, walk accessibility, employment densities are considered.
      • -
      • Time Coefficient(s): The auto times corresponding to the latest model skims are attached depending upon the time of day the respondent made the trip. A time coefficient is estimated in each specification.
      • -
      • Intra-Cluster effects: Intra-Cluster coefficients are estimated if the home and the chosen zone are within the same cluster (nest).
      • -
      • Intra-Zonal effect: A coefficient is estimated to boost the utility of choosing the same zone as the origin.
      • -
      • Cluster Nest coefficients: A nest coefficient for each of the 12 clusters is estimated.
      • -
      • Cluster ASCs: Likewise, ASCs are estimated for each cluster.
      • +
      • 4D measures: Several 4D measures such as transit accessibility, hospital accessibility, walk accessibility, employment densities are considered.
      • +
      • Time Coefficient(s): The auto times corresponding to the latest model skims are attached depending upon the time of day the respondent made the trip. A time coefficient is estimated in each specification.
      • +
      • Intra-Cluster effects: Intra-Cluster coefficients are estimated if the home and the chosen zone are within the same cluster (nest).
      • +
      • Intra-Zonal effect: A coefficient is estimated to boost the utility of choosing the same zone as the origin.
      • +
      • Cluster Nest coefficients: A nest coefficient for each of the 12 clusters is estimated.
      • +
      • Cluster ASCs: Likewise, ASCs are estimated for each cluster.

      The rich utility specification allows the model to capture the decision-making process of choosing destination zones and capturing cluster-to-cluster flows.

      +

      General Estimation Observations

      -
        -
      • ASCs were estimated for all but one of the clusters. Only the significant ASCs were kept for each model purpose. The ASCs with poor significance generally had values close to zero. Further, these ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior.
      • -
      • Intrazonal effects are strong for a few of the purposes
      • -
      • Intra-Cluster effects are strong implying that there is a tendency to choose locations that belong in the same cluster as the home zone.
      • -
      • Using mode choice logsums for the auto model (and for vehicle sufficient segments) caused unrealistic signs as well as much poorer model fit. This can be explained by the fact that these logsums are primarily driven by auto skims, which are already a part of the utility equation. Hence most specifications do not have an auto logsum.
      • -
      +

      ASCs were estimated for all but one of the clusters, but only the significant ASCs were kept for each model purpose. The ASCs with poor significance generally had values close to zero. Further, these ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior.

      +

      Intrazonal effects were strong for a few of the purposes, but intra-cluster effects were very strong implying that there is a tendency to choose locations that belong in the same cluster as the home zone.

      +

      Mode choice logsums for the auto model (and for vehicle sufficient segments) caused unrealistic signs as well as much poorer model fit. This can be explained by the fact that these logsums are primarily driven by auto skims, which are already a part of the utility equation. Hence most specifications do not have an auto logsum.

      W_HB_W

      @@ -387,23 +388,16 @@

      W_HB_W

      Estimated coefficients and t statistics

      -
      -
      -

      Notes

      -
        -
      • ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior.
      • -
      • Intrazonal effects are strong.
      • -
      • Home cluster effects are also strong, with residents of Cary, Durham, Chapel Hill, Garner often living and working in the same cluster. Similar effects are also true for the peripheral clusters toward the northeast and south.
      • -
      • Mode choice logsums are significant for workers from zero-auto households, capturing a coupling with access to public transit modes.
      • -
      • Using mode choice logsums for the other auto sufficiency markets (with or without the presence of congested time) caused unrealistic signs as well as much poorer model fit. This can be explained by the fact that these logsums are primarily driven by auto skims, which are already a part of the utility equation.
      • -
      • Hospitals being a major source of employment in the region, accessibility to hospital jobs helps drive work destination attractiveness. Transit and walk accessibilities are also drivers of a zone’s attractiveness for work tours.
      • -
      • The cluster nest coefficients are mostly significantly different from 1, so the nested structure is justified. Research Triangle Park is a special cluster (very low residential land use, very high technology employment, etc.) and has an MNL-type sub-nest.
      • -
      • The size variables are stratified by income. The percentage of low and high paying jobs in each zone was used to split the total attraction into income-specific attractions.
      • -
      +

      ASCs were estimated for all but one of the clusters. Several had values close to zero with poor significance and were dropped. The ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables and is sensitive to changes in model inputs.

      +

      Intrazonal and home cluster effects are strong. Residents of Cary, Durham, Chapel Hill, Garner often live and work in the same cluster. Similar effects are also true for the peripheral clusters toward the northeast and south.

      +

      Root mode choice logsums are significant for workers from zero-auto households, capturing a coupling with access to public transit modes. Using root mode choice logsums for the other auto sufficiency markets caused unrealistic signs as well as much poorer model fit. This can be explained by the fact that these logsums are primarily driven by auto skims, which are already a part of the utility equation (the “Time” term).

      +

      Hospitals are major sources of employment in the region, and accessibility to hospital jobs helps drive work destination attractiveness. Transit and walk accessibilities are also drivers of a zone’s attractiveness for work tours.

      +

      Most of the cluster nest coefficients are significantly different from 1, which means the nested structure is justified. Research Triangle Park is a special cluster (very low residential land use, very high technology employment, etc.). It’s coefficient is effectively 1, which implies an MNL-type sub-nest.

      +

      The size variables are stratified by income using the percentage of low and high paying jobs in each zone. This is how the model pairs high-income workers (e.g. in the Regency area of Cary) with high-paying jobs in RTP. This stratification is critical for accurate work flows in region.

      Double constraint and attraction model

      -

      All trip types other than work are singly-constrained. This means that the row sums of the resulting trip table will match starting productions, but column sums will not necessarily be proportional to the amount of employment in each zone. As an example, for two zones with the same employment, the more accessible zone will attract more trips.

      +

      All trip types (other than work) are singly-constrained. This means that the row sums of the resulting trip table will match starting productions, but column sums will not necessarily be proportional to the amount of employment in each zone. As an example, for two zones with the same employment, the more accessible zone will attract more trips.

      For work trips, a traditional assumption to make is that each job of the same type must attract the same work trips - even if the zone is in a remote location. The justification for this assumption has weakened in recent years with the rise in telecommuting, flex schedules, and other changes, but the TRMG2 model implements double constraint for work trips. In the context of destination choice models, this is achieved by assigning each zone a “shadow price”. This extra term in the utility equation is adjusted in an iterative fashion to match predicted attractions.

      These predicted attractions come from a regression model estimated from the survey data.

      @@ -531,46 +525,33 @@

      Double constraint and attraction model

      -

      During model application, predicted attractions are always scaled to match predicted productions to match double constraint. As a consequence, the attraction model should predict total attractions that is close to predicted productions. This is referred to as PA balance (“production/attraction balance”).

      -

      The coefficients in the table above were multiplied by 2.45 during model calibration to achieve the appropriate PA balance of 1.0. If the ratio is too high or too low, adding additional employment to a zone will have unexpectedly high or low impact on trips attracted (given the scaling that must be done to attractions).

      +

      During model application, predicted attractions are always scaled to match predicted productions before applying double constraint. As a consequence, the attraction model should predict total attractions that is close to predicted productions. This is referred to as PA balance (“production/attraction balance”). If it does not, adding employment to a zone will have unexpectedly high (or low) impacts on total zonal attractions.

      +

      The coefficients in the table above were multiplied by 2.45 during model calibration to achieve the appropriate PA balance of 1.0.

    W_HB_O

    -

    W_HB_O corresponds to trips that are part of a work tour, have one end at home, and the other end at a non-work location. This segment captures trips that directly connect the home to an intermediate stop on the way to or from work and translates to the HBO trip purpose in traditional models. Destination choice in this context relates to a short-term decision of intermediate stop location.

    +

    W_HB_O trips are part of a work tour, have one end at home, and the other end at a non-work location. This segment captures trips that directly connect the home to an intermediate stop on the way to or from work and is part of the HBO trip purpose in traditional models. Destination choice in this context relates to a short-term decision of intermediate stop location.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior.
    • -
    • Intrazonal effects are strong.
    • -
    • Travel time is more important up to the first 30 minutes. Excess time beyond this threshold is perceived as being less onerous.
    • -
    • Home cluster effects are also strong, with residents of Cary, Garner, Raleigh, Southern Durham and the southeastern periphery often stopping in the same cluster for their intermediate stops.
    • -
    • The public transit mode choice logsum is significant.
    • -
    • The number of households is a major factor in the calculation of the size variable, reflecting the inclusion of personal visits in this trip purpose. Logically, the retail and service employment also help drive trips to specific destinations. Income-based stratifications of employment did not make a difference.
    • -
    • The cluster nest coefficients are mostly significantly different from 1, so the nested structure is justified. Research Triangle Park is a special cluster (very low residential land use, very high technology employment, etc.) and has a nearly MNL-type sub-nest.
    • -
    +

    ASCs were estimated for all but one of the clusters, but only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables and is sensitive to changes in model inputs.

    +

    The model has a sophisticated approach to travel time. Travel time is more important up to the first 30 minutes. Excess time beyond this threshold is gets a slight discount. This reflects a difference in the behavior for these longer trips, which are dominated more by what is at the destination rather than the time to get there.

    +

    Home cluster effects are strong for residents of Cary, Garner, Raleigh, Southern Durham and the southeastern periphery. People in these clusters often stop in the same cluster for their intermediate activities.

    +

    The public transit mode choice logsum is significant for certain segments like zero-vehicle households. Non-household auto logsums (primarily taxis and TNCs) are also significant.

    +

    The number of households is a major factor in the calculation of the size variable, reflecting the inclusion of personal visits in this trip purpose. Logically, the retail and service employment also help drive trips to specific destinations. Income-based stratification of employment did not make a difference in model performance.

    +

    Most cluster nest coefficients are significantly different from 1, so the nested structure is justified. As with W_HB_W, RTP is an exception to this.

    W_HB_EK12

    -

    W_HB_EK12 corresponds to trips that are part of a work tour, have one end at home, and the other end at a school. This segment captures trips that directly connect the home to school on the way to or from work and translates to the HB School trip purpose in traditional models. It includes worker dropping off/picking up children on the way to/from work, and workers who attend school.

    +

    W_HB_EK12 corresponds to trips that are part of a work tour, have one end at home, and the other end at a school. This segment captures trips that directly connect the home to school on the way to or from work and translates to the HB School trip purpose in traditional models. It includes workers dropping off/picking up children on the way to/from work, and young workers who attend school.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • School location choice is driven by travel time.
    • -
    • Attractions are driven by school enrollment.
    • -
    • Some of the cluster nest coefficients are significantly different from 1, so the nested structure is justified.
    • -
    +

    School location choice is driven by travel time and school enrollment. Some of the cluster nest coefficients are significantly different from 1, so the nested structure is justified for those clusters. The remaining zones are treated as top-level choices (MNL).

    @@ -580,18 +561,8 @@

    N_HB_OME

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior.
    • -
    • Intrazonal effects are strong.
    • -
    • Travel time is more important up to the first 30 minutes. Excess time beyond this threshold is perceived as being less onerous.
    • -
    • Auto and transit logsums (by the relevant auto sufficiency category) play a role in the perception of zones’ attractiveness.
    • -
    • Home cluster effects are strong, with residents often choosing to shop in the same cluster as their residence.
    • -
    • Attractions are driven by office, retail and service employment, which is expected for a trip purpose that involves shopping.
    • -
    -

    The cluster nest coefficients are mostly significantly different from 1, so the nested structure is justified.

    +

    As with the previous models, cluster-based ASCs are small and only applied to a few clusters, which implies a model that is appropriately sensitive. Travel time is again more important up to the first 30 minutes. Excess time beyond this threshold is perceived as being less onerous.

    +

    Attractions are driven by retail and service employment, which is appropriate for shopping/maintenance trips.

    @@ -601,19 +572,9 @@

    N_HB_OMED

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior.
    • -
    • Intrazonal effects are strong.
    • -
    • Travel time is more important up to the first 30 minutes. Excess time beyond this threshold is perceived as being less onerous.
    • -
    • Auto logsums (by the relevant auto sufficiency category) play a role in the perception of zones’ attractiveness.
    • -
    • Accessibility to hospitals plays a positive role in boosting a zone’s attractiveness, which is apt for this trip purpose.
    • -
    • Home cluster effects are also strong, particularly with residents of Garner, Raleigh, Southern Durham and the north-eastern.
    • -
    • Attractions are driven by retail and service employment, with an added boost from service employment related to the presence of hospitals.
    • -
    -

    Most of the cluster nest coefficients are significantly different from 1, so the nested structure is justified.

    +

    A zone’s accessibility to a hospital is a strong predictor in this model. This captures the important clustering of doctor’s offices, clinics, and pharmacies around major hospitals.

    +

    Travel time above 30 minutes has almost no impact on desintation choice. This is an encouraging result. People traveling long distances to see a specialist are not impacted by travel time in a major way.

    +

    Attractions are driven by retail and service employment, with an added boost from service employment related to the presence of hospitals.

    @@ -623,60 +584,34 @@

    N_HB_K12

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • ASCs were estimated for all but one of the clusters. Several of them had values close to zero with poor significance. These were dropped and only the significant ones were retained. These ASCs are very small in magnitude, confirming that the model’s explanatory power is derived from other variables with the ability to capture location choice behavior.
    • -
    • Intra-zonal and intra-cluster effects are strong.
    • -
    • Attractions are driven by school enrollment. Some of the cluster nest coefficients are significantly different from 1, so the nested structure is justified. Others default to an MNL-like behavior.
    • -
    +

    This model has a strong fit and is primarily based on K12 enrollment and travel time (as expected). Additionally, the negative coefficient on the Durham and Northeast clusters means that (all else equal) zones in that cluster are less likely to be chosen. The positive intrazonal coefficient is expected given the tendency for people to be assigned to schools near their home.

    N_HB_OD_Long

    -

    N_HB_ODLong corresponds to trips that are not part of a work tour, have one end at home, and the other end at an “other” activity that is at least 30 minutes long. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    +

    N_HB_ODLong corresponds to trips that are not part of a work tour, have one end at home, and the other end at an “other” activity that is at least 30 minutes long. Visiting a friend is one example of this trip type. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • Longer drive times penalize a destination’s appeal, though times over 30 minutes are not perceived to be as onerous.
    • -
    • Hospital accessibility impacts destination attractiveness in a positive manner, likely accounting for longer medical appointments.
    • -
    • Walk access is viewed positively, especially in h
    • -
    • Most home cluster effects are also strong.
    • -
    • Attractions are driven by employment and school enrollment totals as well as the residential population, the latter due to longer social visits.
    • -
    -

    The cluster nest coefficients are all significantly different from 1, so the nested structure is justified.

    +

    The penalty on travel time above 30 minutes is reduced as seen in multiple trip trypes. The positive coefficient on walk accessibility means that these trips are more attracted to zones in area that are more dense. Home cluster effects are strong, meaning that most of these trips stay within the home cluster. Employment, school enrollment, and residential population all are significant components of the size term. This reflects the catch-all nature of this trip type.

    N_HB_OD_Short

    -

    N_HB_ODShort corresponds to trips that are not part of a work tour, have one end at home, and the other end at an “other” activity shorter than 30 minutes in duration. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    +

    N_HB_OD_Short corresponds to trips that are not part of a work tour, have one end at home, and the other end at an “other” activity shorter than 30 minutes in duration. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • Drive time attenuates the attractiveness of farther destinations, though times beyond 20 minutes are perceived as less onerous than the first 20.
    • -
    • Home cluster effects are also strong across all clusters, indicating a strong preference for proximate destinations for relatively short non-mandatory activities.
    • -
    • Additionally, intra-zonal destinations are accorded higher preference.
    • -
    • Mode choice logsums related to public transit and non-household auto modes (such as ride-hailing) play a positive role in elevating the desirability of destinations.
    • -
    • Attractions are driven by employment and school enrollment totals as well as each zone’s residential population, an indication of the propensity to participate in social visits.
    • -
    -

    All cluster nest coefficients are significantly different from 1, so the nested structure is justified.

    +

    The attenuation on for drive time above 20 minutes is not as strong as other trip types. This indicates that these trips are likely just trying to reach the nearest attraction that satisfies the trip purpose. Similarly, home cluster effects and the intrazonal term are strong meaning that people stay nearby.

    +

    Logsums for household auto, non-household auto, and transit all play an important role in determining where someone travels. Similar to the N_HB_OD_Long trip, the size term is made up of employment, enrollment, and household population.

    Non-Home-Based trips

    -

    The non-home-based trip purposes are multinomial logit (MNL) specifications. Though the cluster-based nested structure of the home-based purposes was not adopted, cluster-level alternative-specific constants (ASCs) are included to help capture cluster-level preferences.

    +

    The non-home-based trip purposes are multinomial logit (MNL) specifications. The cluster-based nesting structure did not improve model results and was not adopted for these purposes. Even so, the cluster-level alternative-specific constants (ASCs) are included to help capture cluster-level preferences.

    The non-home-based trip purposes consist of:

    -

    Given the strong dependence of destination choice on the prior home-based trip’s travel mode for non-home-based trips, separate destination choice models have been estimated by mode. For simplicity and statistical efficiency of model estimation, the above trip purposes were combined into the following four categories that combine trip purpose with travel mode:

    +

    Recall that non-home-based trips are generated by mode based on the results of the home-based models. Given this, separate destination choice models have been estimated by mode. For simplicity and statistical efficiency of model estimation, the above trip purposes were combined into the following four categories that combine trip purpose with travel mode:

    NHB Work Auto

    -

    This segment corresponds to trips that are part of a work tour, do not have either end at home, have at least one end at school, work, work-related or “other” activities, and are associated with a drive mode. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    +

    This segment corresponds to trips that are part of a work tour, do not have either end at home, and are associated with the auto mode. This segment would have been rolled into the NHBW (Non-Home-Based Work) trip purpose in traditional models.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • Drive time has the expected negative sensitivity, with times above 30 minutes penalized to a lesser extent than the first 30 minutes.
    • -
    • The generic intra-cluster effect is strong.
    • -
    • Attractions are driven by a variety of employment totals, as well as school enrollment.
    • -
    +

    Drive time has the expected negative sensitivity, with times above 30 minutes penalized to a lesser extent than the first 30 minutes. The generic intra-cluster effect is strong, which is expected given shorter trip lengths of NHB trips. Attractions are driven by a variety of employment as well as school enrollment.

    NHB NonWork Auto

    -

    This segment corresponds to trips that are part of a non-work tour, do not have either end at home, and have at least one end at school or “other” activities. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    +

    This segment corresponds to trips that are part of a non-work tour, do not have either end at home, and use the auto mode. This segment would have been rolled into the NHBO (Non-Home-Based Other) trip purpose in traditional models.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • Drive time has the expected negative sensitivity, with times above 30 minutes penalized to a lesser extent than the first 30 minutes.
    • -
    • Intra-cluster and intra-zonal effects, both generic, are strong.
    • -
    • Attractions are driven by school enrollment (perhaps related to after-school activities) and various employment totals.
    • -
    • Higher values of the parking logsum lead to more desirable destination owing to enhanced ease of parking.
    • -
    +

    The non-work auto trip model looks similar to work, but the inclusion of parking logsums captures a major difference between the two trip types: work trips are not impacted by parking variables while non-work trips are.

    NHB Transit

    -

    This segment corresponds to trips that are part of a non-work tour, and do not have either end at home. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    +

    All NHB transit trips are handled with a single destination choice model.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • Both in-vehicle time and the number of transfers are significant predictors in the attractiveness of destinations accessible by public transit modes.
    • -
    • Transit accessibility positively impacts destination choice.
    • -
    • Attractions are driven by office, retail and service employment totals.
    • -
    +

    Both the total time of the trip and the number of transfers are significant predictors in the attractiveness of destinations accessible by public transit modes. Transit accessibility of the destination also positively impacts destination choice, which matches expectations.

    NHB NonMotorized

    -

    This segment corresponds to trips that are part of a non-work tour, do not have either end at home, and are associated with a non-motorized (e.g. bike, walk) mode. This segment would have been rolled into the HBO (Home-Based Other) trip purpose in traditional models.

    +

    All NHB non-motorized trips are handled with a single destination choice model.

    Estimated coefficients and t statistics

    -
    -
    -

    Notes

    -
      -
    • As expected, walk distance poses a highly significant deterrent to destinations that are farther away from the origin. A high penalty is imposed on destinations that are more than a mile away.
    • -
    • Intra-zonal and intra-cluster effects are also strong, favoring short-range trips by non-motorized modes.
    • -
    • Walk accessibility is key to choosing destinations.
    • -
    • Attractions are driven by office, retail and service employment totals.
    • -
    +

    As expected, walk distance poses a highly significant deterrent to destinations that are farther away from the origin. An extra penalty is imposed on destinations that are more than a mile away. This captures the sharp drop off in trip lengths beyond one mile in the survey.

    +

    Intra-zonal and intra-cluster effects are strong, which also reflects the short-range nature of non-motorized trips. Finally, the walk accessibility of the destination is important for zero vehicle households. This captures an important reality: these households cannot use autos to make home-based trips.

    @@ -775,14 +681,15 @@

    DC Model Adjustments

    During the model calibration stage, further updates to the DC models were performed to match the model cluster to cluster patterns to the weighted patterns from the survey. These adjustments can be thought of as the adjustment to the ASCs of a mode choice model to match aggregated shares.

    Given the nested destination choice approach and the assumption behind the structure, the model lends itself very well to such post process adjustments. The following parameters were adjusted to try and match the survey patterns. It is worth mentioning that these post process adjustments turned out to be rather small, thereby inspiring additional confidence in the estimation results:

    +

    While this model form has additional levers to control behavior, the adjustments made were small and do not meaningfully reduce model sensitivity.

    DC Application

    -

    This section has a short note on the application of the destination choice models in the TRM model framework. In the TRM model, the destination choice is the higher-level choice which is then followed by the mode choice for a given purpose, market segment and period. However, the utility equation for the DC models contains mode choice logsums. Therefore, a three-step process is followed for each combination of purpose, segment and time period:

    +

    This section is a short note on the application of the destination choice models in the TRM model framework. In the TRM model, the destination choice is the higher-level choice which is then followed by the mode choice for a given purpose, market segment and period. However, the utility equation for the DC models contains mode choice logsums. Therefore, a three-step process is followed for each combination of purpose, segment and time period: