首页
网站开发
桌面应用
管理软件
微信开发
App开发
嵌入式软件
工具软件
数据采集与分析
其他
首页
>
> 详细
辅导program编程、R留学生程序调试、R编程讲解 调试Matlab程序|讲解Database
项目预算:
开发周期:
发布时间:
要求地区:
Transport Sociology and Psychology (LV 240834759)
Take-Home Exam for Part ‘Transport Sociology’
Department of Civil, Geo and Environmental Engineering at the Technical University Munich
Release Date: 9 February 2021
Due Date: 17 March 2021, end of day (CET)
Introduction
The following tasks shall be answered in a written report. The recommended software
package to calculate answers in Task 1 and Task 2 is ‘The R Project for Statistical
Computing’. You may, however, use other software packages that you might be more
familiar with (such as Matlab or Biogeme), or you write your own code in any language
of your choice. While there is no word limit, answers should be rather short with one to
two paragraphs per question.
Submit your report as a PDF document. Please add your name and matriculation
number (03xxxxxx) at the beginning of your report and specify which software was
used to answer Tasks 1 and 2. Do not provide the script or code you wrote. Once you
are done, upload your PDF report to Moodle.
You are allowed to work in teams to solve these tasks. Note, however, that each
student needs to submit an individual solution. The final estimation results are likely
to be different for every student, as there are thousands of right answers for many
tasks. Provide your own solution. Also, all text needs to be written in your own words,
using copy-and-paste will result in failing the exam and a report to the examination
board. To acknowledge these rules at TUM, you were asked to sign the “Pledge
against Plagiarism” that you find on Moodle. Some of you have handed in a signed
copy already. If you have not done so already (or if you are unsure whether you already
did), please sign this document an upload it with your take-home exam report.
Task 1 [35 points]
You have been provided with a household travel survey that provides information on
household characteristics and the number of trips reported. The Excel file
(householdTravelSurvey.xlsx) provides a description of the available variables. The
CSV file contains the same data and was provided to be read in R. In this task, you
shall identify the most important socio-demographic attributes that explain the number
of auto trips.
a) Read the data with R (or your language of choice). To understand the range of
data, provide min, max and mean values for each variable of this dataset in your
report. [2 points]
b) Create a histogram for number of auto trips and a boxplot for income. Copy the
two graphics into your report. Describe the two graphics in two to three
sentences in your report. [2 points]
c) Estimate a multiple regression, where you try to explain the number of trips by
car with all other socio-demographic attributes available in this survey. Provide
the estimation results in the report*
. [6 points]
Describe the estimation results in the report:
2
× Which independent variables are statistically significant with a
confidence level of at least 90%?
× Are estimated coefficients (called ‘Estimate’ in R) reasonable? Or did you
find coefficients that do not make sense to you? Name coefficients that
seem unlikely and explain why you think they don’t seem right from a
theoretical point of view.
d) A possible reason for unreasonable coefficients is multicollinearity. Use R (or
your preferred software) to plot the correlation between all variables. Add this
plot to your report and identify the three pairs of independent variables that are
most correlated. [5 points]
e) Create another multiple regression with auto trips as the dependent variable.
This time, select independent variables that lead to an estimate where all
coefficients:
× are statistically significant (here defined as 90% confidence or more),
× have signs (+ or –) that make sense to you, and
× no two independent variables correlate with more than |R| = 0.6
This will require some trial and error. Provide and briefly describe the final
estimate in your report*
. Explain for each independent variable in your final
estimation why it makes sense to you (i.e., explain why every + and – sign is
reasonable). [10 points]
f) Attempt to improve the estimation result further by removing the intercept [-1],
by using a quadratic transformation [I(variable^2)], by using a logarithmic
transformation [log(variable)] and by testing interactions [variableA*variableB]
for selected independent variables. This may require to drop additional
independent variables to ensure that all estimated coefficients are statistically
significant. The same three rules listed under the bullet points of subtask (e)
shall apply. In your report, provide the final estimate* that provides the best
model fit that you can find. [10 points]
Task 2 [20 points]
You were provided with another dataset on mode choice
for long-distance travel (file modeChoiceData.csv, see
xlsx file for definition of variables). The survey data was
collected for long-distance trips between Sydney,
Canberra and Melbourne in Australia. Travelers had the
choice between auto, bus, train and air.
[Data Source: Greene, W.H. and D. Hensher: Multinomial logit and discrete choice models. In Greene,
W. H. (1997) LIMDEP version 7.0 user’s manual revised. Plainview, New York. Note that data were
modified for this exam.]
a) Read the data in R (or the software of your choice). To understand the range of
data, provide min, max and mean values of in-vehicle travel times for each mode
in your report. [2 points]
b) Estimate of a multinomial logit model, where mode is the dependent variable
and all other variables serve as independent variables. Provide the estimation
result in your report† and briefly describe whether these estimates make sense
to you (refer to statistical significance and describe whether + and – signs are
reasonable). [8 points]
c) The estimation under (2b) provides for WaitTime, InVehCosts, InVehTime and
GenCosts one coefficient each across all modes. Modify your estimation to
Auto Bus Train Air
Trip
3
provide mode-specific InVehTime (i.e., estimate a different coefficient for
InVehTime for every mode). Provide the estimation result in your report† and
briefly assess how using coefficients by mode has improved this estimation
(provide two reasons why estimation (2c) is better than estimation (2b)).
[4 points]
c) Try to further improve the estimation result from task (2b)
• by removing the intercept [-1] or
• by raising a variable to the power of two [I(variable^2)] for selected
independent variables or
• by using a logarithmic transformation [log(variable)] for selected
independent variables or
• by estimating mode-specific coefficients for InVehCosts, InVehTime or
GenCosts.
To ensure that all variables are statistically significant, you may have to drop
some independent variables. In your report, provide the final estimate† that
provides the best model fit that you can find. Make sure that your best model
estimation only includes independent variable that (i) have the expected sign
[+ or –] and (ii) have a 90% significance level or more. This will require some
trial and error. It is ok to include constants that do not reach this significance
level. [6 points]
Task 3 [15 points]
In task 2, you were asked to estimate a multinomial logit
model. Here, we explore a nested logit model instead.
a) Describe the reasons why nested mode choice
models sometimes work better than multinomial logit
models. There is no need to estimate a model. A
written description of the potential benefits of nested
logit models is sufficient. [7 points]
b) Create a nesting structure for the modes conventional
car, autonomous car, tolled road, non-tolled road,
walk, bike, e-bike, e-scooter, bus, tram and commuter rail. Use as many nesting
layers as make sense to you. Draw a nesting diagram (as shown in the diagram
above), label the boxes with modes and provide it in your report. Explain in one
paragraph your chosen nesting structure. There are many different solutions
that are plausible. While your nesting structure will not be evaluated, your
reasoning for your chosen nesting structure will be evaluated. [8 points]
Task 4 [20 points]
Task 1 explored multiple regression and Tasks 2 and 3 discrete choice models. In this
Task 4, we look at the differences between the two.
a) We apply multiple regression and discrete choice models for different problem
sets. Explain when to use which one. [4 points]
b) Could you have solved Task 1 with a discrete choice model? Why? [8 points]
c) Could you have solved Task 2 with a multiple regression? Why? [8 points]
Trip
4
Task 5 [10 points]
To explore travel behavior, both household travel surveys (e.g., MiD in Germany) and
panel surveys (e.g., MOP in Germany) have been conducted.
a) Explain the difference between a household travel survey and a panel survey in
terms of selection of participants and common sample sizes. [4 points]
b) For each of the following questions, select a survey (MiD or MOP) that is likely
to be most useful. Explain your choices in two or three sentences [6 points]
× Explain mode choice behavior for shopping trips of high-income
households with 0 workers and 0 cars.
× Explore if people who travel less on weekdays travel more on weekends.
× Explain how household relocation to the suburbs has affected the
likelihood to buy a car.
I appreciate any feedback you like to give on clarity, length and difficulty of this exam.
Also, it would be helpful if you could give an estimation of number of hours it took you
to complete this exam. Your answer is optional and will not affect your grade. Thanks!
* Please provide your estimation results including at least: Variable names, estimated
coefficients, statistical significance of each variable and R2 of the estimate.
† Please provide your estimation results including at least: Variable names, estimated
coefficients, statistical significance of each variable, as well as log-likelihood and R2 of
the estimate.
软件开发、广告设计客服
QQ:99515681
邮箱:99515681@qq.com
工作时间:8:00-23:00
微信:codinghelp
热点项目
更多
代写infosys 110 digital syst...
2024-12-28
代写fbe 506 quantitative met...
2024-12-28
代做part i: (crazy eddie htm...
2024-12-28
代写infosys 110 digital syst...
2024-12-28
代做stats 769 statistics sec...
2024-12-28
代写ece3700j introduction to...
2024-12-28
代做tcm2301 biochemistry代做...
2024-12-28
代做ece5550: applied kalman ...
2024-12-28
代写mth205 introduction to s...
2024-12-28
代写scicomp project 3 week 4...
2024-12-28
代做business operations anal...
2024-12-28
代写mth205 introduction to s...
2024-12-28
代写socs0100 computational t...
2024-12-28
热点标签
mktg2509
csci 2600
38170
lng302
csse3010
phas3226
77938
arch1162
engn4536/engn6536
acx5903
comp151101
phl245
cse12
comp9312
stat3016/6016
phas0038
comp2140
6qqmb312
xjco3011
rest0005
ematm0051
5qqmn219
lubs5062m
eee8155
cege0100
eap033
artd1109
mat246
etc3430
ecmm462
mis102
inft6800
ddes9903
comp6521
comp9517
comp3331/9331
comp4337
comp6008
comp9414
bu.231.790.81
man00150m
csb352h
math1041
eengm4100
isys1002
08
6057cem
mktg3504
mthm036
mtrx1701
mth3241
eeee3086
cmp-7038b
cmp-7000a
ints4010
econ2151
infs5710
fins5516
fin3309
fins5510
gsoe9340
math2007
math2036
soee5010
mark3088
infs3605
elec9714
comp2271
ma214
comp2211
infs3604
600426
sit254
acct3091
bbt405
msin0116
com107/com113
mark5826
sit120
comp9021
eco2101
eeen40700
cs253
ece3114
ecmm447
chns3000
math377
itd102
comp9444
comp(2041|9044)
econ0060
econ7230
mgt001371
ecs-323
cs6250
mgdi60012
mdia2012
comm221001
comm5000
ma1008
engl642
econ241
com333
math367
mis201
nbs-7041x
meek16104
econ2003
comm1190
mbas902
comp-1027
dpst1091
comp7315
eppd1033
m06
ee3025
msci231
bb113/bbs1063
fc709
comp3425
comp9417
econ42915
cb9101
math1102e
chme0017
fc307
mkt60104
5522usst
litr1-uc6201.200
ee1102
cosc2803
math39512
omp9727
int2067/int5051
bsb151
mgt253
fc021
babs2202
mis2002s
phya21
18-213
cege0012
mdia1002
math38032
mech5125
07
cisc102
mgx3110
cs240
11175
fin3020s
eco3420
ictten622
comp9727
cpt111
de114102d
mgm320h5s
bafi1019
math21112
efim20036
mn-3503
fins5568
110.807
bcpm000028
info6030
bma0092
bcpm0054
math20212
ce335
cs365
cenv6141
ftec5580
math2010
ec3450
comm1170
ecmt1010
csci-ua.0480-003
econ12-200
ib3960
ectb60h3f
cs247—assignment
tk3163
ics3u
ib3j80
comp20008
comp9334
eppd1063
acct2343
cct109
isys1055/3412
math350-real
math2014
eec180
stat141b
econ2101
msinm014/msing014/msing014b
fit2004
comp643
bu1002
cm2030
联系我们
- QQ: 9951568
© 2021
www.rj363.com
软件定制开发网!