ENVS363/563.3 - A Computational Essay 2023/24
Overview and Instructions
Due Date: 8th January 2024
50% of the final mark
Overview
Here’s the premise. You will take the role of a real-world GIS analyst or spatial data scientist tasked
to explore datasets on the San Francisco Bay Area (often just called the Bay Area) and find useful
insights for a variety of city decision-makers. It does not matter if you have never been to the Bay
Area. In fact, this will help you focus on what you can learn about the city through the data, without
the influence of prior knowledge. Furthermore, the assessment will not be marked based on how
much you know about the San Francisco Bay Area but instead about how much you can show you
have learned through analysing data. You will need to contextualise your project by highlighting the
opportunities and limitations of ‘old’ and ‘new’ forms of spatial data and reference relevant
literature.
Format
A computational essay using Quarto. The assignment should be carried out fully in Quarto.
What is a Computational Essay?
A computational essay is an essay whose narrative is supported by code and computational results
that are included in the essay itself. This piece of assessment is equivalent to 4,000 words.
However, this is the overall weight. Since you will need to create not only narrative but also code
and figures, here are the requirements:
• Maximum of 1,000 words (ordinary text) (references do not contribute to the word
count). You should answer the specified questions within the narrative. The questions
should be included within a wider analysis.
• Up to five maps or figures (a figure may include more than one map and will only count
as one but needs to be integrated in the same overall output)
• Up to one table
There are three kinds of elements in a computational essay.
1. Ordinary text (in English)
2. Computer input (R code)
3. Computer output
These three elements all work together to express what’s being communicated.
Submission
You must submit 1 electronic copy of your assessment via Canvas by the published
deadline. The format of the file must be an HTML document. Please do not include your
name anywhere in the documents.
• Please refer to the ENVS363/563 Assessment criteria. This document includes the parts
you should include in your Computational Essay.
Data
The assignment relies on datasets and has two parts. Each dataset is explained in more detail
below.
ENVS363-563 Computational Essay
• Data made available on Murray Cox’s website as part of his “Inside Airbnb” project which
you can download (http://insideairbnb.com/). The website periodically publishes
snapshots of Airbnb listings around the world. You should download the San Francisco
data, the San Mateo data and the Oakland data. These are all part of the Bay Area.
Please note that, for best results, you will need to drop some of the outliers.
• Socio-economic variables for the Bay Area. Source: American Community Survey (ACS)
2016-2020, US Census Bureau. Observations: 1039; Variables: 472; Years: 2016-2020.
o A subset of variables from the latest ACS has already been retrieved for you in
ACS_2016_2020_vars.csv. However, you have access to ALL variables in the
American Community Survey (ACS) 2016-2020 through the R package
tidycensus.
o You are strongly recommended to use the Census API through the R package
tidycensus to extract your variables of interest instead of the CSV file. For more
information about the ACS (2016-2020) you can have a look at:
https://www.census.gov/data/developers/data-sets/acs-5year.html and
https://api.census.gov/data/2020/acs/acs5/variables.html.
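The note above about dropping outliers could be approached in several ways. The following is a minimal base R sketch, not a prescribed method. It assumes the listing prices arrive as text such as "$1,200.00" (the data frame, its values and the column names `price` and `price_num` are hypothetical) and uses the 1.5 × IQR rule, which is only one common convention.

```r
# Hypothetical sample of raw price strings (illustrative values, not real data).
listings <- data.frame(
  id = 1:8,
  price = c("$95.00", "$120.00", "$150.00", "$180.00",
            "$210.00", "$250.00", "$1,200.00", "$9,999.00"),
  stringsAsFactors = FALSE
)

# Strip the currency symbol and thousands separators, then convert to numeric.
listings$price_num <- as.numeric(gsub("[$,]", "", listings$price))

# Flag outliers with the 1.5 * IQR rule (an assumed convention, not a requirement).
q <- unname(quantile(listings$price_num, probs = c(0.25, 0.75), na.rm = TRUE))
iqr <- q[2] - q[1]
keep <- listings$price_num >= q[1] - 1.5 * iqr &
  listings$price_num <= q[2] + 1.5 * iqr

# Keep only the listings inside the fences.
listings_clean <- listings[keep, ]
```

Whichever rule is chosen, it is worth reporting how many listings were removed and why, since the threshold affects every later map of average price.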
If you want to visualise some aspects at different Subnational Administrative boundaries, you can
download USA boundaries from GADM. You can also find other geodata for the Bay Area in the
Berkeley Library.
IMPORTANT - Students of ENVS563 will need to source, at least, two additional datasets relating
to San Francisco or the Bay Area. You can use any dataset that will help you complete the tasks
below but, if you need some inspiration, have a look at the following:
• Geodata for the Bay Area in the Berkeley Library.
• San Francisco Open Data Portal: https://datasf.org/opendata/
• Data World: https://data.world/datasets/san-francisco
• NASA Data: https://earthdata.nasa.gov/earth-observation-data/near-real-time/hazards-and-disasters/air-quality
Part 1 – Common
1.1 Collecting and importing the data
1.1.1 Import and explore
1.2 Preparing the data
1.2.1 What CRS are you going to use? Justify your answer.
1.3 Discussion of the data
• Present and describe the data sets used for this project.
1.4 Mapping and Data visualisation
1.4.1 Airbnb in the BAY AREA at Neighbourhood Level
• Summarise the data using Bay Area zipcodes/ZCTAs obtained from the Berkeley Library.
These are slightly different from the Airbnb neighbourhood file. Obtain a count of listings by
neighbourhood.
• Map 1.1: Number of listings per zipcode. Explore the spatial distribution of the data using
choropleths. Style the layers using a colour ramp.
• Map 1.2: Average price per zipcode. Explore the spatial distribution of the data using
choropleths. Style the layers using a colour ramp.
Justify your data classification methods and visualization choices. You should include these maps
in your assessment submission. The maps should be well-presented and include a short
description.
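As a rough illustration of the summaries behind Maps 1.1 and 1.2, the base R sketch below counts listings and averages price per zipcode, then assigns quantile classes. The data frame, its values and the column names are hypothetical, and it assumes each listing has already been matched to a zipcode (for example through a spatial join). Quantile breaks are only one of several classification schemes that could be justified.

```r
# Hypothetical listings already tagged with a zipcode (illustrative values only).
listings <- data.frame(
  zipcode = c("94102", "94102", "94102", "94103", "94103", "94607"),
  price   = c(100, 150, 200, 80, 120, 90)
)

# Number of listings per zipcode.
n_listings <- aggregate(price ~ zipcode, data = listings, FUN = length)
names(n_listings)[2] <- "n_listings"

# Average price per zipcode.
mean_price <- aggregate(price ~ zipcode, data = listings, FUN = mean)
names(mean_price)[2] <- "mean_price"

# One row per zipcode with both summaries.
by_zip <- merge(n_listings, mean_price, by = "zipcode")

# Quantile classification into three classes (an assumed choice of scheme).
breaks <- quantile(by_zip$mean_price, probs = seq(0, 1, length.out = 4))
by_zip$price_class <- cut(by_zip$mean_price, breaks = breaks,
                          include.lowest = TRUE)
```

The resulting table can then be joined to the zipcode polygons before mapping. Comparing the class boundaries produced by different schemes is one way to support the justification asked for above.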
Questions to answer within your analysis: How does the Inside Airbnb data compare to other ‘new’
forms of spatial data? Discuss the potential insights and biases, as well as opportunities and
limitations of the Airbnb data.
1.4.2. Socio-economic variables from the ACS data
Select two variables from American Community Survey data. These could be but are not limited
to population density, median income, median age, unemployed, percentage of black population,
percentage of Hispanic population or education level. See the Appendix in this document for help.
If you choose to calculate population percentages, make sure you standardise the table by the
population size of each tract.
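Standardising by tract population can be sketched as follows. This is a minimal example that assumes the recoded columns `black` and `all_ppl_race` described in the Appendix; the tract identifiers, the values and the new column `pct_black` are hypothetical.

```r
# Hypothetical tract-level counts (illustrative values only).
acs <- data.frame(
  GEOID        = c("tract_a", "tract_b", "tract_c"),
  black        = c(50, 300, 0),
  all_ppl_race = c(1000, 1500, 0)
)

# Percentage of the tract population; tracts with zero population become NA
# rather than producing a division by zero.
acs$pct_black <- ifelse(acs$all_ppl_race > 0,
                        100 * acs$black / acs$all_ppl_race,
                        NA_real_)
```

Mapping the percentage rather than the raw count avoids a choropleth that mostly reflects where more people live.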
• Map 2: Explore the spatial distribution of your chosen variables using choropleths. Style the
variables using a colour ramp. Justify your data classification methods and visualization
choices. You should include these maps in your assessment submission. The maps should
be well-presented and include a short description.
Questions to answer within your analysis. Comment on the details of your map and analyse the
results. What are the main types of neighbourhoods you identify? Which characteristics help you
delineate this typology? What can you say about the spatial distribution of your socio-economic
variable of interest? If you had to use this classification to evaluate where Airbnbs would cluster,
what would your hypothesis be? Why?
For some stylised (not necessarily accurate) facts about the Bay Area see here.
1.4.3. Combining Data sets
• Map 3: Plot the natural logarithm of price (ln of price) of Airbnbs in the San Francisco Bay
Area together (point plot) with one of your chosen socio-economic variables of interest
at zipcode level using ggplot or tmap or mapsf (polygon plot). There are various ways of
doing this. The maps should be well-presented.
Questions to answer within your analysis. Comment on the details of your map and analyse the
results. Does this map tell you more about the relationship between Airbnb location/price and
your socio-economic variable of choice? Explain your answer.
1.4.4. Autocorrelation
• Map 4: Explore the degree of spatial autocorrelation. Describe the concepts behind your
approach and interpret your results.
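To make the idea of global spatial autocorrelation concrete, the sketch below computes Moran's I by hand in base R for five hypothetical areas arranged in a row. The function name `morans_i`, the binary contiguity weights and the values are all illustrative assumptions. In practice a dedicated package such as spdep would normally be used, with weights built from the real polygons.

```r
# Moran's I: (n / S0) * sum_ij(w_ij * z_i * z_j) / sum_i(z_i^2),
# where z are deviations from the mean and S0 is the sum of all weights.
morans_i <- function(x, w) {
  n <- length(x)
  z <- x - mean(x)
  s0 <- sum(w)
  (n / s0) * sum(w * outer(z, z)) / sum(z^2)
}

# Binary contiguity weights for five areas in a row:
# each area is a neighbour of the areas directly beside it.
n <- 5
w <- matrix(0, n, n)
for (i in seq_len(n - 1)) {
  w[i, i + 1] <- 1
  w[i + 1, i] <- 1
}

# Similar values sit next to each other (a smooth gradient).
i_gradient <- morans_i(c(1, 2, 3, 4, 5), w)

# Neighbouring values always differ (a checkerboard-like pattern).
i_alternate <- morans_i(c(1, 0, 1, 0, 1), w)
```

Here `i_gradient` is positive and `i_alternate` is negative, which matches the usual reading of Moran's I: positive values indicate that similar values cluster in space, negative values that neighbours tend to differ.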
Part 2 – Choose your own analysis
For this one, you need to pick one of the following three options. Only one, and make the most
of it.
Please Note: This part of the assignment can be done on the Bay Area as a whole or you can
zoom in on one of the counties. For example, you could just focus on San Francisco.
1. Create a geodemographic classification and interpret the results. In the process, answer
the following questions:
• What are the main types of neighbourhoods you identify?
• Which characteristics help you delineate this typology?
• If you had to use this classification to target areas in most need, how would you use it?
Why?
2. Create a regionalisation and interpret the results. In the process, answer at least the
following questions:
• How is the city partitioned by your data?
• What do you learn about the geography of the city from the regionalisation?
• What would be one useful application of this regionalisation in the context of urban policy?
3. Use the osmdata package to download OpenStreetMap Point of Interest (POI) data for
the Bay Area or San Francisco. Using this data, complete the following tasks:
• Visualise the dataset appropriately and discuss why you have taken your specific
approach
• Use DBSCAN to identify areas of the city with high density of POIs, which we will call
areas of interest (AOI). In completing this, answer the following questions:
o What parameters have you used to run DBSCAN? Why?
o What do the clusters help you learn about areas of interest in the city?
o Name one example of how these AOIs can be of use for the city. You can take
the perspective of an urban planner, a policy maker, an operational
practitioner (e.g. police, trash collection), an urban entrepreneur, or any
other role you envision.
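For option 1, a geodemographic classification is often built by clustering standardised area-level variables. The sketch below uses k-means from base R on simulated tracts. It is an assumption-laden illustration rather than the required method: the data are synthetic, `pct_degree` is a hypothetical variable, and the choice of two clusters is arbitrary.

```r
set.seed(42)

# Simulated tracts forming two clearly different groups (synthetic data).
tracts <- data.frame(
  hh_income  = c(rnorm(20, mean = 40000, sd = 3000),
                 rnorm(20, mean = 120000, sd = 3000)),
  pct_degree = c(rnorm(20, mean = 20, sd = 2),
                 rnorm(20, mean = 70, sd = 2))
)

# Standardise so that variables on large scales do not dominate the distances.
tracts_std <- scale(tracts)

# k-means with several random starts; k = 2 is an assumed, illustrative choice.
km <- kmeans(tracts_std, centers = 2, nstart = 10)
tracts$cluster <- km$cluster

# Cluster profile: the mean of each variable per cluster, in original units.
profile <- aggregate(cbind(hh_income, pct_degree) ~ cluster,
                     data = tracts, FUN = mean)
```

The `profile` table is what supports the interpretation questions: it shows which characteristics separate the neighbourhood types and by how much.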
Resources to help you. See also suggested bibliography in slides throughout the course.
• https://www.r-bloggers.com/2017/11/programming-meh-lets-teach-how-to-write-computational-essays-instead/
• https://rmarkdown.rstudio.com/
• https://www.rstudio.com/wp-content/uploads/2015/02/rmarkdown-cheatsheet.pdf
• https://vizual-statistix.tumblr.com/post/114850050736/i-find-the-spread-of-airbnb-to-be-as-fascinating
• https://carto.com/blog/airbnb-impact/
• https://cran.r-project.org/web/packages/biscale/vignettes/biscale.html
Appendix
American Community Survey (ACS) 2016-2020, US Census Bureau. Observations: 1039; Variables:
472; Years: 2016-2020
Variable                Description
B19013_001E             Median household income in the past 12 months (in 2020
                        inflation-adjusted dollars). Coded as hh_income.
B02001 (list of vars)   Population by race. I have already recoded black (n of
                        black people) and all_ppl_race (total population by
                        census tract).
B23006 (list of vars)   Population by education
C15002A (list of vars)  Population by sex by education
C27012 (list of vars)   Population by health insurance
B08006 (list of vars)   Commuting variable
B09010 (list of vars)   Supplementary income variables
B09019 (list of vars)   Household type counts
B17001 (list of vars)   Poverty status
B28011 (list of vars)   Internet access
B99084 (list of vars)   Work from home
For the full variable lists, see https://api.census.gov/data/2020/acs/acs5/variables.html