首页
网站开发
桌面应用
管理软件
微信开发
App开发
嵌入式软件
工具软件
数据采集与分析
其他
首页
>
> 详细
data程序代写、Python/c++设计编程代做
项目预算:
开发周期:
发布时间:
要求地区:
STATS 3DA3
Homework Assignment 2
Pratheepa Jeganathan
02/05/2024
Instruction
• Due before 10:00 PM on Tuesday, February 13, 2024.
• Submit a copy of PDF with your solution to Avenue to Learn.
• Late penalty for assignments: 15% will be deducted from assignments each day
after the due date (rounding up).
• Assignments won’t be accepted after 48 hours after the due date.
Assignment Standards
Your assignment must conform to the Assignment Standards listed below.
• Write your name and student number on the title page. We will not grade assignments
without the title page.
• You may discuss homework problems with other students, but you have to prepare the written
assignments yourself.
• LATEXis strongly recommended but not strictly required.
• Eleven-point font (times or similar) must be used with 1.5 line spacing and margins of at
least 1~inch all around.
• Use newpage to write solution for each question (1, 2, 3).
• No screenshots are accepted for any reason.
• The writing and referencing should be appropriate to the undergradaute level.
1
• Various tools, including publicly available internet tools, may be used by the instructor to
check the originality of submitted work.
• Assignment policy on the use of generative AI:
– Students are not permitted to use generative AI in this assignment. In alignment
with McMaster academic integrity policy, it “shall be an offence knowingly to … submit academic work for assessment that was purchased or acquired from another source”.
This includes work created by generative AI tools. Also state in the policy is the following, “Contract Cheating is the act of”outsourcing of student work to third parties”
(Lancaster & Clarke, 2016, p. 639) with or without payment.” Using Generative AI tools
is a form of contract cheating. Charges of academic dishonesty will be brought forward
to the Office of Academic Integrity.
2
Question 1
Download the paper Data Science at the Singularity by David Donoho (2024) at paper. Follow the steps to find the most frequently used words and create a word cloud.
• (1) Reference where you obtained the original PDF document.
• (2) Read all PDF document pages and separate each line by \n.
• (3) Split the lines by \n.
• (4) Remove the lines before Abstract. ...... You can print the first few lines and find
the number of lines to remove.
• (5) Create a data frame with lines.
• (6) Tokenize each line and convert each word to a row.
• (7) Convert each word to lowercase.
• (8) Remove stopwords.
• (9) Remove any other words that are not suitable for the word cloud. For example, a single
letter word, symbols [ . , ) , abbreviation, etc.
• (10) Create a term-frequency data frame.
• (11) Produce a word cloud. You can decide on the most frequently used words in the world
cloud—for example, word cloud for the ten most frequently used words.
• (12) Write a summary paragraph (at least two statements) about your word cloud. The
summary should be cast in the context of your chosen text document.
Question 2
Question 2 uses Johns Hopkins GitHub data on the COVID-19 global vaccine administered to
develop a Shiny App.
Visit the website https://github.com/govex/COVID-19/tree/master/data_tables/vaccine
_data/global_data and read the description (readme.md).
3
This question will lead to developing a Shiny app so that users can choose the date range to
investigate the COVID-19 vaccine administrated and the number of people for whom at least one
dose has been administered.
• (1) Read the CSV file of https://raw .githubusercontent .com/govex/COVID -19/
master/data_tables/vaccine_data/global_data/time_series_covid19_vaccine
_global .csv into Python. Read the data dictionary at https://github .com/
govex / COVID -19 / blob / master / data _tables / vaccine _data / global _data /
data_dictionary.csv.
• (2) Each row is uniquely defined by country and date in the data frame. What is the
dimension of the data?
• (3) Look at the data dictionary. Describe the Doses_admin and People at least one
dose administered variables.
• (4) Identify the data frame column representing the countries. Then, select the rows in the
data frame for Canada.
• (5) Use only the Canada vaccine data to answer the rest of the questions. Plot the time series
data of Dose_dmin and People_at_least_one_dose in the same graph. Label the time
series lines by Doses Administered and People at least one dose administered,
respectively. Convert the y-axis to the log scale. Rotate the x-axis ticks by 45 degrees.
Hint:
1. Convert ‘Date’ column to datetime format.
2. Use matplotlib.pyplot.plot.
• (6) Describe the plot in the context of data.
• (7) Create the Shiny app as follows. In the Shiny app, the user input is any starting and
ending dates. The range of dates may be 2020-12-29 to 2023-03-09. The output is the
time series plot for the logarithm of the doses administrated and people at least
one dose administrated in Canada for the range of dates the users choose. You can
use the following template to create the Shiny app.
4
• (8) Deploy your Shiny app at https://www.shinyapps.io/. Then, provide the link to the
app—for example, https://pratheepaj.shinyapps.io/my_app/.
from shiny import App, render, ui
# import required libraries
app_ui = ui.page_fluid(
ui.input_date_range(
"daterange",
"Date range",
start="2020-12-29",
end= '2023-03-09'
),
ui.output_plot('myplot'),
)
def server(input, output, session):
@output
@render.plot
def myplot():
# Read the data
# select the data for Canada
# If you call the data frame as `df`, then the
# following codes select the rows in the user
# selected date range
df = df[df['Date'] > pd.Timestamp(input.daterange()[0])]
df = df[df['Date'] < pd.Timestamp(input.daterange()[1])]
# Create the plot using `df`
5
app = App(app_ui, server)
3. Helper’s name.
After attempting homework problems individually, students may discuss a homework assignment
with their classmates. However, students must write up their solutions individually and explicitly
indicate who (if anyone) or resources students received help. Write your helper’s name (only one
helper’s name is accepted).
6
Grading scheme
1. 1. Link to the document[1]
2. Codes to read all the pages[1]
3. Codes [1]
4. Codes [1]
5. Codes [1]
6. Codes [2]
7. Codes [1]
8. Codes [1]
9. Codes [1]
10. Codes [1]
11. Codes, word cloud for the most frequently used words [2]
12. Two statements[2]
2. 1. Codes [1]
2. Codes and answer [1]
3. Description [2]
4. Identify the column and code [2]
5. Plot variable 1, plot variable 2 in the same plot, label both time
series, y-axis scale, x-axis ticks [5]
6. At least one statement [1]
7. importing libraries, complete the codes for creating the plot, app
works locally[3]
8. deploying the app, link to the app [2]
The maximum point for this assignment is 32. We will convert this to 100%.
7
软件开发、广告设计客服
QQ:99515681
邮箱:99515681@qq.com
工作时间:8:00-23:00
微信:codinghelp
热点项目
更多
代写dts207tc、sql编程语言代做
2024-12-25
cs209a代做、java程序设计代写
2024-12-25
cs305程序代做、代写python程序...
2024-12-25
代写csc1001、代做python设计程...
2024-12-24
代写practice test preparatio...
2024-12-24
代写bre2031 – environmental...
2024-12-24
代写ece5550: applied kalman ...
2024-12-24
代做conmgnt 7049 – measurem...
2024-12-24
代写ece3700j introduction to...
2024-12-24
代做adad9311 designing the e...
2024-12-24
代做comp5618 - applied cyber...
2024-12-24
代做ece5550: applied kalman ...
2024-12-24
代做cp1402 assignment - netw...
2024-12-24
热点标签
mktg2509
csci 2600
38170
lng302
csse3010
phas3226
77938
arch1162
engn4536/engn6536
acx5903
comp151101
phl245
cse12
comp9312
stat3016/6016
phas0038
comp2140
6qqmb312
xjco3011
rest0005
ematm0051
5qqmn219
lubs5062m
eee8155
cege0100
eap033
artd1109
mat246
etc3430
ecmm462
mis102
inft6800
ddes9903
comp6521
comp9517
comp3331/9331
comp4337
comp6008
comp9414
bu.231.790.81
man00150m
csb352h
math1041
eengm4100
isys1002
08
6057cem
mktg3504
mthm036
mtrx1701
mth3241
eeee3086
cmp-7038b
cmp-7000a
ints4010
econ2151
infs5710
fins5516
fin3309
fins5510
gsoe9340
math2007
math2036
soee5010
mark3088
infs3605
elec9714
comp2271
ma214
comp2211
infs3604
600426
sit254
acct3091
bbt405
msin0116
com107/com113
mark5826
sit120
comp9021
eco2101
eeen40700
cs253
ece3114
ecmm447
chns3000
math377
itd102
comp9444
comp(2041|9044)
econ0060
econ7230
mgt001371
ecs-323
cs6250
mgdi60012
mdia2012
comm221001
comm5000
ma1008
engl642
econ241
com333
math367
mis201
nbs-7041x
meek16104
econ2003
comm1190
mbas902
comp-1027
dpst1091
comp7315
eppd1033
m06
ee3025
msci231
bb113/bbs1063
fc709
comp3425
comp9417
econ42915
cb9101
math1102e
chme0017
fc307
mkt60104
5522usst
litr1-uc6201.200
ee1102
cosc2803
math39512
omp9727
int2067/int5051
bsb151
mgt253
fc021
babs2202
mis2002s
phya21
18-213
cege0012
mdia1002
math38032
mech5125
07
cisc102
mgx3110
cs240
11175
fin3020s
eco3420
ictten622
comp9727
cpt111
de114102d
mgm320h5s
bafi1019
math21112
efim20036
mn-3503
fins5568
110.807
bcpm000028
info6030
bma0092
bcpm0054
math20212
ce335
cs365
cenv6141
ftec5580
math2010
ec3450
comm1170
ecmt1010
csci-ua.0480-003
econ12-200
ib3960
ectb60h3f
cs247—assignment
tk3163
ics3u
ib3j80
comp20008
comp9334
eppd1063
acct2343
cct109
isys1055/3412
math350-real
math2014
eec180
stat141b
econ2101
msinm014/msing014/msing014b
fit2004
comp643
bu1002
cm2030
联系我们
- QQ: 9951568
© 2021
www.rj363.com
软件定制开发网!