首页
网站开发
桌面应用
管理软件
微信开发
App开发
嵌入式软件
工具软件
数据采集与分析
其他
首页
>
> 详细
代写MATH38161、代做R程序设计
项目预算:
开发周期:
发布时间:
要求地区:
MATH38161 Multivariate Statistics and Machine Learning
Coursework
November 2024
Overview
The coursework is a data analysis project with a written report. You will apply skills
and techniques acquired from Week 1 to Week 8 to analyse a subset of the FMNIST
dataset.
In completing this coursework, you should primarily use the techniques and methods
introduced during the course. The assessment will focus on your understanding and
demonstration of these techniques in alignment with the learning outcomes, rather
than the accuracy or exactness of the final results.
The project report will be marked out of 30. The marking scheme is detailed below.
You have twelve days to complete this coursework, with a total workload of approximately 10 hours (including preliminary coursework tasks).
Format
• Software: You should mainly use R to perform the data analysis. You may use
built-in functions from R packages or implement the algorithms with your own
codes.
• Report: You may use any document preparation system of your choice but the
final document must be a single PDF in A4 format. Ensure that the text in the
PDF is machine-readable.
• Content: Your report must include the complete analysis in a reproducible format,
integrating the computer code, figures, and text etc. in one document.
• Title Page: Show your full name and your University ID on the title page of your
report.
• Length: Recommended length is 8 pages of content (single sided) plus title
page. Maximum length is 10 pages of content plus the title page. Any content
exceeding 10 pages will not be marked.
1
Submission process and deadline
• The deadline for submission is 11:59pm, Friday 29 November 2024.
• Submission is online on Blackboard (through Grapescope).
Academic Integrity and Use of AI Tools
This is an individual coursework. Your analysis and report must be completed
independently, including all computer code. Note that according to the University
guidances, output generated by AI tools is considered work created by another person.
• Citations: Acknowledge all sources, including AI tools used to support text and
code writing.
• Ethics: Use sources in an academically appropriate and ethical manner. Do not
copy verbatim, and cite the original authors rather than second- or third-level
sources.
• Accuracy: Be mindful that sources, including Wikipedia and AI tools, may contain
non-obvious errors.
Copying and plagiarism (=passing off someone else’s work as your own) is a very
serious offence and will be strictly prosecuted. For more details see the “Guidance
to students on plagiarism and other forms of academic malpractice” available at
https://documents.manchester.ac.uk/display.aspx?DocID=2870 .
2
Coursework tasks
Analysis of the FMNIST data using principal component analysis
(PCA) and Gaussian mixture models (GMMs)
The Fashion MNIST dataset contains 70,000 grayscale images of fashion products
categorised into 10 distinct classes. More information is available on Wikipedia and
Github.
The data set to be analysed in this coursework is a subset of the full FMNIST data and
contains 10,000 images, each with dimensions of 28 by 28 pixels, resulting in a total of
784 pixels per image. Each pixel is represented by an integer value ranging from 0 to
255. You can download this data subset as “fmnist.rda” (7.4 MB) from Blackboard.
load("fmnist.rda") # load sampled FMNIST data set
dim(fmnist$x) # dimension of features data matrix (10000, 784)
## [1] 10000 784
range(fmnist$x) # range of feature values (0 to 255)
## [1] 0 255
Here is a plot of the first 15 images:
par(mfrow=c(3,5), mar=c(1,1,1,1))
for (k in 1:15) # first 15 images
{
m = matrix( fmnist$x[k,] , nrow=28, byrow=TRUE)
image(t(apply(m, 2, rev)), col=grey(seq(1,0,length=256)), axes = FALSE)
}
3
Each sample is assigned to one label represented by an integer from 0 to 9 (as R factor
with 10 levels):
fmnist$label[1:15] # first 15 labels
## [1] 7 1 4 8 1 4 7 1 2 0 7 0 8 1 6
## Levels: 0 1 2 3 4 5 6 7 8 9
Task 1: Dimension reduction for FMNIST data using principal components analysis
(PCA)
The following steps are suggested guidelines to help structure your analysis but are not
meant as assignment-style questions. Integrate your work as part of a cohesive report
with a logical narrative.
• Do some research to learn more about the FMNIST data.
• Compute the 784 principal components from the 784 original pixel variables.
• Compute and plot the proportion of variation attributed to each principal component.
• Create a scatter plot of the first two principal components. Use the known labels
to colour the scatter plot.
• Construct the correlation loadings plot.
• Interpret and discuss the result.
• Save the first 10 principal components of all 10,000 images to a data file for Task 2.
Task 2: Analysis of the FMNIST data set using Gaussian mixture models (GMMs)
Using all 784 pixel variables for cluster analysis is computationally impractical. In
this task, use the 10 (or fewer) principal components instead of the original 784 pixel
variables. Again, these steps serve as guidelines. Integrate this work into your report
logically following from Task 1.
• Cluster the data using Gaussian mixture models (GMMs).
• Find out how many clusters can be identified.
• Interpret and discuss the results.
Structure of the report
Your report should be structured into the following sections:
1. Dataset
2. Methods
3. Results and Discussion
4. References
In Section 1 provide some background and describe the data set. In Section 2 briefly
introduce the method(s) you are using to analyse the data. In Section 3 run the analyses
and present and interpret the results. Show all your R code so that your results are
fully reproducible. In Section 4 list all journal articles, books, wikipedia entries, github
pages and other sources you refer to in your report.
4
Marking scheme
The project report will be assessed out of 30 points based on the following rubrics.
Criteria Marks Rubrics
Description of
data
6 Excellent (5-6 marks): Provides a clear and thorough
overview of the FMNIST dataset, detailing the image
structure, pixel data, and its context within multivariate
analysis.
Good (3-4 marks): Provides a clear overview of the
dataset with some context; minor details may be missing.
Adequate (1-2 marks): Basic description of the dataset
with limited context; lacks important details.
Insufficient (0 marks): Little to no description provided.
Description of
Methods
6 Excellent (5-6 marks): Clearly and thoroughly explains
PCA and GMMs, their purposes, and how they apply to
this dataset.
Good (3-4 marks): Provides a clear explanation of PCA
and GMMs, with minor gaps in clarity or relevance.
Adequate (1-2 marks): Basic explanation of methods with
limited detail or relevance to the course techniques.
Insufficient (0 marks): Lacks clear explanations of the
methods.
Results and
Discussion
12 Excellent (10-12 marks): Correctly applies PCA and
GMMs, presents clear and informative visualisations, and
provides a coherent and insightful interpretation of the
results.
Good (7-9 marks): Accurately applies PCA and GMMs
with mostly clear visuals and reasonable interpretation;
minor improvements needed.
Adequate (4-6 marks): Basic application of techniques,
limited or unclear visuals, minimal interpretation.
Insufficient (0-3 marks): Incorrect application of
techniques, with little to no interpretation.
Overall
Presentation of
Report
6 Excellent (5-6 marks): Report is well-organised, clear, and
professionally formatted, with a logical narrative and
adherence to page limits.
Good (3-4 marks): Report is generally clear and
organised, with minor structural or formatting issues.
Adequate (1-2 marks): Report lacks coherence or has
significant formatting issues; may not meet all format
requirements.
Insufficient (0 marks): Report lacks structure and clarity,
does not meet formatting requirements.
5
软件开发、广告设计客服
QQ:99515681
邮箱:99515681@qq.com
工作时间:8:00-23:00
微信:codinghelp
热点项目
更多
代做 program、代写 c++设计程...
2024-12-23
comp2012j 代写、代做 java 设...
2024-12-23
代做 data 编程、代写 python/...
2024-12-23
代做en.553.413-613 applied s...
2024-12-23
代做steady-state analvsis代做...
2024-12-23
代写photo essay of a deciduo...
2024-12-23
代写gpa analyzer调试c/c++语言
2024-12-23
代做comp 330 (fall 2024): as...
2024-12-23
代写pstat 160a fall 2024 - a...
2024-12-23
代做pstat 160a: stochastic p...
2024-12-23
代做7ssgn110 environmental d...
2024-12-23
代做compsci 4039 programming...
2024-12-23
代做lab exercise 8: dictiona...
2024-12-23
热点标签
mktg2509
csci 2600
38170
lng302
csse3010
phas3226
77938
arch1162
engn4536/engn6536
acx5903
comp151101
phl245
cse12
comp9312
stat3016/6016
phas0038
comp2140
6qqmb312
xjco3011
rest0005
ematm0051
5qqmn219
lubs5062m
eee8155
cege0100
eap033
artd1109
mat246
etc3430
ecmm462
mis102
inft6800
ddes9903
comp6521
comp9517
comp3331/9331
comp4337
comp6008
comp9414
bu.231.790.81
man00150m
csb352h
math1041
eengm4100
isys1002
08
6057cem
mktg3504
mthm036
mtrx1701
mth3241
eeee3086
cmp-7038b
cmp-7000a
ints4010
econ2151
infs5710
fins5516
fin3309
fins5510
gsoe9340
math2007
math2036
soee5010
mark3088
infs3605
elec9714
comp2271
ma214
comp2211
infs3604
600426
sit254
acct3091
bbt405
msin0116
com107/com113
mark5826
sit120
comp9021
eco2101
eeen40700
cs253
ece3114
ecmm447
chns3000
math377
itd102
comp9444
comp(2041|9044)
econ0060
econ7230
mgt001371
ecs-323
cs6250
mgdi60012
mdia2012
comm221001
comm5000
ma1008
engl642
econ241
com333
math367
mis201
nbs-7041x
meek16104
econ2003
comm1190
mbas902
comp-1027
dpst1091
comp7315
eppd1033
m06
ee3025
msci231
bb113/bbs1063
fc709
comp3425
comp9417
econ42915
cb9101
math1102e
chme0017
fc307
mkt60104
5522usst
litr1-uc6201.200
ee1102
cosc2803
math39512
omp9727
int2067/int5051
bsb151
mgt253
fc021
babs2202
mis2002s
phya21
18-213
cege0012
mdia1002
math38032
mech5125
07
cisc102
mgx3110
cs240
11175
fin3020s
eco3420
ictten622
comp9727
cpt111
de114102d
mgm320h5s
bafi1019
math21112
efim20036
mn-3503
fins5568
110.807
bcpm000028
info6030
bma0092
bcpm0054
math20212
ce335
cs365
cenv6141
ftec5580
math2010
ec3450
comm1170
ecmt1010
csci-ua.0480-003
econ12-200
ib3960
ectb60h3f
cs247—assignment
tk3163
ics3u
ib3j80
comp20008
comp9334
eppd1063
acct2343
cct109
isys1055/3412
math350-real
math2014
eec180
stat141b
econ2101
msinm014/msing014/msing014b
fit2004
comp643
bu1002
cm2030
联系我们
- QQ: 9951568
© 2021
www.rj363.com
软件定制开发网!