Project.md 2024-11-10
[CS209A-24Fall] Assignment2 (100 points)
Background
In the process of software development, many questions arise. Developers may turn to Q&A websites to
post questions and seek answers. Stack Overflow is such a Q&A website for programmers, and it belongs to
the Stack Exchange Network. Stack Overflow serves as a platform for users to ask and answer questions and,
through membership and active participation, to vote questions and answers up or down and to edit
questions and answers in a fashion similar to a wiki. Users of Stack Overflow can earn reputation points
and "badges"; for example, a person is awarded 10 reputation points for receiving an "up" vote on a
question or an answer to a question, and can receive badges for their valued contributions. Users unlock
new privileges as their reputation increases, such as the ability to vote, comment, and even edit other
people's posts.
In this final project, we'll use Spring Boot to develop a web application that stores, analyzes, and
visualizes Stack Overflow Q&A data w.r.t. Java programming, with the purpose of understanding the common
questions, answers, and resolution activities associated with Java programming.
Data Collection (10 points)
On Stack Overflow, questions related to Java programming are typically tagged java. You can use this java
tag to identify Java-related questions. A question together with all of its answers and comments is
referred to as a thread.
For Java-related threads on Stack Overflow, we are interested in answering the list of questions described
below. You should first collect proper data from Stack Overflow to answer these questions. Please check the
official Stack Overflow REST API documentation to learn the REST APIs for collecting different types of data.
· You may need to create a Stack Overflow account in order to use its full REST API service.
· API requests are subject to rate limits. Please design and execute your requests carefully; otherwise
  you may reach your daily quota quickly.
· Connections to the Stack Overflow REST service may be unstable sometimes, so please start the data
  collection ASAP!
There are over 1 million threads tagged with java on Stack Overflow. You DON'T have to collect them all.
Yet, you should collect data for at least 1000 threads in order to get meaningful insights from the data
analysis.
Important:
Data collection is offline, meaning that you need to collect and persist the data first. It is recommended
that you use a database (e.g., PostgreSQL, MySQL, etc.) to store the data. However, it is also fine if you
store the data in plain files. In other words, when users interact with your application, the server should
get the data from your local database (or local files), instead of sending REST requests to Stack Overflow
on the fly. Hence, the data analysis for the questions below should be performed on the dataset you
collected. That is, we first collect a subset of Stack Overflow data (e.g., 1000 threads tagged java) and
then answer the following questions using this subset.
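The offline collection step described above could be sketched roughly as follows. This is only an illustration, not a required design: the class name ThreadCollector, its method names, the output file naming, and the one-second pause are all hypothetical, and the endpoint and query parameters (version 2.3 of the Stack Exchange API, the questions route, the tagged, page, pagesize, and key parameters) should be checked against the official API documentation before use. The sketch only downloads pages of questions; answers and comments would need further requests, and storing the raw JSON in plain files is just one of the storage options the assignment allows.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.GZIPInputStream;

// Hypothetical helper: downloads pages of java-tagged questions and saves them as files.
class ThreadCollector {
    // Assumed API root; verify the version against the official documentation.
    static final String API_ROOT = "https://api.stackexchange.com/2.3";

    /** Builds the URI for one page of java-tagged questions; apiKey may be null. */
    static URI buildQuestionsUri(int page, int pageSize, String apiKey) {
        StringBuilder sb = new StringBuilder(API_ROOT)
                .append("/questions?site=stackoverflow&tagged=java")
                .append("&order=desc&sort=activity")
                .append("&page=").append(page)
                .append("&pagesize=").append(pageSize);
        if (apiKey != null && !apiKey.isEmpty()) {
            sb.append("&key=").append(apiKey);
        }
        return URI.create(sb.toString());
    }

    /** Downloads one page and returns the raw JSON text. */
    static String fetchPage(HttpClient client, int page, String apiKey)
            throws IOException, InterruptedException {
        HttpRequest request = HttpRequest.newBuilder(buildQuestionsUri(page, 100, apiKey))
                .GET()
                .build();
        HttpResponse<InputStream> response =
                client.send(request, HttpResponse.BodyHandlers.ofInputStream());
        // Responses are typically compressed; unwrap gzip when the header says so.
        boolean gzip = response.headers().firstValue("Content-Encoding")
                .map(v -> v.equalsIgnoreCase("gzip"))
                .orElse(false);
        try (InputStream in = gzip ? new GZIPInputStream(response.body()) : response.body()) {
            String body = new String(in.readAllBytes(), StandardCharsets.UTF_8);
            if (response.statusCode() != 200) {
                throw new IOException("HTTP " + response.statusCode() + ": " + body);
            }
            return body;
        }
    }

    /** Saves the first `pages` pages as questions-1.json, questions-2.json, ... in outDir. */
    static void collect(int pages, Path outDir, String apiKey)
            throws IOException, InterruptedException {
        Files.createDirectories(outDir);
        HttpClient client = HttpClient.newHttpClient();
        for (int page = 1; page <= pages; page++) {
            String json = fetchPage(client, page, apiKey);
            Files.writeString(outDir.resolve("questions-" + page + ".json"), json);
            Thread.sleep(1000); // crude throttle to stay clear of the rate limits
        }
    }
}
```

With 100 questions per page, a call such as ThreadCollector.collect(10, Path.of("data"), null) would aim at the 1000-thread minimum. Because the files are written once and read later, the web application never needs to contact Stack Overflow while serving requests.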
Part I: Data Analysis (70 points)
For each question from this part, you should:
· Figure out which data is needed to answer the question.
· Design and implement the data analysis on the backend.
· Visualize the results on the frontend using proper charts.
In other words, when interacting with your web application from the browser, users can select the analysis
they are interested in, which sends requests to the server; the server performs the corresponding data
analysis and returns the results to the frontend, which visualizes them on the webpage.
Your work will be evaluated by:
· whether the data analysis is meaningful and relevant, i.e., it can indeed answer the question with proper [...]
· [...] they want instantly by looking at the visualization.
Take a look at the data visualization catalogue for inspiration.
1. Java Topics (10 points)
We have covered various topics in this course, e.g., generics, collections, I/O, lambda, multithreading,
socket, etc. It would be interesting to know: what are the top N topics that are most frequently asked
about on Stack Overflow? (N > 1; you may choose a proper N depending on your data and your UI design. The
same applies to the questions below.)
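One possible way to compute such a ranking with Collections, Lambda, and Stream is sketched below. It assumes that each thread's tags serve as its topics and that the java tag itself is excluded because every collected thread carries it; both are modelling choices rather than requirements from the assignment. The class name TopicFrequency and the method topTopics are hypothetical.

```java
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical helper: ranks topics by the number of threads that mention them.
class TopicFrequency {
    /**
     * Returns the n most frequent tags in descending order of thread count.
     * Ties are broken alphabetically; the "java" tag is ignored (assumption).
     */
    static Map<String, Long> topTopics(List<List<String>> tagsPerThread, int n) {
        // Count each tag at most once per thread.
        Map<String, Long> counts = tagsPerThread.stream()
                .flatMap(tags -> tags.stream().map(String::toLowerCase).distinct())
                .filter(tag -> !tag.equals("java"))
                .collect(Collectors.groupingBy(Function.identity(), Collectors.counting()));

        // Sort by count (descending), keep the first n, and preserve that order.
        return counts.entrySet().stream()
                .sorted(Map.Entry.<String, Long>comparingByValue(Comparator.reverseOrder())
                        .thenComparing(Map.Entry.comparingByKey()))
                .limit(n)
                .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue,
                        (a, b) -> a, LinkedHashMap::new));
    }

    public static void main(String[] args) {
        // Made-up sample data, standing in for tags loaded from the local dataset.
        List<List<String>> sample = List.of(
                List.of("java", "generics"),
                List.of("java", "multithreading", "collections"),
                List.of("java", "generics", "collections"),
                List.of("java", "generics"));
        System.out.println(topTopics(sample, 2)); // prints {generics=3, collections=2}
    }
}
```

Running the computation inside the request handler, rather than caching its output, keeps the result dynamically generated on every request.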
2. User Engagement (15 points)
What are the top N topics that have the most engagement from users with higher reputation scores? User
engagement means any user activity (e.g., edit, answer, comment, upvote, downvote, etc.) on the thread.
3. Common Mistakes (15 points)
Developers make mistakes, which result in bugs in the code. Bugs manifest themselves as errors or
exceptions, which can be roughly classified as:
· Fatal errors: errors like OutOfMemoryError that cannot be recovered from at runtime.
· Exceptions: checked exceptions and runtime exceptions that can be handled programmatically by
  developers.
What are the top N errors and exceptions that are frequently discussed by Java developers?
Note that tags are high-level information and may not include low-level errors or exceptions. Hence, for
this question, you cannot use tag information alone. You need to further analyze thread content (e.g.,
question text and answer text) to identify error- or exception-related information, probably using
advanced techniques such as regular expression matching.
4. Answer Quality (30 points)
We consider an answer to be "high-quality" if it is accepted or has many upvotes. It's useful to know
which factors contribute to high-quality answers.
Please investigate the following factors:
· The elapsed time between question creation and answer creation (e.g., does the first posted answer
  tend to be accepted?).
· The reputation of the user who creates the answer (e.g., do answers created by high-reputation
  users tend to be accepted or have more upvotes?).
In addition to these 2 factors, you should also propose 1 more factor that may contribute to the quality
of answers.
For each of the 3 factors, use proper data analysis and visualizations to demonstrate whether the factor
contributes to high-quality answers or not.
Part II: RESTful Service (20 points)
Your application should also provide a REST service that answers the following two questions, so that
users can use RESTful APIs to GET the answers they want. The required REST services include:
· Topic frequency: users can query for the frequency of a specific topic. Users can also query for the
  top N topics sorted by frequency.
· Bug frequency: users can query for the frequency of a specific error or exception. Users can also
  query for the top N errors or exceptions sorted by frequency.
Here, you can reuse the data analysis from Part I. Responses to the REST requests should be in JSON format.
Requirements
Data Analysis
You should implement the data analysis by yourself, using Java features such as Collections, Lambda, and
Stream. You CANNOT feed the data to AI, ask AI to do the analysis, and use AI responses as your data
analysis results. You will get 0 points for the question if you do so.
Data analysis results should be dynamically generated by the server every time clients send a request. You
SHOULD NOT precompute the results, store them as static content, and then simply display the precomputed
static content on the frontend. 20 points will be deducted if you do so.
Web Framework
You should only use Spring Boot as the web framework.
Frontend
Frontend functionalities, such as data visualization and interactive controls, can be implemented in any
programming language (e.g., JavaScript, HTML, CSS, etc.) with any 3rd-party libraries or frameworks.