This assignment is meant to give you practice with loops, arrays, String input/output, and fundamental knowledge of primitive v.s. object types. You should write your responses to the written components of this homework in the .tex file inside the solution environments for each problem, compile the .tex file, and submit the resulting pdf file (I don’t need your tex files). You should also submit all java files for your assignment after making sure they work. A submission that doesn’t at least compile without error will automatically get a 0.
1. (50 points) This project asks you to write code to answer some basic data science questions
about a data set.
On Blackboard, you will find a file containing data about news related posts made on Facebook
by a variety of groups (not exhaustive). The data has information about the type of post it was,
who posted it, whether fact checkers later found that the post was mostly true, mostly false,
half and half, or not factual content (i.e. a piece about a restaurant opening or something). I
want you to write a program that will take this data file and process it to answer the following
questions. Write the results that you find here in the pdf file, and submit your code online.
Typically reactions to a post happen in much greater number than shares or comments. For
the purposes of this assignment, Define the “user engagement score” of a post as 5*shares +
reactions + 6*comments.
(a) Out of the news sources considered, which had the highest average truth rating and what
was that average rating? Which had the lowest? You should consider mostly true ratings
to be worth one point, half and half to be .5, mostly false to be 0, and not factual posts to
not factor in the discussion. In order words, a source that posts 100 not factual posts and
1 mostly true post should have a perfect rating.
(b) What is the most popular type of post (link, video. . . ) in terms of the average user
engagement score? What about in terms of only shares or only comments?
(c) What is the most popular truth rating of post (mostly true, mostly false, etc.) in terms of
the average user engagement score? What about in terms of only shares or only comments?
Get Free Quote!
425 Experts Online