This page is provided to give you a one-page view of your task completion throughout the semester. You can edit the .md file in your repository from * [ ] blah blah blah to * [x] blah blah blah to show the task as completed. You can see a one-page view of all task backgrounds as well.
The mapping of tasks to a final grade is listed in Syllabus::Competency Scale
The deliverables for your semester project are assigned in case study 1 and case study 12. However you should keep a steady workflow on this project throughout the semester.
Leadership in data science requires you to be prepared to answer the clients questions when the team meeting is scheduled. We are treating class like the team meeting and you need to be prepared to answer and ask questions about the driving objectives from the client.
On-time - 18/8/3
Complete
On-time
Complete
On-time
Complete
Complete
Case study 13 provides the checklist items for your portfolio. This portfolio package will be the basis of your final grade.
On the last day of class each of you will have the 1.5 hours of class to complete a coding challenge. You will not be able to work with others, but will be able to ask questions of the teacher in class. The coding challenge will use the techniques and programming developed from the course.
.Rmd file in R-Studio.Rmd file.html and keep the .md file as wellhathawayj..md file to the general channel in our Slack groupgapminder dataset from library(gapminder) (get them to match as closely as you can)library(tidyverse) to load ggplot2 and dplyr and the theme_bw() to duplicate the first plotscale_y_continuous(trans = "sqrt") to get the correct scale on the y-axis.weighted.mean() and GDP with summarise() and group_by() that will be the black continent average line on the second plottheme_bw() to duplicate the second plot. You will need to use the new data to make the black lines and dots showing the continent average.ggsave() and save each plot as a .png with a width of 15 inches.Rmd, .md, and the two .png’s of the plots into your git repository..rmd file that has your report.Rmd, .md, and .html to your GitHub repo.md and .html file into your git repositorylibrary(haven) , library(readr), and library(readxl) to load the 6 data sets listed here.xlsx fileseparate() and mutate() functions to create a decade column..xlsx file..Rmd file with 1-2 paragraphs summarizing your graphics and how those graphics answer the driving question.md and .html file into your git repositorydevtools::install_github("hathawayj/buildings") and find out what data is in the packageclimate_zone_fips data to the buildings0809 data using the two FIPS columns for state and county.Food_Beverage_Service group of buildings in the Type variable, use the ProjectTitle column to create new subgroups from the groupings in the code section below and the restaurant names in restaurants.Full Service Restaurants and be in Quick Service Restaurants if they are under 4000 square feet and new construction.str_to_lower() and str_trim() to get all the words in a standardized formcase_when() function to create the subgroups..Rmd file with 2-3 paragraphs summarizing your 3-4 graphics that inform the client questions.md and .html file into your git repository.csv file that was in the zip file and examine the structure of the dataread_rds(gzcon(url("https://byuistats.github.io/M335/data/BoM_SaviorNames.rds"))) to download and load the Savior names table into R.Rmd file with 1-2 paragraphs summarizing your graphic that shows how the distance between Savior names is distributed across the Book of Mormon.md and .html file into your git repositoryhttps://byuistats.github.io/M335/data/sales.csv and format it for visualization and analysislibrary(lubridate) to create the correct time aggregations.md and .html file into your git repositorytickers_today <- c("CXW", "F", "GM", "JCP", "KR", "WDC", "NKE","T", "WDAY", "WFC", "WMT")library(dygraphs) to build interactive visualizations of the stock performances over the last 5 years.library(ggplot2) graphic that helps you build a solid question around how an investor would use volume in their trading strategy.Rmd file with 1-2 paragraphs summarizing your graphics and the choices you made in your visualization.md and .html file into your git repositorypermits data from devtools::install_github("hathawayj/buildings") and tidy it for spatial visualization needslibrary(USAboundaries) package to get the spatial maps for the US.Rmd file with 1-2 paragraphs summarizing your graphics and the choices you made in your visualization to address the reporter’s needs.md and .html file into your git repository.Rmd file with 1-2 paragraphs summarizing your graphics and the choices you made in your visualization.md and .html file into your git repository.Rmd for your presentationlastname firstletterfirstname – For example mine is hathawayj..rmd file that has your reportchild_mortality and create an interesting graphic that highlights this dataset.devtools::install_github("drsimonj/ourworldindata") to get a larger dataset on child_mortality. This will require install.packages("devtools")..Rmd file with short paragraph describing your plots.Rmd, .md, and .html to your GitHub reponycflights13::flights.R script that has your data visualization development with 1-2 commented paragraphs summarizing your 2 finalized graphics and the choices you made in the data presentation.png images of each your final graphics and push all your work to your repository..Rmd file.financing_healthcare and create an interesting graphic that highlights this dataset in relation to child mortality.devtools::install_github("drsimonj/ourworldindata") to get a larger dataset on financing_healthcarechild_mort column).Rmd, .md, and .html to your GitHub repolibrary(readr), library(haven), library(readxl) to read in the five files found on GitHubread_rds(url("WEBLOCATION.rds")) to download and read the .rds file typelibrary(downloader) R package and use the download(mode = "wb") function to download the xlsx data as read_xlsx() cannot read files from the web pathtempfile() function to download and save the file.all.equal().R script and your image to your repository and be ready to share your code that built your graphic in classcontestant_period column is not “tidy” we want to create a month_end and year_end column from the information it contains.rds object.Rmd file with short paragraph describing your plots. Make sure to display the tidyr code in your file.Rmd, .md, and .html to your GitHub reporeadr::read_lines() function to read in each string - randomletters.txt and randomletters_wnumbers.txtrandomletters.txt file, pull out every 1700 letter (e.g. 1, 1700, 3400, …) and find the quote that is hidden - the quote ends with a periodrandomletters_wnumbers.txt file, find all the numbers hidden and convert those numbers to letters using the letters order in the alphabet to decipher the messagerandomletters.txt file, remove all the spaces and periods from the string then find the longest sequence of vowels..R script to your repository and be ready to share your code solution at the beginning of classlibrary(Lahman) and examine the available data sets availablelibrary(blscrapeR) and use the inflation_adjust(2017) function to get all earnings in 2017 dollars.R script and image to your repository and be ready to share your code and image at the beginning of class.Rmd file with short paragraph describing your plots. Make sure to display the tidyr code in your file.Rmd, .md, and .html to your GitHub repo.csv file that was in the zip file and examine the structure of the datainstall.packages("stringr") and install.packages("stringi")stri_stats_latex() and str_locate_all() functions from each package.Rmd file with 1-2 paragraphs and your graphics that answers the three questions.md and .html file into your git repositoryhttps://byuistats.github.io/M335/data/carwash.csv and format it for the needs of this tasklibrary(lubridate).ceiling_date() from library(lubridate)riem_measures(station = "RXE", date_start = , date_end = ) for station RXE from library(riem) to get the matching temperatureslibrary(tidyquant) or library(quantmod).Rmd file with 1-2 paragraphs summarizing your graphics and the choices you made in the data presentation.md and .html file into your git repositorylibrary(dygraphs) plot that shows the Kroger (KR) stock price performance over 5 years.dyRebased() to $10,000..Rmd file with 1-2 paragraphs summarizing your graphics and the choices you made in the data presentation.md and .html file into your git repository.png image that closely matches my examplefill = NA in geom_sf() will not fill the polygons with a grey colorlibrary(USAboundaries) has three useful functions - us_cities(), us_states(), and us_counties().png files to GitHub.png image that plots the required information.png files to GitHub.Rmd file summarizing your reading.md and .html file into your git repository