This page gives you a one-page view of your task completion throughout the semester. To show a task as completed, edit the `.md` file in your repository and change `* [ ] blah blah blah` to `* [x] blah blah blah`. You can see a one-page view of all task backgrounds as well.
The mapping of tasks to a final grade is listed in Syllabus::Competency Scale.
The deliverables for your semester project are assigned in case study 1 and case study 12. However, you should keep a steady workflow on this project throughout the semester.
Leadership in data science requires you to be prepared to answer the client's questions when the team meeting is scheduled. We are treating class like the team meeting, and you need to be prepared to ask and answer questions about the client's driving objectives.
On-time - 18/8/3
Complete
On-time
Complete
On-time
Complete
Complete
Case study 13 provides the checklist items for your portfolio. This portfolio package will be the basis of your final grade.
On the last day of class, each of you will have the full 1.5 hours of class to complete a coding challenge. You will not be able to work with others, but you will be able to ask the teacher questions in class. The coding challenge will use the techniques and programming developed in the course.
* [ ] `.Rmd` file in R-Studio
* [ ] `.Rmd` file
* [ ] `.html` and keep the `.md` file as well
* [ ] `hathawayj`.
* [ ] `.md` file to the general channel in our Slack group
* [ ] `gapminder` dataset from `library(gapminder)` (get them to match as closely as you can)
* [ ] `library(tidyverse)` to load ggplot2 and dplyr and the `theme_bw()` to duplicate the first plot
* [ ] `scale_y_continuous(trans = "sqrt")` to get the correct scale on the y-axis.
* [ ] `weighted.mean()` and GDP with `summarise()` and `group_by()` that will be the black continent average line on the second plot
* [ ] `theme_bw()` to duplicate the second plot. You will need to use the new data to make the black lines and dots showing the continent average.
* [ ] `ggsave()` and save each plot as a `.png` with a width of 15 inches
* [ ] `.Rmd`, `.md`, and the two `.png`’s of the plots into your git repository.
* [ ] `.rmd` file that has your report
* [ ] `.Rmd`, `.md`, and `.html` to your GitHub repo
* [ ] `.md` and `.html` file into your git repository
* [ ] `library(haven)`, `library(readr)`, and `library(readxl)` to load the 6 data sets listed here
* [ ] `.xlsx` file
* [ ] `separate()` and `mutate()` functions to create a decade column.
* [ ] `.xlsx` file.
* [ ] `.Rmd` file with 1-2 paragraphs summarizing your graphics and how those graphics answer the driving question
* [ ] `.md` and `.html` file into your git repository
* [ ] `devtools::install_github("hathawayj/buildings")` and find out what data is in the package
* [ ] `climate_zone_fips` data to the `buildings0809` data using the two `FIPS` columns for state and county.
* [ ] `Food_Beverage_Service` group of buildings in the `Type` variable, use the `ProjectTitle` column to create new subgroups from the groupings in the code section below and the restaurant names in `restaurants`.
* [ ] `Full Service Restaurants` and be in `Quick Service Restaurants` if they are under 4000 square feet and new construction.
* [ ] `str_to_lower()` and `str_trim()` to get all the words in a standardized form
* [ ] `case_when()` function to create the subgroups.
* [ ] `.Rmd` file with 2-3 paragraphs summarizing your 3-4 graphics that inform the client questions
* [ ] `.md` and `.html` file into your git repository
* [ ] `.csv` file that was in the zip file and examine the structure of the data
* [ ] `read_rds(gzcon(url("https://byuistats.github.io/M335/data/BoM_SaviorNames.rds")))` to download and load the Savior names table into R
* [ ] `.Rmd` file with 1-2 paragraphs summarizing your graphic that shows how the distance between Savior names is distributed across the Book of Mormon
* [ ] `.md` and `.html` file into your git repository
* [ ] `https://byuistats.github.io/M335/data/sales.csv` and format it for visualization and analysis
* [ ] `library(lubridate)` to create the correct time aggregations
* [ ] `.md` and `.html` file into your git repository
* [ ] `tickers_today <- c("CXW", "F", "GM", "JCP", "KR", "WDC", "NKE", "T", "WDAY", "WFC", "WMT")`
* [ ] `library(dygraphs)` to build interactive visualizations of the stock performances over the last 5 years.
* [ ] `library(ggplot2)` graphic that helps you build a solid question around how an investor would use volume in their trading strategy
* [ ] `.Rmd` file with 1-2 paragraphs summarizing your graphics and the choices you made in your visualization
* [ ] `.md` and `.html` file into your git repository
* [ ] `permits` data from `devtools::install_github("hathawayj/buildings")` and tidy it for spatial visualization needs
* [ ] `library(USAboundaries)` package to get the spatial maps for the US
* [ ] `.Rmd` file with 1-2 paragraphs summarizing your graphics and the choices you made in your visualization to address the reporter’s needs
* [ ] `.md` and `.html` file into your git repository
* [ ] `.Rmd` file with 1-2 paragraphs summarizing your graphics and the choices you made in your visualization
* [ ] `.md` and `.html` file into your git repository
* [ ] `.Rmd` for your presentation
* [ ] `lastname` `firstletterfirstname` – For example mine is `hathawayj`.
* [ ] `.rmd` file that has your report
* [ ] `child_mortality` and create an interesting graphic that highlights this dataset.
* [ ] `devtools::install_github("drsimonj/ourworldindata")` to get a larger dataset on `child_mortality`. This will require `install.packages("devtools")`.
* [ ] `.Rmd` file with short paragraph describing your plots
* [ ] `.Rmd`, `.md`, and `.html` to your GitHub repo
* [ ] `nycflights13::flights`
* [ ] `.R` script that has your data visualization development with 1-2 commented paragraphs summarizing your 2 finalized graphics and the choices you made in the data presentation
* [ ] `.png` images of each of your final graphics and push all your work to your repository.
* [ ] `.Rmd` file.
* [ ] `financing_healthcare` and create an interesting graphic that highlights this dataset in relation to child mortality.
* [ ] `devtools::install_github("drsimonj/ourworldindata")` to get a larger dataset on `financing_healthcare`
* [ ] `child_mort` column)
* [ ] `.Rmd`, `.md`, and `.html` to your GitHub repo
* [ ] `library(readr)`, `library(haven)`, `library(readxl)` to read in the five files found on GitHub
* [ ] `read_rds(url("WEBLOCATION.rds"))` to download and read the `.rds` file type
* [ ] `library(downloader)` R package and use the `download(mode = "wb")` function to download the xlsx data as `read_xlsx()` cannot read files from the web path
* [ ] `tempfile()` function to download and save the file.
* [ ] `all.equal()`
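
The save-to-a-temporary-file and compare steps above can be sketched roughly as follows. This is a minimal offline illustration, not the course solution: it uses base R `saveRDS()`/`readRDS()` and `write.csv()`/`read.csv()` on a made-up data frame in place of the real files. The commented-out `download()` call, its placeholder URL, and the `tmp_xlsx` name are assumptions about how the web step might look.

```r
# Toy data standing in for the real files (hypothetical values).
dat <- data.frame(year = c(1990L, 2000L, 2010L),
                  value = c(1.5, 2.5, 3.5))

# tempfile() returns a unique path in the session's temporary directory.
tmp_rds <- tempfile(fileext = ".rds")
tmp_csv <- tempfile(fileext = ".csv")

# For a file on the web, the download step might look like this
# (placeholder URL and file name, shown as comments only):
# tmp_xlsx <- tempfile(fileext = ".xlsx")
# downloader::download("https://example.com/data.xlsx", destfile = tmp_xlsx, mode = "wb")
# readxl::read_xlsx(tmp_xlsx)

# Save the same data in two formats, then read both copies back.
saveRDS(dat, tmp_rds)
write.csv(dat, tmp_csv, row.names = FALSE)
from_rds <- readRDS(tmp_rds)
from_csv <- read.csv(tmp_csv)

# all.equal() returns TRUE or a description of the differences,
# so wrap it in isTRUE() when a single logical value is needed.
same <- isTRUE(all.equal(from_rds, from_csv))
print(same)

# Remove the temporary files when finished.
unlink(c(tmp_rds, tmp_csv))
```

With `readr`, `read_rds()` and `read_csv()` would take the place of the base readers. Tibbles carry extra attributes, so the comparison may then need `check.attributes = FALSE`.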
* [ ] `.R` script and your image to your repository and be ready to share your code that built your graphic in class
* [ ] `contestant_period` column is not “tidy”; we want to create a `month_end` and `year_end` column from the information it contains
* [ ] `.rds` object
* [ ] `.Rmd` file with short paragraph describing your plots. Make sure to display the `tidyr` code in your file
* [ ] `.Rmd`, `.md`, and `.html` to your GitHub repo
* [ ] `readr::read_lines()` function to read in each string - randomletters.txt and randomletters_wnumbers.txt
* [ ] `randomletters.txt` file, pull out every 1700th letter (e.g. 1, 1700, 3400, …) and find the quote that is hidden - the quote ends with a period
* [ ] `randomletters_wnumbers.txt` file, find all the numbers hidden and convert those numbers to letters using the letters order in the alphabet to decipher the message
* [ ] `randomletters.txt` file, remove all the spaces and periods from the string then find the longest sequence of vowels.
* [ ] `.R` script to your repository and be ready to share your code solution at the beginning of class
* [ ] `library(Lahman)` and examine the available data sets
* [ ] `library(blscrapeR)` and use the `inflation_adjust(2017)` function to get all earnings in 2017 dollars
* [ ] `.R` script and image to your repository and be ready to share your code and image at the beginning of class
* [ ] `.Rmd` file with short paragraph describing your plots. Make sure to display the `tidyr` code in your file
* [ ] `.Rmd`, `.md`, and `.html` to your GitHub repo
* [ ] `.csv` file that was in the zip file and examine the structure of the data
* [ ] `install.packages("stringr")` and `install.packages("stringi")`
* [ ] `stri_stats_latex()` and `str_locate_all()` functions from each package
* [ ] `.Rmd` file with 1-2 paragraphs and your graphics that answer the three questions
* [ ] `.md` and `.html` file into your git repository
* [ ] `https://byuistats.github.io/M335/data/carwash.csv` and format it for the needs of this task
* [ ] `library(lubridate)`.
* [ ] `ceiling_date()` from `library(lubridate)`
* [ ] `riem_measures(station = "RXE", date_start = , date_end = )` for station `RXE` from `library(riem)` to get the matching temperatures
* [ ] `library(tidyquant)` or `library(quantmod)`
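
For the `ceiling_date()` item in the car wash steps above, an hourly grouping might be built along these lines. This is a sketch under stated assumptions: the `time` and `amount` column names and the three sample rows are made up for illustration and may not match the real `carwash.csv`.

```r
library(lubridate)
library(dplyr)

# Hypothetical point-of-sale records; the real carwash.csv columns may differ.
sales <- tibble(
  time   = ymd_hms(c("2016-05-13 20:27:00",
                     "2016-05-13 20:45:00",
                     "2016-05-13 21:10:00")),
  amount = c(10, 5, 20)
)

# ceiling_date() rounds each timestamp up to the next hour boundary,
# so every sale made during the 20:00 hour is labelled 21:00.
hourly <- sales %>%
  mutate(hour = ceiling_date(time, unit = "hour")) %>%
  group_by(hour) %>%
  summarise(sales = sum(amount))

print(hourly)
```

Labelling each sale with the end of its hour gives one row per hour, which is a convenient key for joining against hourly weather readings.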
* [ ] `.Rmd` file with 1-2 paragraphs summarizing your graphics and the choices you made in the data presentation
* [ ] `.md` and `.html` file into your git repository
* [ ] `library(dygraphs)` plot that shows the Kroger (KR) stock price performance over 5 years.
* [ ] `dyRebased()` to $10,000.
* [ ] `.Rmd` file with 1-2 paragraphs summarizing your graphics and the choices you made in the data presentation
* [ ] `.md` and `.html` file into your git repository
* [ ] `.png` image that closely matches my example
* [ ] `fill = NA` in `geom_sf()` will not fill the polygons with a grey color
* [ ] `library(USAboundaries)` has three useful functions - `us_cities()`, `us_states()`, and `us_counties()`
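
The `fill = NA` hint above can be tried without downloading any boundary data. The sketch below is only an illustration: a hand-built unit square stands in for a state or county polygon, and the object names `square`, `shapes`, and `outline_map` are hypothetical. It assumes the `sf` and `ggplot2` packages are installed.

```r
library(sf)
library(ggplot2)

# A hand-built unit square stands in for a state or county polygon.
square <- st_polygon(list(rbind(c(0, 0), c(1, 0), c(1, 1), c(0, 1), c(0, 0))))
shapes <- st_sf(name = "unit square", geometry = st_sfc(square))

# fill = NA draws only the outline instead of the default grey fill.
outline_map <- ggplot(shapes) +
  geom_sf(fill = NA, color = "black") +
  theme_bw()

# Printing outline_map, or passing it to ggsave(), renders the plot.
```

The same `geom_sf(fill = NA)` layer should apply to the objects returned by the `USAboundaries` functions, assuming they are `sf` data frames.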
* [ ] `.png` files to GitHub
* [ ] `.png` image that plots the required information
* [ ] `.png` files to GitHub
* [ ] `.Rmd` file summarizing your reading
* [ ] `.md` and `.html` file into your git repository