library(tidyverse)
library(mosaic)
library(rio)
library(ggplot2)
# Be sure to make this using ggplot()
Semester Project
Introduction
[Provide a brief overview of your research question. Explain why you’re interested in the topic, where you found your data and the objectives of your analysis.]
Part 1: Descriptive Statistics
Part 1 of your semester project should include:
- Reading in your data
- Preparing the data a. This could include filtering out missing, bad or unnecessary data
- Summarizing the data a. Identify and summarize your response variable
- Visual summaries using ggplot
- Numerical summary tables
- Explore the explanatory variables (check for outliers)
- Visualize the relationship between a few explanatory variables and your response variable using ggplot
Data Import and Preparation
Visualizations
[Use ggplot()
to create presentation-worth graphs (histograms, scatterplots, bar charts, boxplots, etc). Include an overall visualization of your response variable and one or two exploratory graphs showing the relationship between explanatory variables and your response.]
# Create Visuals Here
Numerical Summaries
[Create numerical summaries (favstats()
, prop.table()
with row or column totals, etc.) of your response variable. Include a table of summary statistics broken out by your explanatory variable. Include a discussion of what you observe.]
# Create numerical summaries
Part 2: Statistical Inference
Perform at least 1 complete statistical inference appropriate to your data, including:
- State your null and alternative hypotheses
- Perform the appropriate statistical test
- Create confidence intervals where appropriate
- Check test requirements
- Correct interpretation and conclusions
State Ho and Ha
Hypothesis Test
# Perform the appropriate test for the data selected