Task 6: Developing a graphic

Background

Before we can start to answer business questions we need to become familiar with our data. Often, you will want to start with the data dictionary. However, you can also just dive into the data and gain understanding based on the variable names and types.

Beyond the variable descriptions is how the variables relate to each other. We can create tables or visualizations that summarize how different variables relate to each other. At this point we are deepeing our understanding as well as beginning our analysis.

Remember: Your job is to become the data expert not the domain expert. You will build domain skills but you are not going to replace domain experts. People will depend on you to have a firm understanding of what data your company has available to answer domain specific questions.

Use nycflights13::flights to practice your data summary and data investigation through visualization skills.

Reading

This reading will help you complete the tasks below.

Tasks

  • [ ] Find an insightfull relationship between two of the variables (columns) and display that relationship in a table or graphic
    • [ ] Provide a distributional summary of the relevant variable in nycflights13::flights
    • [ ] Build bivariate summaries of the relevant variables
    • [ ] document the iterative script that built to your insightfull relationship
  • [ ] Create an .R script that has your data visualization development with 1-2 commented paragraphs summarizing your 2 finalized graphics and the choices you made in the data presentation
  • [ ] Save your .png images of each your final graphics and push all your work to your repository.