Tools

Links to the technoligies that we will leverage in class.

GitHub and Git

Blog packages

Spark

Spark SQL

Spark ML

Databricks

Databricks community

We are going to start out using the Databricks Community Edition that is free. Please do the following.

  1. Go to databricks.com/try-databricks.

  1. Enter your information based on the below guidance.
    A. Company: Don’t put BYU-I. Put your First Name, Last Name LLc.
    B. Company Email: Use your BYU-I email that has the numbers.
    C. Enter all other fields as guided.
  2. Click GET STARTED FOR FREE
  3. On the next page select GET STARTED under the COMMUNITY EDITION column on the right.

  1. You will then need to click on link in the email and create a password.
  2. In the upper right corner you will see a user icon (blue circle on blue shoulders) which you should click.
  3. From the drop down menu select Admin Console.
  1. On the Admin Console you should see a blue Add User button where you can add the following two users.
    A. Add hathawayj@byui.edu
    B. Add tun18001@byui.edu

Python and R packages

PySpark

SparklyR

Docker

Installation

PostgresSQL

References