CSE 451: Big Data Programming and Analytics
Home
Syllabus
Readings
▾
Spark
Learning Spark
Spark Programming Guide
SparkSQL
Spark w/ Python
Spark by {Examples}
pyspark-examples
Spark w/ R
Mastering Spark with R
Spark from R
Docker
docker 101 tutorial
docker for beginners
docker docs
CS451 Docker Guide
Course articles
Design Thinking
Big data articles
Math 488 Data Science Consulting Articles
Class Time
Projects
Tools
▾
Overview
Visit Me
☰
Articles on Big Data
Contents
Big Data Concepts
Visualizating Big Data
Storage
Spark Concepts
SparkR and PySpark
Spark SQL
Spark ML
Big Data Concepts
The Four V’s of Big Data
The 10 Vs of Big Data
Big Data Cycle
Visualizating Big Data
Big data visualization techniques
Storage
What is Apache Parquet and why you should us it
Spark Concepts
The art of joining in Spark
Billions of Rows, Milliseconds of Time- PySpark Starter Guide
SparkR and PySpark
Cheat sheet PySpark Python.indd
A Compelling Case for SparkR
Spark SQL
PySpark and SparkSQL Basics
Hands-On Tutorial to Analyze Data using Spark SQL
Spark ML
Parallel Processing of Machine Learning Algorithms