Spark Starter Kit


NOT another “What is Spark?” course! Explore Spark in depth and build a strong foundation in it.

What you’ll learn

  • Learn about the similarities and differences between Spark and Hadoop.
  • Explore the challenges Spark tries to address, which will give you a good idea of why Spark is needed.
  • Learn how Spark is faster than Hadoop, and understand the reasons behind Spark’s performance and efficiency.
  • Before we talk about what an RDD is, we explain in detail why something like an RDD is needed.
  • You will get a strong foundation in understanding RDDs in depth, and then we go a step further to point out and clarify some of the common misconceptions about RDDs among new Spark learners.
  • You will understand the types of dependencies between RDDs and, more importantly, why those dependencies matter.
  • We will walk you through, step by step, how the program we write gets translated into actual execution behind the scenes in a Spark cluster.
  • You will get a very good understanding of some of the key concepts behind Spark’s execution engine and the reasons why it is efficient.
  • Master fault tolerance by simulating a fault situation and examining how Spark recovers from it.
  • You will learn how Spark manages memory and the contents held in memory.
  • Understand the need for a new programming language like Scala.
  • Examine object-oriented programming vs. functional programming.
  • Explore Scala’s features and functions.


Requirements

  • Basic Hadoop concepts. Don’t know Hadoop? Don’t worry, sign up for our free Hadoop Starter Kit course.


When our students asked us to create a course on Spark, we looked at other Spark-related courses on the market and at the common questions students ask on websites like Stack Overflow and other forums when they try to learn Spark. We saw a recurring theme.

Most courses and other online help, including Spark’s documentation, are not good at helping students understand the foundational concepts. They explain what Spark is, what an RDD is, what “this” is and what “that” is, but students were most interested in understanding the core fundamentals and, more importantly, in answering questions like:

1. Why do we need Spark when we have Hadoop?
2. What is the need for RDD?
3. How is Spark faster than Hadoop?
4. How does Spark achieve the speed and efficiency it claims?
5. How does memory get managed in Spark?
6. How does fault tolerance work in Spark?

That is exactly what you will learn in this free Spark Starter Kit course, which aims to give you a strong foundation in Spark.

Who this course is for

  • Anyone interested in distributed systems and computing and big data-related technologies.