Big Data Analytics with PySpark

Start Date: August 14, 2022

Last date to register: August 9, 2022

Course duration: 15 hours

Cost: donation

Skill level: intermediate

Course Description

For whom is this course

Spark is a “lightning-fast cluster computing” framework for Big Data. It provides a general-purpose data processing engine and can run programs up to 100x faster in memory, or 10x faster on disk, than Hadoop MapReduce.

This course is for data science enthusiasts who will use PySpark, the Python API for Spark, along with its powerful higher-level libraries such as Spark SQL and MLlib (for machine learning).

At the end of this course, you will have gained an in-depth understanding of PySpark and its application to general Big Data analysis.

What you will learn

You will learn the following topics in this course:

  • PySpark installation
  • Introduction to Big Data analysis with Spark
  • Programming with PySpark RDDs
  • PySpark SQL & DataFrames
  • Machine Learning with PySpark MLlib

Prerequisites

  • Python
  • Deep Learning Basics
  • SQL
  • Pandas (DataFrame)

Syllabus

  • Introduction to Big Data analysis with Spark
  • Programming with PySpark RDDs
  • PySpark SQL & DataFrames
  • Machine Learning with PySpark MLlib

Course Features

Lectures: Hands-on

Duration: 15 hours

Students: 100

Certificate: yes

Cost: donation

Skill level: intermediate

Instructor

QASIM HASSAN

Machine Learning Engineer @Omdena
