Big Data Analytics (SF)

This five-day instructor-led course provides participants with concepts beyond the Big Data knowledge to get a head start with Hadoop.

Course Fee (with 9% GST)

Full Course Fees: $3,815

Self-Sponsored
SG Citizen/PR aged ≥ 21 years: $1,740.50
SG Citizen aged ≥ 40 years: $1,150.50

Co-Sponsored (SME)
SG Citizen/PR aged ≥ 21 years: $1,150.50
SG Citizen aged ≥ 40 years: $1,150.50

Co-Sponsored (MNC)
SG Citizen/PR aged ≥ 21 years: $1,740.50
SG Citizen aged ≥ 40 years: $1,150.50

Overview

Course Reference Number: TGS-2018501865

This five-day instructor-led course provides participants with concepts beyond the Big Data knowledge to get a head start with Hadoop. This course will also teach about data analysis using Hadoop Ecosystem for data analysts, business intelligence specialists, developers and system architects.

Associated CertificatioN(s)

Upon completion of the course, participants can take the exam on Cloudera Certified Associate (CCA) Data Analyst, MapR: Certified Data Analyst (MCDA) or Hortonworks HDP Certified Developer (HDPCD): Pig and Hive. These certifications are great differentiators to establish yourself as a leader in the field, providing employers and customers with tangible evidence of your skills and expertise.

Prerequisites

Prior knowledge of SQL is highly recommended. Linux knowledge will be helpful.

IMportant notes

All Trainees must take note of the following:
  1. Must attend at least 75% of the course before being eligible to take the assessments.
  2. Dynamic QR Code Attendance Taking:
    a. Scan the QR Code that will be displayed by the Trainer on each session. Use your SingPass App to scan and submit your attendance. If you fail to do so, you will be deemed absent from that session.
    b. The QR Code is only accessible on:
    • Morning Session: between 9.00 am to 1.00 pm.
    • Afternoon Session: between 2.00 pm to 6.00 pm.
    c. Please take the attendance one at a time as the system can only register you one by one.
  3. Sign daily on the Attendance Sheet as a backup if any technical glitch happens.
  4. Submit Course Evaluation by the end of each module to help us improve the course and your future learning experience with us.
The course completion requirements for this course as follow:
  1. Attended at least 75% of the course.
  2. Declared as competent during the assessments.

Who Should Attend?

This course is intended for executives, managers, consultants, business analysts, operation personnel, programmers, architects, administrators and data analysts who want a foundational overview of the key components required to effectively understand and analyse Big Data. Familiarity working with computers and business applications is assumed. Programming experience is beneficial but not required.

Course Outline

  • Why we need Hadoop
  • Why Hadoop is in demand in market nowadays
  • Where expensive SQL based tools are failing
  • Key Points, Why Hadoop is leading tool in current IT Industry Definition of Big Data
  • Hadoop nodes
  • Introduction to Hadoop Release-1
  • Hadoop Daemons in Hadoop Release-1
  • Introduction to Hadoop Release-2
  • Hadoop Daemons in Hadoop Release-2
  • Hadoop Cluster and Racks
  • Hadoop Cluster Demo
  • New projects on Hadoop
  • How Open Source tools is capable to run jobs in lesser time Hadoop Storage – HDFS (Hadoop Distributed file system) Hadoop Processing Framework (Map Reduce / YARN) Alternates of Map Reduce
  • Why NOSQL is in much demand instead of SQL
  • Distributed warehouse for HDFS
  • Hadoop Ecosystem and its usages
  • Data import/Export tools
  • Hadoop installation
  • Introduction to Hadoop FS and Processing Environment’s UIs How to read and write files
  • Basic Unix commands for Hadoop
  • Hadoop FS shell
  • Hadoop releases practical
  • Hadoop daemons practical
  • Pig-UDFs
  • Pig Use cases
  • Pig Assignment
  • Complex Use cases on Pig
  • Real time scenarios on Pig
  • When we should use Pig
  • When we shouldn’t use Pig
  • Hive Introduction
  • Meta storage and meta store
  • Introduction to Derby Database
  • Hive Data types
  • HQL
  • DDL, DML and sub languages of Hive
  • Internal, external and Temp tables in Hive
  • Differentiation between SQL based Datawarehouse and Hive
  • Hive releases
  • Why Hive is not best solution for OLTP OLAP in Hive
  • Partitioning
  • Bucketing
  • Hive Architecture
  • Thrift Server
  • Hue Interface for Hive
  • How to analyze data using Hive script Differentiation between Hive and Impala UDFs in Hive
  • Complex Use cases in Hive
  • Hive Advanced Assignment
  • How to load data streaming data without fixed schema
  • How to load unstructured and semi structured data in Hadoop Introduction to Flume
  • Hands-on on Flume
  • How to load Twitter data in HDFS using Hadoop
  • Introduction to Oozie
  • How to schedule jobs using Oozie
  • What kind of jobs can be scheduled using Oozie
  • How to schedule jobs which are time based
  • Hadoop releases From where to get
  • Hadoop and other components to install
  • Introduction to YARN
  • Significance of YARN
  • Introduction to NOSQL
  • Why NOSQL if SQL is in market since several years
  • Databases in market based on NOSQL CAP Theorem
  • ACID Vs. CAP
  • OLTP Solutions with different capabilities
  • Which Nosql based solution is capable to handle specific requirements Examples of companies that uses NOSQL based databases
  • HBase Architecture of column families
  • Introduction to Spark
  • Basics Features of SPARK and Scala available in Hue Why SPARK demand is increasing in market
  • How can we use Spark with Hadoop Eco System Datasets for practice purpose
  • YARN
  • Emerging Technologies of Big Data
  • Emerging use cases e.g. IoT, Industrial Internet, New Applications
  • Certifications and
  • Job Opportunities

Get Pricing and Brochure

More Like This

Get the course Brochure & Pricing

Our course consultant will contact you within 1 working day

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

Get in touch with our consultant