Big Data
Duration 30 hours Fee $899.00
Introduction to Big Data and Hadoop
What is Big Data?Dimensions of Big data 6 Vs
Why do we bother Big Data?
Challenge of Existing Systems
What is Hadoop?
Benefits of History of Hadoop
Characteristics of Hadoop
Hadoop Customers and their use cases
Popular Hadoop Vendors and Distributions
Hadoop Certifications details
Q/A
Architecture of Hadoop 1.x and Hadoop 2.x
What is Master-Slave Architecture?Hadoop 1.0 and 2.0 Architecture
Concepts of Blocks, Replication and Rack Awareness
File read and write anatomy
Coherency model
Q/A and Exercises
Installation & Configuration of Hadoop
Installation options and Pre-RequisitesCloudera Distributed Hadoop CDH) installation
Hadoop Installation Modes
Hadoop Configuration File
Basic Linux and Hadoop Commands Demo
MapReduce
What is MapReduce?Why MapReduce?
Benefits of using MapReduce
Word Count example using MapReduce and Eclipse
More Hands-on using real world datasets such as (Number of sub-patents), Calculate
max Temperature, Find Hot and cold days, Word size and word count, health
Care Datasets
Real-world casestudies
Assignments
Q/A and Quiz
Advanced Mapreduce
Partitioners and CombinersMap side and Reduce side Joins
Hands-on
Assignments
Q/A and Quiz
PIG
What is PIG?Why Pig?
Who uses Pig?
Use cases of Pig in real World
Setting up and Starting up Pig
Pig Handson Loading, Querying and Analyzing data
Pig Scripts and UDF Concepts and Handson
Q/A and Quiz
HIVE
What is Hive?Why Hive?
Who uses Hive?
Difference between Pig and Hive
Use cases of Hive in real World
Setting up and Starting up Hive Hive Hands-on- Loading, Querying and Analyzing data
Hive UDF
Partitioning and Bucketing in Hive
Play with Hive advance parameters such as Dynamic Partitioning etc.
Usage scenarios of using Pig And Hive Together
Advance Hive Codes
Q/A and Quiz
International Students Second Career

Why Big Data
When big data is effectively and efficiently captured, processed, and analyzed, companies are able to gain a more complete understanding of their business, customers, products, competitors, etc. which can lead to efficiency improvements, increased sales, lower costs, better customer service, and/or improved products and services.

Admission Prerequisites
Any Graduate professionals with knowledge in Java programming background are eligible for learning Big Data Hadoop Training. A basic knowledge of any programming language like Java, C or Python and Linux is always an added advantage and also strong knowledge on Concepts of OOPs

This course will be beneficial for:
Software Developers and Architects
Professionals with analytics and data management profile
Business Intelligence Professionals
Project Managers
Data Scientists
Professionals with Business Intelligence, ETL and data warehousing background
Professionals from testing and mainframe background
New - Post Graduate Diploma in Logistic and supply Chain Management
Unemployed students may eligible for Scholarship for Non-Vocational Programs. For details contact at 647-348-3622.
Contact Us
Don Mills Career College
Health, Business and Technology
747 Don Mills Road, Unit # 204 & 220
Toronto, Ontario, Canada.
Tele: 647 348 3622
Admission