The demand for Big Data Hadoop professionals is increasing across the globe and it’s a great opportunity for the IT professionals to move into the most sought technology in the present day world. ExcelR offers Big data & Hadoop course in Bangalore and instructor led live online session delivered by industry experts who are considered to be the best trainers in the industry. The training is studded with loads of practical assignments, case studies and project work, which ensures hands on experience for the participants. The training program is meticulously designed to become a professional of Big data Hadoop developer and crack the job in the space of Big Data.
Duration: 40 – 45 hrs
Timings: Week days 1-2 Hours per day (or) Weekends: 2-3 Hours per day
Method: Online/Classroom Training
Study Material: Soft Copy
What is Big Data?
Introduction to Apache Hadoop.
Flavors of Hadoop: Big-Insights, Google Query etc..
Hadoop Eco-system components: Introduction
Understanding Hadoop Cluster
Why replication factor 3?
Discuss NameNode and DataNode.
Discuss JobTracker and TaskTracker.
Typical workflow of Hadoop application
Assignment of Blocks to Racks and Nodes.
Block Management Service.
Anatomy of File Write.
Anatomy of File Read.
Heart Beats and Block Reports
Discuss Secondary NameNode and Usage of FsImage and Edits log.
Map Reduce Overview
Best Practices to setup Hadoop cluster
Need of *-site.xml
Map Reduce Framework
Why Map Reduce?
Use cases where Map Reduce is used.
Hello world program with Weather Use Case.
Setup environment for the programs.
Possible ways of writing Map Reduce program with sample codes find the best code and discuss.
Configured, Tool, GenericOptionParser and queues usage.
Demo for calculating maximum temperature and Minimum temperature.
Limitations of traditional way of solving word count with large dataset.
Map Reduce way of solving the problem.
Complete overview of MapReduce.
Parts of Map Reduce
Apache Hadoop– Single Node Installation Demo
Apache Hadoop – Multi Node Installation Demo
Namenode – format.
Add nodes dynamically to a cluster with Demo
Remove nodes dynamically to a cluster with Demo.
Hadoop cluster modes.
Psuedo distributed Mode
Fully distributed mode.
Map Reduce Anatomy
Map Reduce Failure Scenarios
Input File Formats
Output File Formats
Custom Input Formats
Custom keys, Values usage of writables.
Walkthrough the installation process through the cloudera manager.
Example List, show sample example list for the installation.
Demo on teragen, wordcount, inverted index, examples….