080-42091111, +91-8892499499

Hadoop Administration

COURSE HIGHLIGHTS

Trainers are certified working professionals with real-time project experience.

Main focus is on hands-on sessions.

Affordable course fee.

Course aligned to Cloudera Certification.

Flexible timings for working people.

100% money back guarantee.

100% placement assistance.

Post training support.

Lifetime validity for attending classes.

Guidance in Resume Preparation.

Weekend Batch (2 Months): SAT & SUN (8-12pm)
Course Fee: 16,000/-
New Batch starts on:
Free Demo Session scheduled on:
Ph: 8892499499 | Web: www.dvstechnologies.in | Mail: dvs.training@gmail.com

Vacancy

Data Scientist
Big Data Visualizer
Big Data Research Analyst
Big Data Engineer
Big Data Architect
Big Data Analyst

  1. What is Big Data?
  2. Where does Big Data come from?
  3. What are Big Data use cases?
  4. How is data growing?
  5. What are the 3 V’s of Big Data?
  6. What are the challenges in Big Data Storage & Access?

  1. Why Hadoop?
  2. What is Hadoop?
  3. What is the history of Hadoop?
  4. What are Hadoop distributions?
  5. What are Hadoop components?
  6. Hadoop Architecture

  1. Understanding File System
  2. Understanding Hadoop Distributed File System (HDFS)
  3. HDFS Replication
  4. HDFS Components
    1. NameNode
    2. DataNode
    3. Secondary NameNode
  5. HDFS Features
  6. HDFS Design Assumptions
  7. Formatting NameNode
  8. Communication between Nodes in a Cluster
  9. How is metadata maintained in Hadoop?
  10. Types of Metadata
  11. What is HDFS Block Report?
  12. Checkpointing Mechanism
  13. Metadata Memory Allocation
  14. Anatomy of a File Write into & Read from HDFS
  15. HDFS Block Replication Strategy
  16. How to deal with Data Corruption?
  17. HDFS Rebalancing & Space Reclamation
  18. File Systems supported by Hadoop
  19. Compression Formats supported by Hadoop
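
As an illustration of the block and replication topics above, here is a minimal Python sketch of how a file's size maps to HDFS blocks and raw storage. It assumes a 128 MB block size (the Hadoop 2.x default; Hadoop 1.x defaulted to 64 MB) and a replication factor of 3. The function name `hdfs_footprint` is hypothetical and not part of any Hadoop API.

```python
def hdfs_footprint(file_size_bytes, block_size_bytes=128 * 1024 * 1024, replication=3):
    """Return (block_count, raw_bytes_stored) for a single file.

    Assumptions: the last block is not padded to the full block size,
    so raw storage is simply file size times the replication factor.
    """
    if file_size_bytes <= 0:
        return 0, 0
    # Integer ceiling division: a partial last block still counts as a block.
    block_count = (file_size_bytes + block_size_bytes - 1) // block_size_bytes
    raw_bytes_stored = file_size_bytes * replication
    return block_count, raw_bytes_stored


if __name__ == "__main__":
    one_gib = 1024 ** 3
    blocks, raw = hdfs_footprint(one_gib)
    print(f"1 GiB file: {blocks} blocks, {raw / one_gib:.0f} GiB raw storage")
```

Each block is one more object for the NameNode to track in memory, which is why many small files cost more metadata than one large file of the same total size.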

  1. Understanding Hadoop Installation Prerequisites
  2. Building Hadoop Nodes
  3. Installation of Hadoop 1.x (Pseudo-Distributed Mode)
  4. Installation of Hadoop 1.x (Fully Distributed Mode)
  5. Commission and Decommission of nodes
  6. Understanding VERSION, FSImage, EditLog
  7. Hadoop Admin Commands (FSCK & Block Scanner Report)
  8. HDFS Replication (by XML file, by Host, by individual file)
  9. Increase & Decrease Replication
  10. Hadoop Rack Awareness
  11. Default Hadoop Settings

  1. MapReduce Introduction
  2. How does MapReduce work?
  3. Communication between JobTracker and TaskTracker
  4. Anatomy of a MapReduce Job Submission
  5. Hadoop Schedulers
    1. FIFO Scheduler
    2. Fair Scheduler
    3. Capacity Scheduler

  1. Setting up Mappers & Reducers
  2. Setting up Fair Scheduler
  3. Setting up Capacity Scheduler
  4. Setting up topology
  5. Setting up Logs and Logging mechanism
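
For the topology item above, rack awareness is commonly set up by pointing Hadoop at an executable script (property `net.topology.script.file.name` in Hadoop 2.x, `topology.script.file.name` in Hadoop 1.x). Hadoop passes one or more host names or IP addresses as arguments and reads one rack path per argument from standard output. The sketch below is a minimal example of such a script; the IP addresses and rack names in `RACKS` are made up for illustration.

```python
#!/usr/bin/env python3
import sys

# Hypothetical host-to-rack mapping; replace with the real cluster layout.
RACKS = {
    "10.0.1.11": "/dc1/rack1",
    "10.0.1.12": "/dc1/rack1",
    "10.0.2.21": "/dc1/rack2",
}

# Rack name Hadoop itself uses for nodes with no known location.
DEFAULT_RACK = "/default-rack"


def resolve(hosts):
    """Return one rack path per host, in the same order as the input."""
    return [RACKS.get(host, DEFAULT_RACK) for host in hosts]


if __name__ == "__main__":
    # Hadoop may pass several hosts in a single invocation.
    print(" ".join(resolve(sys.argv[1:])))
```

The script must be executable by the user running the NameNode (and the JobTracker or ResourceManager), and an unknown host should still get a rack path rather than an empty answer.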

  1. Hadoop 2.X Architecture
  2. What is Edge/Gateway/Connecting Node?
  3. What is ZooKeeper?
  4. Difference between Hadoop 1.X and Hadoop 2.X
  5. Understand the architecture of YARN
  6. Understand the components of the YARN ResourceManager
  7. Demonstrate the relationship between NodeManagers and ApplicationMaster
  8. Demonstrate the relationship between ResourceManager and ApplicationMaster
  9. Explain the relationship between Containers and ApplicationMasters
  10. Job Flow in YARN
  11. NameNode High Availability
    1. Using Shared Edits
    2. Using ZooKeeper Quorum

  1. NameNode High Availability using NFS Shared Edits & ZooKeeper
  2. NameNode High Availability using Journal Nodes & ZooKeeper
  3. Resource Manager High Availability
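
Both a ZooKeeper ensemble and a set of Journal Nodes rely on a majority of members being available, which is why odd node counts are usually recommended. A minimal Python sketch of that arithmetic follows; the function names are hypothetical.

```python
def quorum_size(node_count):
    """Smallest number of nodes that forms a strict majority."""
    return node_count // 2 + 1


def tolerated_failures(node_count):
    """How many nodes can fail while a majority is still reachable."""
    return node_count - quorum_size(node_count)


if __name__ == "__main__":
    for n in (3, 4, 5):
        print(f"{n} nodes: quorum {quorum_size(n)}, "
              f"tolerates {tolerated_failures(n)} failure(s)")
```

Going from 3 to 4 nodes raises the quorum size without tolerating any extra failure, so the fourth node adds cost but no resilience.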

  1. Understanding Hardware Components
    1. Master Hardware
    2. Slave Hardware
    3. CPU, I/O, Network
  2. Plan your cluster growth
  3. Managing Users & Groups
  4. Cluster sharing across multiple use cases

  1. Understand the minimum hardware and software requirements
  2. Understand the Cloudera Architecture
  3. Understand how to install CDH using Cloudera Manager
  4. Understand complete deployment layout
  5. Understand how to configure and manage different services
  6. Understand different configuration parameters

  1. Cloudera Cluster Installation
  2. Cloudera Manager Walkthrough

  1. Understand the minimum hardware and software requirements
  2. Understand the Ambari Architecture
  3. Understand how to install Ambari & Hortonworks
  4. Understand complete deployment layout
  5. Understand how to configure and manage different services
  6. Understand different configuration parameters

  1. Hortonworks Cluster Installation
  2. Hortonworks Ambari Walkthrough

  1. Understand the minimum hardware and software requirements
  2. Understand the MCS Architecture
  3. Understand various services configured in MapR

  1. Monitor using the CM or Ambari UI
  2. Back up and recover Hadoop data
  3. Use Hadoop snapshots

  1. Understand security concepts
  2. Understanding & Configuring Hadoop ACLs
  3. Understanding & Configuring Kerberos
  4. Understanding & Configuring Knox & Ranger

  1. Introduction to Sqoop
  2. Introduction to Flume
  3. Introduction to Pig
  4. Introduction to Hive
  5. Introduction to HBase
  6. Introduction to Oozie

  1. Hadoop Cluster Backup
  2. Hadoop Cluster Upgrade
  3. OS & Hadoop Patching

  1. Hadoop Performance Tuning from OS Level
  2. Hadoop Performance Tuning from HDFS Level (Storage Layer)
  3. Hadoop Performance Tuning from MR/YARN Level (Processing Layer)

  1. Day to Day Admin Activities
  2. Frequently Occurring Issues
  3. Roles and Responsibilities

Note: Real Time scenarios, FAQs, Troubleshooting & Issue handling