080-42091111, +91-8892499499

Hadoop Administration

COURSE HIGHLIGHTS

Trainers are certified, working industry professionals.

Main focus is on hands-on sessions.

Affordable course fee.

Course aligned to Cloudera Certification.

Flexible timings for working people.

100% money back guarantee.

100% placement assistance.

Post training support.

Lifetime validity for attending classes.

Guidance in Resume Preparation.

Vacancy

Data Scientist
Big Data Visualizer
Big Data Research Analyst
Big Data Engineer
Big Data Architect
Big Data Analyst

  1. What is Big Data?
  2. Where does Big Data come from?
  3. What are Big Data use cases?
  4. How is data growing?
  5. What are the 3 V’s of Big Data?
  6. What are the challenges in Big Data Storage & Access?

  1. Why Hadoop?
  2. What is Hadoop?
  3. Who is using Hadoop?
  4. What is the history of Hadoop?
  5. What are Hadoop distributions?
  6. What are Hadoop components?
  7. Hadoop Architecture

  1. Understanding File System
  2. Understanding Hadoop Distributed File System (HDFS)
  3. HDFS Features
  4. HDFS Design Assumptions
  5. File Systems supported by Hadoop
  6. How is a file stored on HDFS?
  7. How is metadata maintained in Hadoop?
  8. Checkpointing Mechanism
  9. Metadata Memory Allocation
  10. Communication between NameNode and DataNode
  11. Anatomy of a File Write into & Read from HDFS
  12. Hadoop Replication
  13. HDFS Block Replication Strategy
  14. How to deal with Data Corruption?
  15. HDFS Rebalancing & Space Reclamation
  16. Compression Formats supported by Hadoop

  1. Understanding Hadoop Installation Prerequisites
  2. Building Hadoop Nodes
  3. Installation of Hadoop 1.X (Pseudo-Distributed Mode)
  4. Installation of Hadoop 1.X (Fully Distributed Mode)
  5. Understanding VERSION, FSImage, and EditLog
  6. Hadoop Admin Commands (FSCK & Block Scanner Report)
  7. HDFS Replication (by XML file, by Host, by individual file)
  8. Increase & Decrease Replication
  9. Hadoop Rack Awareness
  10. Default Hadoop Settings

  1. MapReduce Introduction
  2. How does MapReduce work?
  3. Communication between JobTracker and TaskTracker
  4. Anatomy of a MapReduce Job Submission
  5. Hadoop Schedulers
  6. FIFO Scheduler
  7. Fair Scheduler
  8. Capacity Scheduler

  1. Setting up Mappers & Reducers
  2. Setting up Fair Scheduler
  3. Setting up Capacity Scheduler
  4. Setting up topology
  5. Setting up Logs and Logging mechanism

  1. Hadoop 2.X Architecture
  2. Difference between Hadoop 1.X and Hadoop 2.X
  3. Understand the architecture of YARN
  4. Understand the components of the YARN ResourceManager
  5. Demonstrate the relationship between NodeManagers and ApplicationMasters
  6. Demonstrate the relationship between ResourceManagers and ApplicationMasters
  7. Explain the relationship between Containers and ApplicationMasters
  8. Job Flow in YARN

  1. Using Shared Edits
  2. Using ZooKeeper Quorum

  1. NameNode High Availability using Shared Edits
  2. NameNode High Availability using ZooKeeper Quorum

  1. Understanding Hardware Components
  2. Master Hardware
  3. Slave Hardware
  4. CPU
  5. I/O
  6. Network
  7. Plan your cluster growth
  8. Managing Users & Groups
  9. Cluster sharing across multiple use cases

  1. Understand the minimum hardware and software requirements
  2. Understand the Cloudera Architecture
  3. Understand how to install CDH using Cloudera Manager
  4. Understand differences between master and slave services
  5. Understand complete deployment layout
  6. Understand how to configure and manage different services
  7. Understand different configuration parameters

  1. Monitor using the CM UI
  2. Commissioning and decommissioning of nodes
  3. Back up and recover Hadoop data
  4. Use Hadoop snapshots
  5. Understand rack awareness and topology
  6. Understand NameNode high availability
  7. Understand ResourceManager high availability
  8. Use the “hdfs haadmin” commands

  1. Understand security concepts
  2. Configure Kerberos
  3. Configure CDH authentication and authorization

  1. Introduction to Sqoop
  2. Introduction to Pig
  3. Introduction to Hive
  4. Introduction to HBase
  5. Introduction to Oozie

  1. Hadoop Cluster Backup
  2. Hadoop Cluster Upgrade
  3. OS & Hadoop Patching

  1. Day to Day Admin Activities
  2. Frequently Occurring Issues
  3. Roles and Responsibilities

Note: The course also covers real-time scenarios, FAQs, troubleshooting, and issue handling.