Hadoop




The following topics will be covered in our HADOOP Online Training:

Copyright @ 2015 Learntek. All Rights Reserved.



Hadoop Administration Training – Hadoop Cluster Administration

• Hadoop Administration Training: Learning Objectives – In this module, you will understand what Big Data and Apache Hadoop are, how Hadoop solves Big Data problems, the Hadoop cluster architecture, the MapReduce framework, Hadoop data-loading techniques, and the role of a Hadoop Cluster Administrator.



Topics:
• Introduction to Big Data.
• Use cases where Big Data is used.
• Introduction to the Hadoop framework.
• HDFS file system.
• Hadoop architecture.
• MapReduce framework.
• A typical Hadoop cluster.
• Hadoop Cluster Administrator: roles and responsibilities, current job market.



Hadoop Architecture and Cluster setup:

• Learning Objectives – After this module, you will understand the multiple Hadoop server roles, such as NameNode and DataNode, and MapReduce data processing. You will also understand Hadoop 2.x cluster setup and configuration, setting up Hadoop clients using Hadoop 2.x, and important Hadoop configuration files and parameters.



Hadoop Administration Training – Topics:
• Hadoop server roles and their usage.
• Hadoop installation and initial configuration.
• Understanding NameNode and DataNode communication channels.
• Setting up a single-node cluster.
• NameNode metadata details.
• Setting up a multi-node cluster: deploying Hadoop in pseudo-distributed mode.
• Setting up passphraseless access.
• Rack awareness.
• Anatomy of write and read.
• Replication pipeline, data processing.
• Installing Hadoop clients.
• Scalability: best practices.
• Adding nodes to and removing nodes from the cluster.



Hadoop Cluster: Planning and Managing:

• Learning Objectives – In this module, you will understand Planning and Managing a Hadoop Cluster, Hadoop Cluster Monitoring and Troubleshooting, Analyzing logs, and Auditing. You will also understand Scheduling and Executing MapReduce Jobs, and different Schedulers.



Topics:
• Planning the Hadoop cluster.
• Cluster sizing.
• Hardware and software considerations.
• Managing and scheduling jobs.
• Types of schedulers in Hadoop: FIFO, Fair Scheduler.
• Setting up queues and pools for jobs.
• Configuring the schedulers and running MapReduce jobs.
• Cluster monitoring and troubleshooting.



Value Adds (as per latest industry standards):
• Running Hadoop on the cloud: connectivity to and administration on AWS.
• Installation and administration of Cloudera Manager and HDP (free version).
• Cluster monitoring and troubleshooting.


