Big Data & Hadoop Development
Learn the most comprehensive Hadoop course to become a big data Expert
How it Works…
Slide 2
Live on-Line class Class recording in Learning management systems (LMS) Module wise Quizzes, Coding Assignments 24*7 on-demand technical support Project work on large Datasets Online Certification exam Lifetime access to the LMS
http://www.xoomtrainings.com/
Course Topics…. Introduction to Big data and Hadoop • Understanding Big Data • Challenges in processing Big Data • 3V Characteristics (Volume, Variety and Velocity) • Brief history of Hadoop • How Hadoop addresses Big Data? • HDFS and MR • Hadoop echo system HDFS (Hadoop Distributed File System) • HDFS Overview and Architecture • HDFS Keywords like Name Node, Data Node, Heart Beat etc. • Configuring HDFS • Data Flows (Read and Write) • HDFS Permissions and Security • HDFS commands • Rack Awareness 5 Daemons processes Map Reduce • Map Reduce Basics • Map Reduce Data Flow • Word count Example solving • Algorithms for simple and complex problems • Hadoop Streaming Developing a Map Reduce Application • Setting up working environment • Custom Data types (Writable and Custom Key types) • Input and Output file formats • Driver, Mapper and Reducer Code
Slide 3
How Map Reduce works? • Classic Map Reduce (Map Reduce I) • YARN (Map Reduce II) • Job Scheduling • Shuffle and Sort • Failures • Oozie Workflows • Hands-on Exercises Hadoop Echo Systems PIG • Overview of PIG • Installation and running PIG • PIG Latin • Loading and storing data • Hands-on HIVE • Overview of HIVE • Installation and running HIVE • HiveQL • Tables • Hands-on HBASE • Overview of HBASE Installation CLinets (Avro, REST, Thrift) • Hands-on SQOOP • Overview of SQOOP • Solving Case studies
http://www.xoomtrainings.com/
Big Data – How it is ? What it means ? Velocity: Data flown continues, time sensitive, streaming flow Batch, Real time, Streams, Historic
Volume: Big Data comes in on large scale. Its on TB & even PB Records, Transaction, Tables, Files
DATA VALUE
Variety: Big Data extends structured, Including semi-structured and unstructured data of all variety; Structured, Semi and un structured
Slide 4
Veracity: Quality, Consistency, reliability and provenance of data Good, undefined, inconsistency and incomplete
http://www.xoomtrainings.com/
Understand various types of data that can be stored in Hadoop Perform Data Analytics using PIG & HIVE
Slide 5
http://www.xoomtrainings.com/
Thank You
For Free Demo you can call us at USA : +1-610-686-8077