Intro to hadoop and mapreduce for beginners

Intro to Hadoop and MapReduce for Beginners The objective informed is to offer a 10,000 feet opinion of Hadoop for those who know next to nothing about it and therefore you can learn hadoop step by step. This post is not developed to get you prepared for Hadoop growth, but to offer a sound understanding for you to take the next measures in mastering the technology.

Lets get down to it:

Hadoop is an Apache Application Platform venture that significantly provides two things:

An allocated file system known as HDFS (Hadoop Distributed File System)

A structure and API for developing and operating MapReduce jobs

some hyperlinks for your information:

HDFS

HDFS is organized in the same way to a normal Unix file system except that detailed storage space is shipped across several devices. It should not have been an alternative to a normal file system, but rather as a file system-like part for big allocated techniques to use. It has in designed systems to deal with device problems, and is enhanced for throughput rather than latency.

There are two and a half types of device in a HDFS cluster:

Turn static files into dynamic content formats.

Create a flipbook