Apache Hadoop – The Big Name In The Big Data World
Java/J2EE Capabilities
What is Apache Hadoop?
• A capable data management framework for Big Data
• Open source software for distributed processing of large data sets
• Offers distributed parallel processing across clusters, scaling from a single server to thousands of machines
• Processes and analyzes thousands of terabytes of data
• Apt framework to increase business efficiency and maximize ROI
• Latest release: 2.6.0, on 18 November 2014
Main Modules of Hadoop
• Hadoop Common: common utilities that support the other Hadoop modules and subprojects; includes the file system, RPC, and serialization libraries
• Hadoop Distributed File System (HDFS): a distributed file system that gives applications access to their data; spans all nodes in a Hadoop cluster, linking them into one large file system; Java-based, providing scalable and reliable data storage
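To make "one file system spanning all nodes" more concrete, here is a minimal sketch of how a file might be cut into fixed-size blocks and how each block might be copied to several nodes. It is not HDFS code: the function names are hypothetical, the 128 MB block size and replication factor of 3 are assumed values, and the round-robin placement is a deliberate simplification of whatever placement policy a real cluster uses.

```python
BLOCK_SIZE = 128 * 1024 * 1024  # assumed block size in bytes (128 MB)
REPLICATION = 3                 # assumed number of copies kept per block


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return the byte length of each block for a file of file_size bytes."""
    full_blocks, remainder = divmod(file_size, block_size)
    # Every block is full-sized except possibly the last one.
    return [block_size] * full_blocks + ([remainder] if remainder else [])


def place_blocks(num_blocks, nodes, replication=REPLICATION):
    """Assign each block to `replication` distinct nodes, round-robin.

    Hypothetical placement rule for illustration only.
    """
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    placement = {}
    for block_id in range(num_blocks):
        # Consecutive offsets modulo len(nodes) never repeat a node
        # as long as replication <= len(nodes).
        placement[block_id] = [
            nodes[(block_id + offset) % len(nodes)]
            for offset in range(replication)
        ]
    return placement
```

Because every block lives on more than one node in this sketch, losing a single node still leaves a copy of each block elsewhere, which is the intuition behind the "reliable data storage" claim.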
• Hadoop YARN: used for job scheduling and cluster resource management; splits the two roles of the JobTracker, namely resource management and job scheduling, into separate components
• Hadoop MapReduce: a system for parallel processing of large data sets; a framework that assigns work to the nodes in a cluster; used to write applications that reliably process large amounts of data across many hardware nodes
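The MapReduce programming model can be illustrated with a small word count. The sketch below runs in a single Python process and does not use the Hadoop API; the function names are hypothetical and only mirror the three conceptual phases (map, shuffle, reduce). On a real cluster the framework would run the map and reduce steps on different nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
```

Each call to `map_phase` depends only on its own line, and each call to `reduce_phase` depends only on its own key, which is what lets a framework spread the work across many nodes.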
Other Hadoop-Related Projects at Apache
• Avro
• Ambari
• Cassandra
• Chukwa
• HBase
• Mahout
• Hive
• Tez
• Pig
• ZooKeeper
• Spark
Why Hadoop?
• Next-generation real-time analytics
• Rich ecosystem
• Scale-out storage
• Reduced cost of ownership
• Scalability, flexibility, and reliability
• Fault tolerance
• Simple programming models
THANK YOU
We look forward to a mutually beneficial association and assure you of our best services always.
SPEC INDIA
"SPEC House", Parth Complex, Swastik Cross Road, Navrangpura, Ahmedabad-380 009, INDIA.
Tel.: +91-79-26404031 to 34
VoIP: +1-908-450 9862
Instant Messengers spec.bd | spec_india | bd.spec specindia2009 | specindia.bd
e-mail: lead@specindia.com URL: http://www.spec-india.com