Big Data Draft

Page 1

ROI TRAINING Big Data & Data Analytics The amount of data in our world has been exploding, and analyzing large data sets (“Big Data”) has become a key basis of competition. Organizations that leverage this data will gain competitive advantages, see productivity growth, and increase innovation. Analytics is the discovery and communication of meaningful patterns in data. Through analytics a company can confirm actions are having the desired effect and - more importantly - make predictions about future customer needs and industry trends. ROI’s highly customizable Big Data and Data Analytics curriculum provides thought provoking, real-world training for business analysts, data architects, data warehouse experts, developers, and other business professionals. Course

Course 434: Hadoop for MapReduce Applications Students who attend Hadoop for MapReduce Applications will leave the course armed with the skills they require to develop reliable, scalable, distributed applications using Apache Hadoop. The course is useful for programmers who need to create MapReduce applications and for data architects who can program in scripted languages like Python or Perl, or perform analysis using statistical packages such as R. It is also beneficial for system programmers and anyone familiar with writing scripts. Finally, it is useful for system and data administrators tasked with maintaining Hadoop applications since it provides them a deep understanding of how Hadoop applications communicate and share data.

For additional information: customer.service@trainingbyroi.com 1-800-ROI-9877

Course 473: Scala Programming This course provides a hands-on introduction to the Scala programming language. It shows how Scala can be used for both object-oriented programming and functional programming. Although the Scala interpreter (REPL) is used to demonstrate various concepts, the course emphasizes Scala development using tools that would be typically used in a production Java environment. This includes build tools such as SBT, Maven and Gradle as well as development environments such as Eclipse and IntelliJ. In addition, a test driven development (TDD) approach to unit testing Scala code is also used.

Course 660: Hadoop Cluster Building and Administration The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. This course is one of the three Hadoop courses provided by ROI Training. The course covers installation, growth, and management of Hadoop Clusters. Upon completion of the course students will have installed, configured, grown, loaded, and managed Hadoop distributed processing. Further the course covers the problems of lost data nodes and nameserver. The course is approximately 70% hands-on training.

Course 661: Hadoop Programming Using Streams & Pipes Hadoop is written in Java and Java is one of the methods for developing Map/Reduce programs. It is not the only programming interface. Hadoop provides a streaming interface for any executable program and a Hadoop pipes interface for a C/C++ interface. The pipes interface is ”SWIGable” allowing almost any programming language to be used. This course is an in depth, hands-on, exploration of these interfaces. The course uses a completed case study explored in class and a hands-on exercise case study to examine and understand the details of the interfaces.


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.