What is big data- understanding of Data Analysis, Big Data and Statistics

Page 1

What is Big Data? Big Data Certification


We live in a world of data. This data is generated by transactions, feedback, and real-time interaction with customers, partners, suppliers, and employees.

Private and Confidential

2


5 V’s of big data:

Volume: This refers to the large amount of data generated every moment. Think of all the emails, Twitter messages, photos, video clips and sensor data that is generated and shared every second. Data is not just in terabytes, but zettabytes or brontobytes of data is generated. On Facebook alone people send 10 billion messages per day, click the like button 4.5 billion times and upload 350 million pictures each and every day. Taking all the data generated in the world between the beginning of time and the year 2000, it is the same amount that is now generated every minute. This keeps making data sets too voluminous to store and analyze using legacy and old database systems. With big data technology one can now store and analyse these data sets with the help of distributed systems, where parts of the data is stored at different places, connected by networks and brought together by Big Data software.

Private and Confidential

3


5 V’s of big data:

Velocity: This refers to the speed at which data is generated and the frequency at which data moves around. Think of social media messages going viral in minutes, Frequency at which credit card transactions are checked for fraudulent activities or the milliseconds taken by trading systems to analyze social media networks to interpret signals that trigger hints to buy or sell shares. Big data tools allows to analyze the data while it is being generated without the need of first putting it into databases systems and then analyzing it.

Private and Confidential

4


5 V’s of big data:

Variety: This means the different types of data we can use now. In the past our major focus was on structured data that properly fits into tables or relational databases such as financial data (for example, sales by product or region). 80 percent of the world’s data is unstructured format and therefore can’t easily be put into tables or relational databases—for example photos, video sequences or social media updates. With big data tools we can now harness differed types of data like messages, social media conversations, photos, sensor data, video or voice recordings and bring and analyze them together with more traditional, structured data.

Private and Confidential

5


5 V’s of big data:

Veracity: This means the messiness or trustworthiness of the data. With many forms and types of big data, quality and accuracy are less controllable, for example Twitter posts with hashtags, typos and colloquial speech, abbreviations etc. Big data and analytics tools allows to work with these types of data. The volume often causes for the lack of quality or accuracy, but entire volume of fast moving data of different variety and veracity have to be turned into value. This is the reason why value is the one V of big data which matters the most.

Private and Confidential

6


5 V’s of big data:

Value: This means our ability to turn our data into value. It is really necessary that businesses make a case for any attempt to collect and leverage big data. It is easy to fall into the buzz trap and start embarking on big data initiatives without a clear knowledge of the business value it will bring.

Private and Confidential

7


Reasons why we are generating data faster than ever: • Processes are increasingly automated • Systems are increasingly interconnected • People are social and continuously generate data exhausts by interacting online

Data, in general, falls into 3 categories• Business application data (e.g., SAP or Oracle ERP) • Human generated data (e.g., social media) and • Machine data (e.g., RFID, Log Files etc.). In addition to this data, click and mobile business app based transactions, Human generated data — explosive growth of blogs/reviews/messages/emails/pictures. The Twitter alone generates more than 7 terabytes — 10s of millions of tweets per day and is growing rapidly. Facebook is estimated to generate more than 10 terabytes a day. Social graphs such as product recommendations based on circle of friends, jobs you may like (linked in), the products you have looked at, people who are your contacts etc. •

Private and Confidential

8


Why Imarticus

Imarticus we offers Certification in Big Data and Hadoop program with 100% career assistance. This program is designed to ensure that you are job ready to take up assignments in Big Data Analytics. Imarticus Big data analytics Program is 270 hour program delivered by experienced faculty and includes a judicious mix of academics and practical, hands-on learning. The curriculum provides you with deep understanding of Data Analysis, Big Data and Statistics, along with technical & business knowledge and cutting-edge practices.

Private and Confidential

9


Thank you Mumbai | Bangalore | Pune | Chennai | Jaipur | Delhi

ACCREDITED TRAINING PARTNER:


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.