A Survey on Appliance and Secure In Big Data

Page 1

IJIRST –International Journal for Innovative Research in Science & Technology| Volume 1 | Issue 6 | November 2014 ISSN (online): 2349-6010

A Survey on Appliance and Secure In Big Data N.Naveenkumar PG-Scholar Department of Information Technology SNS College of Technology, Coimbatore

N.Naveenkumar Assistant Professor Department of Information Technology SNS College of Technology, Coimbatore

Abstract The area of Big Data is a highly new challenging environment with a rapidly evolving landscape. This paper surveys will involve learning new tools and techniques used to process big data in a highly performing and scalable manner. This will also include the developing new connectors to the Big Data system, new data structures, and techniques to represent the various datasets and analyze to secure them efficiently to uncover insights. Data security and privacy deliver data protection across all enterprises. Together, they comprise the people, process and technology required to prevent destructive forces, threats and unwanted actions, now it’s the time to keep customer, business, personally identifiable information and other types of sensitive data safe against internal and external threats. Data should be protected and no matter where it resides in databases, applications or reports across production and non-production environments. Keywords: Analysis, Big Data, Database, Hadoop, security. _______________________________________________________________________________________________________

I. INTRODUCTION Big data analytics is the use of advanced analytic techniques against very large, diverse data sets that include different types such as structured/ unstructured and streaming/batch, and different sizes from terabytes to zettabytes. Big data is a term applied to data sets whose size or type is beyond the ability of traditional relational databases to capture, manage, and process the data with low-latency. aAnd it has one or more of the following characteristics – high volume, high velocity, or high variety. Big data comes from sensors, devices, video/audio, networks, log files, transactional applications, web, and social media - much of it generated in real time and in a very large scale[1]. Analyzing big data allows analysts, researchers, and business users to make better and faster decisions using data that was previously inaccessible or unusable. Using advanced analytics techniques such as text analytics, machine learning, predictive analytics, data mining, statistics, and natural language processing, businesses can analyze previously untapped data sources independent or together with their existing enterprise data to gain new insights resulting in significantly better and faster decisions. Big Data is a term defining data that has three main characteristics. First, it involves a great volume of data. Second, the data cannot be structured into regular database tables and third, the data is produced with great velocity and must be captured and processed rapidly. Big Data is a relatively new term that came from the need of big companies like Yahoo, Google, Facebook to analyze big amounts of unstructured data, but this need could be identified in a number of other big enterprises as well in the research and development field.The framework for processing[2]. There is Hadoop, an open source platform that consists of the Hadoop kernel, Hadoop Distributed File System (HDFS), MapReduce and several related instruments. Two of the main problems that occur when studying Big Data are the storage capacity and the processing power. Trillions of dollars are at stake in the ongoing battle against cybercriminals and fraudsters. But now, big data technologies like Hadoop are helping to prevent unauthorized access, both behind the corporate firewall and in public coffers. One company that’s hoping to simultaneously put a cap on internal and external threats is Fort scale, which today announced the availability of a Hadoop-based security monitoring product. The San Francisco-based company uses proprietary machine learning algorithms running in Hadoop to score the security risk of every user in an organization, and then monitor user behavior for significant deviations from that score[3].

II. PROFICIENCY USED IN BIG DATA A. Big Table Proprietary distributed database system build on the Google File System. Inspiration from HBase. B. Business intelligence (BI) A type of application software designed to report, analyze and present data. BI tools are often used to read data that have been stored in a data warehouse or data mart. BI tools can also be used to create standard reports that are generated on a periodic basis, or to display information on real-time management dashboards, i.e. integrated displays of metrics that measure the performance of a system.

All rights reserved by www.ijirst.org

237


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.