big data ppt

21
Big Submitted To:- Cse department Presented by: Yash raj sharma(6CS-91) B.Tech VI Sem. Jaipur National University , Jaipur

Upload: yash-raj

Post on 08-Aug-2015

70 views

Category:

Engineering


3 download

TRANSCRIPT

Page 1: Big data ppt

Big Data

Submitted To:-Cse department

Presented by: Yash raj sharma(6CS-91)

B.Tech VI Sem.Jaipur National University , Jaipur

Page 2: Big data ppt

Contents

Introduction

Problem of Data Explosion

Big Data Characteristics

Issues and Challenges in Big Data

Advantages of Big Data

Projects using Big Data

Conclusion

Page 3: Big data ppt

Introduction

Big Data is large volume of Data in structured or unstructured form.

The rate of data generation has increased exponentially by increasing

use of data intensive technologies.

Processing or analyzing the huge amount of data is a challenging

task.

It requires new infrastructure and a new way of thinking about the way

business and IT industry works

Page 4: Big data ppt
Page 5: Big data ppt

Problem of Data Explosion

The International Data Corporation (IDC) study predicts that overall data

will grow by 50 times by 2020.

The digital universe is 1.8 trillion gigabytes (109) in size and stored in 500

quadrillion (1015) files.

Information Bits in the digital universe as stars in our physical universe.

90% Data is in unstructured form.

Page 6: Big data ppt

Volume

Velocity

Variety

Page 7: Big data ppt

Big data can be described by the following characteristics:

Volume – The quantity of data that is generated is very important in this context. It is the size of the data which determines the value and potential of the data under consideration and whether it can actually be considered Big Data or not. The name ‘Big Data’ itself contains a term which is related to size and hence the characteristic.

Variety - The next aspect of Big Data is its variety. This means that the category to which Big Data belongs to is also a very essential fact that needs to be known by the data analysts. This helps the people, who are closely analyzing the data and are associated with it, to effectively use the data to their advantage and thus upholding the importance of the Big Data.

Velocity - The term ‘velocity’ in the context refers to the speed of generation of data or how fast the data is generated and processed to meet the demands and the challenges which lie ahead in the path of growth and development.

Veracity - The quality of the data being captured can vary greatly. Accuracy of analysis depends on the veracity of the source data.

Page 8: Big data ppt

Complexity - Data management can become a very complex process, especially when large volumes of data come from multiple sources. These data need to be linked, connected and correlated in order to be able to grasp the information that is supposed to be conveyed by these data. This situation, is therefore, termed as the ‘complexity’ of Big Data. Factory work and Cyber Physical System may have a 6C system:

1.Connection (sensor and networks),2.Cloud (computing and data on demand),3.Cyber (model and memory),4.content/context (meaning and correlation),5.community (sharing and collaboration), and6.customization (personalization and value).

In this scenario and in order to provide useful insight to the factory management and gain correct content, data has to be processed with advanced tools (analytics and algorithms) to generate meaningful information. Considering the presence of visible and invisible issues in an industrial factory, the information generation algorithm has to be capable of detecting and addressing invisible issues such as machine degradation, component wear, etc. in the factory floor

Page 9: Big data ppt

Issues in Big Data

Issues related to the Characteristics

Storage and Transfer Issues

Data Management Issues

Processing Issues

Page 10: Big data ppt

Issues in Characteristics

Data Volume Issues

Data Velocity Issues

Data Variety Issues

Worth of Data Issues

Data Complexity Issues

Page 11: Big data ppt

Storage and Transfer Issues

Current Storage Techniques and Storage Medium are not appropriate for effectively

handling Big Data.

Current Technology limits 4 Terabytes (1012) per disk, so 1 Exabyte (1018) size data

will take 25,000 Disks.

Accessing that data will also overwhelm network.

Assuming a sustained transfer of 1 Exabyte will take 2,800 hours with a 1 Gbps

capable network with 80% effective transfer rate and 100Mbps sustainable speed.

Page 12: Big data ppt

Data Management IssuesResolving issues of access, utilization, updating, governance, and

reference (in publications) have proven to be major stumbling

blocks.

In such volume, it is impractical to validate every data item.

New approaches and research to data qualification and validation

are needed.

The richness of digital data representation prohibits a

personalized methodology for data collection.

Page 13: Big data ppt

Processing IssuesThe Processing Issues are critical to handle.Example:1 Exabyte = 1000 Petabytes (1015).Assuming a processor expends 100 instructions on one block at 5 gigahertz, the time required for end to-end processing would be 20 nanoseconds. To process 1K petabytes would require a total end-to-end processing time of roughly 635 years.

Effective processing of Exabyte of data will require extensive parallel processing and new analytics algorithms

Page 14: Big data ppt

Challenges in Big DataPrivacy and Security

Data Access and Sharing of Information

Analytical Challenges

Human Resources and Manpower

Technical Challenges

Page 15: Big data ppt

Privacy and SecurityPrivacy and Security are sensitive and includes conceptual,

Technical as well as legal significance.

Most Peoples are vulnerable to Information Theft.

Privacy can be compromised in the large data sets.

The Security is also critical to handle in such large data.

Social stratification would be important arising consequence

Page 16: Big data ppt

Data Access and Sharing of Information

Data should be available in accurate, complete and timely

manner.

The data management and governance process bit complex

adding the necessity to make data open and make it available to

government agencies.

Expecting sharing of data between companies is awkward

Page 17: Big data ppt

Analytical Challenges

Big data brings along with it some huge analytical

challenges.

Analysis on such huge data, requires a large number of

advance skills.

The type of analysis which is needed to be done on

the data depends highly on the results to be obtained.

Page 18: Big data ppt

Technical ChallengesFault Tolerance: If the failure occurs the damage done should be within acceptable threshold rather than beginning the whole task from the scratch. Scalability: Requires a high level of sharing of resources which is expensive and dealing with the system failures in an efficient manner.Quality of Data: Big data focuses on quality data storage rather than having very large irrelevant data. Heterogeneous Data: Structured and Unstructured Data

Page 19: Big data ppt

Advantages of Big DataUnderstanding and Targeting Customers

Understanding and Optimizing Business Process

Improving Science and Research

Improving Healthcare and Public Health

Optimizing Machine and Device Performance

Financial Trading

Improving Sports Performance

Improving Security and Law Enforcement

Page 20: Big data ppt

ConclusionsThe commercial impacts of the Big data have the potential to generate significant productivity growth for a number of vertical sectors.

Big Data presents opportunity to create unprecedented business advantages and better service delivery.

All the challenges and issues are needed to be handle effectively and in a efficient manner.

Growing talent and building teams to make analytic-based decisions is the key to realize the value of Big Data.

Page 21: Big data ppt

Thanks you