big data security - sector data security - fowler.pdfencryption the problem with many hadoop...

26
Big Data Security Kevvie Fowler kpmg.ca

Upload: others

Post on 10-Oct-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

Big Data Security Kevvie Fowler kpmg.ca

Page 2: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

About myself

Kevvie Fowler, CISSP, GCFA Partner, Advisory Services KPMG Canada

Industry contributions

Page 3: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Big data security | definitions

Definitions

Big data Datasets so large/complex they

become difficult to work with using existing technology

Big data technology Specialized technology developed

to manage large/complex data sets

Page 4: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Big data security | industry demand

The big data landscape

Page 5: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Big data security | industry demand

Explosive growth is occurring within the big data market

Apache Hadoop

▪ 54.7% growth (~2018) ▪ 20.9B market by 2018

2012 $11.6B

2018 $46.34B

Big Data Market By Types - Worldwide Forecasts & Analysis (2013 – 2018)

Page 6: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Big data security | Hadoop architecture

Hadoop architecture

Page 7: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Big data security | The challenge

The Hadoop security challenge Architectural design Sheer volume of data to be secured Minimal native security features

Page 8: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Big data security | The challenge

Can’t you secure Hadoop with 3rd party products? Several overlays on the market

RBAC

Logging

Encryption

The problem with many Hadoop

security overlays Don’t scale with the data

Point solutions

Can’t substitute for ground-up security builds

Page 9: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Big data security | The challenge

Big data can be a perfect storm of risk for an organization Massive amount of data

Little effective security

Big data breaches are inevitable They will dwarf the “large” breaches of today

Cost to recover

Investigative abilities

De-centralized storage

You can significantly increase your protection against attack by following 8

steps

Page 10: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Identify big data use data and associated security/privacy requirements If you don’t need sensitive data, don’t store it

Obfuscate sensitive information whenever possible

Big data security | Step #1

1

Page 11: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #2

Use a configuration management tool to deploy and manage your cluster Logging

Management

Cluster Mgt. Solution

2

Page 12: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #3

Validation of nodes and requests

Validate nodes and client applications before admission to the cluster

3

Page 13: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #3

Validation of nodes and requests (continued)

Authentication By default there is no authentication

Secure RPC & HTTP Web consoles (Hadoop’s Web UIs, WebHDFS, and HttpFS)

Simple Authentication and Security Layer (SASL)

Kerberos

Authorization Set your HDFS file permissions

MapReduce ACL’s

3

Page 14: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #4

Secure the underlying OS Server hardening

Encrypt sensitive data-at-rest

4

Page 15: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #5

Use transmission level security Most clusters use RPC, TCP/IP & HTTP

SSL / TLS to authenticate and ensure privacy of communications between cluster nodes

5

Page 16: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #6

Have a choke point

Clients communicate directly with resource managers and nodes implement a choke point to block access to users/IP’s as required.

6

Page 17: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #7

Secure Hadoop-related applications Hadoop extensions 3rd party applications

7

Page 18: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #7

Secure Hadoop-related applications | Hive (continued) Hive is a data warehouse system for Hadoop

HiveQL is a language based on SQL that allows a user friendly front-end to MapReduce

7

Page 19: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #7

Secure Hadoop-related applications | Hive

SQL Injection meets the Hive

7

Page 20: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #7

Secure Hadoop-related applications | Hive (continued) HiveQL includes many operators, functions and expressions commonly abused by

SQL by injection attacks

Type of injection

SQL/ASP.NET

HIVEQL/HUE

Simple Dynamic SQL Injection X X Blind SQL Injection X X Stacked queries X X

7

Count Union Distinct

Wait for Sub queries Expressions joined by OR in a WHERE clause Comparisons between two constants

Page 21: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #7

Secure Hadoop-related applications | Hive (continued)

Protecting against HiveQL injection Accountability (user developed functions, views, logic)

Security reviews of MapReduce/HiveQL applications

Revoke access where possible

Use Hive Server 2!

7

Page 22: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Step #8

Ensure your IR and Forensics program incorporates big data technology

Traditional IR/Forensics practices aren’t effective against big data technology

Potential for enormous organizational impact, little information on how to manage it

10

8

Page 23: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | Future enhancements

Upcoming Hadoop security enhancements

HBASE Security (HBASE-6222)

Token-based authentication (HADOOP-9466)

Encrypted data at rest (HADOOP-9331)

10

Page 24: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

©2013 Moskowitz & Meredith LLP is a Canadian limited liability partnership affiliated with KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

Hadoop security | References

www.intel.com

www.cloudera.com

www.hortonworks.com

Page 25: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

Thank you Kevvie Fowler, CISSP, GCFA Partner Advisory Services Office: (416) 777- 3742 Email: [email protected]

Page 26: Big Data Security - SecTor Data Security - FOWLER.pdfEncryption The problem with many Hadoop security overlays Don’t scale with the data ... Protecting against HiveQL injection Accountability

The information contained herein is of a general nature and is not intended to address the circumstances of any particular individual or entity. Although we endeavor to provide accurate and timely information, there can be no guarantee that such information is accurate as of the date it is received or that it will continue to be accurate in the future. No one should act on such information without appropriate professional advice after a thorough examination of the particular situation.

© 2013 KPMG LLP, a limited liability partnership and the Canadian member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved.

The KPMG name, logo and "cutting through complexity" are registered trademarks or trademarks of KPMG International Cooperative ("KPMG International").

26