federated cloud file system - storage networking … cloud file system (fedcfs) overview framework...

19
2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 1 Federated Cloud File System Framework Ujjwal Lanjewar ([email protected])

Upload: vankhanh

Post on 24-Apr-2018

220 views

Category:

Documents


6 download

TRANSCRIPT

Page 1: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 1

Federated Cloud File System Framework

Ujjwal Lanjewar ([email protected])

Page 2: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 2

Agenda Background Federated Cloud File System (FedCFS) Overview

Framework Filesets in Cloud Global Namespace

Cloud Storage Federation FedCFS Protocol Overview FedCFS Architecture and Components

Architecture Overview Distributed Cloud Interface Object File System Global VFS

Next Steps References

Page 3: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 3

Federated File System (Recap)

Highlights Global Uniform Namespace

Federation of many file servers

Independent of File-Protocol

Built on existing NFS/CIFS File Servers

Terminology NSDB – Namespace DB (LDAP)

FILESET – Directory Tree (Unit of Data)

JUNCTIONS – Mount Points

FSL – Refers to the location of filesets

Reference: IETF ‘77 NFSv4 WG, 2010, James Lentini, NetApp

Page 4: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 4

Motivation

Hybrid Cloud – Organizations using multiple clouds. Need for federation framework for cloud storages.

Global namespace - Need to view distributed cloud data under a common global namespace

Possible FedFS extension (e.g. integration with cloud, NSDB alternative, etc).

Federated Cloud File System (FedCFS) Framework

Page 5: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 5

Overview

Global Uniform namespace for data stored in the hybrid clouds

File Services over Cloud Object Storage

Distributed File Services

Data organized in the form of Filesets (Directory Tree) in cloud

Cloud Storage accessible over Heterogeneous Protocols

Replication HA with dynamic namespace resolution (policy, etc)

Namespace is rendered at the file server itself

Caching of cloud data for performance reasons (Lease for consistency)

Uniform Distributed Cloud Interface (DCI)

Ability to add extensions to file services

Transparent to NFS/CIFS Clients

FedCFS Framework (cont…) Clients

Servers

Cloud Storage

Filesets

Page 6: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 6

Fileset Logical entity representing a Directory Tree Accessible from one or more servers Typically identified with UUID or a globally unique name (FSN)

Fileset Location (FSL)

Cloud Object Storage (e.g. Containers on the cloud) Primay and Secondary Locations

(Secondary Locations are Read only, hosted on different cloud storage) Can also be Directory on the Server (FedFS)

Fileset Migration / Replication

Fileset can have one or more locations (hosted on different cloud) Fileset can be migrated/replicated between cloud

Filesets

Cloud-Storage-1

Cloud-Storage-2

Directory Tree

Fileset Location

Fileset Locations

File

Page 7: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 7

Filesets (cont...) Fileset in Cloud… Fileset is represented with a container / bucket.

Cloud Objects

Directory – Cloud storage object containing information of directories / files underneath in namespace hirarchy

File – Object Storing file data or Information about extents Extents – Used for really large files and stores parts of file data Junctions – Objects that store mount point target FSN (Leaf) File Group – Object storing group of files (For small files)

ROOT Fileset

Links parts of namespaces together to provide global namespace. Read-only (Can be updated with FedCFS admin commands) Contains NSDB to Information about filesets, domain, junctions, etc.

Bucket / Container

ROOT Dir

Directory / Junction / File / Extent

(ROOT Fileset only)

NSDB

Page 8: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 8

Namespace

Single Logical Directory Tree Spans multiple filesets Filesets are connected together by way of Junctions

Namespace DB Hosted in Root Fileset in cloud. Equivalent to NSDB (FedFS).

Can be accessed from multiple admin server. Used to map FSN to FSL

DBs stores information about All the registered filesets (FSN as the key) Directories and Subdirectories (within the ROOT fileset) Junctions and Referral Information

Namespace Management NSDB Adminstration Server (Manages Root Fileset) NSDB Admin Commands (allows namespace management)

Global Namespace

Root Fileset

Junction Fileset

Global Namespace

NSDB Admin Server

Page 9: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 9

Junctions / Mount Points Filesets can be mounted on other filesets through ‘Junctions’ Junction – It contains URL with target fileset information Junctions are independent of FSL Junction Targets are FSN

Junction Resolution Juntction resolution happens through root fileset Happens on the server side Client agnostic Protocol Independant Steps: (Access)

1. Checks for the object type 2. If object is junction, get the Type+Target FSN (Referral) 3. Query ROOT fileset with FSN for FSL 4. Obtain FSL details from ROOT fileset 5. Execute Policy Manager for the best fit FSL 6. Return the FSL as Target for alllowing access

Global Namespace (cont...) Root Fileset

1 3

2

4 5

FSN

FSL

5

Junction Point Target Fileset

Source Fileset

6

Page 10: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 10

Cloud Storage Federation Logical View Physical View

Physical Structure Stored in Hybrid Cloud Environment Fileset data stored as objects Data Replication

Filesets replicated across Clouds Clouds can be positioned like Tiers

Logical View Data is viewed as hirarchical structure In the form of files and folders.

ORG DATA

USA

JOHN

ROOT

FINANCE

FY-15

ORG FY-15 USA

DATA FINANCE FY-15

JOHN DATA FINANCE

Public Cloud-1

Public Cloud-2

Private Cloud

Page 11: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 11

Cloud Storage Federation (cont…) FedCFS Cell / Domain Logical boundary of a global namespace Each Cell contains a ROOT Fileset A Cell contains logically grouped filesets Fileset can technically mount a foreign fileset Multi-Site Site - the location where data is accessed Site contains one or more file servers Multi-Protocol Federation is Protocol Independent File Services are built on top using federation framework Cloud Tiers Cloud can be viewed as Storage Tiers Fileset can be replicated and migrated between Tiers

File Server

Site 1

File Server

Site 2

Clients Clients

NFS SMB ... NFS SMB ...

ORG FY-15 USA

DATA FINANCE FY-15

JOHN DATA FINANCE

Page 12: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 12

FedCFS Protocols FedCFS Administration Fileset Management Namespace Management (Junctions) Namespace Navigation Cloud Interface Object Layout in Cloud Junction Resolution File-Protocol File-Protocol (NFS/CIFS) Federation is Protocol Independent File Services are built on top using federation framework Server to Server Protocol ** Sync / Lock Clustering Cloud to Cloud Protocol ** Inter-Cloud data replication ** Currently outside of Federation protocol

Admin Client NAS Client

File Server Admin Server File Server

Cloud Storage

Inter-Cloud

NFS/CIFS/… FedCFS Administration

FedCFS Navigation

Server To Server

FedCFS Administration FedCFS Navigation

Page 13: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 13

FedCFS Software Architecture

Openstack SWIFT Amazon S3 CDMI Compatible

SWIFT Plugin S3 Plugin CDMI Plugin

Object Cache

Distributed Cloud Interface (DCI)

Object File System (OFS)

OFS Plugin

Global VFS (GVFS)

FUSE VFS Plugin

EXT4

Applications (FS Driver) NFS Ganesha

(FSAL) SAMBA Server (VFS Module)

NFS Clients SMB Clients

Page 14: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 14

FedCFS Software Architecture (cont...) Distributed Cloud Interface (DCI) Uniform Interfaces across clouds Plug-in based approach Object Cache Open-Close Semantics for accessing objects Support for Swift, S3 and CDMI Lease based synchronization Example Caching Interfaces

open_object read_object write_object close_object

Example Non-caching Interfaces delete_object put_object get_object set_object_metadata get_object_metadata create_container delete_container set_container_metadata get_container_metadata

Object File System (OFS)

DCI

Plug-in Manager Disk Cache

S3 CDMI SWIFT

DISK CACHE

Cached Objects

...

Cache ID

Cache ID

Cloud ID

Page 15: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 15

FedCFS Software Architecture (cont...) Global VFS Backbone to file servers (Making it protocol independent) Providing access to data hosted on cloud / local disk Plug-in based (for extension to 3rd party storage) Plan to support server side replication, migration, snapshots Object File System (OFS) File System on top of object storage from cloud File System APIs to easy integration info Applications Understands organization of data stored in cloud

File Services

Global VFS

Plug-in Manager

3rd Party Local

DCI

OFS

Page 16: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 16

Next Steps FedFS and FedCFS compatibility and co-existence Federation of Cloud data with Non-cloud data stored on the disk (local to file server) NSDB Protocol Enhancements Clustering (Share Management / Lock) File Services and Use Cases (e.g. Applications) Define and Standardize

Unified DCI APIs and Packages

Cloud Data Structure / Organization

Inter Cloud Interface (Replication or Migration)

Clustering of File Services (Distributed Applications)

Page 17: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 17

References Glamor: An architecture for file system federation, IBM Journal of R&D, 2008, U. Lanjewar, M.Naik, R.Tewari, IBM

Federated File System Overview, IETF ’77 NFSv4 WG, 2010, James Lentini, NetApp

Federated Namespace BOF, USENIX FAST 2010, James Lentini, NetApp

pNFS, NFSv4.1, FedFS and Future NFS Developments, 2013, Alex McDonald, NetApp

FedFS Standards and Implementation, 2013, Chuck Lever, Oracle

Introducing FedFS on Linux, SDC 2014, Chuck Lever, Oracle

Cloud Data Management Interface (CDMI) v1.0, SNIA.org

Object Storage API v1, developer.openstack.org

Amazon Simple Storage Service, API Reference, docs.aws.amazon.com

Page 18: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 18

Thank You

Page 19: Federated Cloud File System - Storage Networking … Cloud File System (FedCFS) Overview Framework Filesets in Cloud Global Namespace Cloud Storage Federation FedCFS Protocol Overview

2015 Storage Developer Conference - India. © Calsoft Pvt. Ltd. All Rights Reserved. 19

About Author

Ujjwal Lanjewar – Principal Architect, Calsoft Pvt. Ltd. Masters Degree in System Software from BITS PILANI, INDIA. A veteran of storage industry with 17+ years of experience in architecting and developing products for NAS, Global Namespace

with FedFS, Distributed File Systems, Data Protection, Replication and Migration, etc. Working as Architect for designing enterprise scale products across stacks of different layers. About Calsoft Pvt. Ltd. Product Development Services Company (in Storage / Virtualization) with 16 years of existence, headquerterd in Pune, India. Six out of top 10 storage companies work with Calsoft including NetApp, EMC, Cisco, etc.