Loading…
Vault 2016 has ended
Wednesday, April 20
 

8:00am PDT

Registration & Breakfast
Wednesday April 20, 2016 8:00am - 9:00am PDT
State Ballroom Foyer

9:00am PDT

Keynote: Going Big with Open Source Storage - Sage Weil, Ceph Principal Architect, Red Hat
The next generation of distributed storage systems we build and deploy will face an expanding set of requirements: asynchronous multi-datacenter replication, varied consistency models with active/active globally distributed data stores, quality of service management, multiprotocol access, and tiering across an expanding menu of storage technologies and hardware with increasingly open architectures.  This talk will look at what the challenges we face as an open storage community and what problems we'll need to do to address in the next few years.

Speakers
avatar for Sage Weil

Sage Weil

Ceph Project Leader, Red Hat
Sage Weil helped developed Ceph as part of his graduate studies at the University of California, Santa Cruz. After graduating he continued to develop the system and build an open source community around Ceph with the support of DreamHost. In 2012, he co-founded Inktank to productize... Read More →


Wednesday April 20, 2016 9:00am - 9:20am PDT
State Ballroom CD

9:20am PDT

Keynote: Open Standards, Open Source and the Evolving Hardware/Software Interface - Jorge Campello, Director, System & Software Technologies, HGST, A Western Digital Company

Open hardware interfaces are typically developed and defined in Standards Setting Organizations through a process involving broad industry participation. The process of generating the needed software isn’t always coordinated with hardware standards development, and often lags the hardware specification development by quite some time. Open source in general, and Linux in particular, has been dealing with the pragmatic problem of waiting for hardware to be built, which often only happens some time after the standard is published, before software can be written against it.  In this presentation we will discuss this process and some potential ways to reduce the time lag for technology introduction.


Speakers
avatar for Jorge Campello

Jorge Campello

Sr. Director, System & Software Technologies, Western Digital
Jorge Campello holds a Electrical Engineer Degree and a MSc in Electrical Engineering from the Universidade Federal de Pernambuco and a PhD in Electrical Engineering from Stanford University.  He started his career in the storage industry at IBM’s Almaden Research Center.   He... Read More →


Wednesday April 20, 2016 9:20am - 9:35am PDT
State Ballroom CD

9:35am PDT

Keynote: Information Storage in DNA - Nick Goldman, Program Director, The Goldman Group
The amount of information that humans produce and want to store is growing exponentially. I will argue that at present, no long-term _archiving_ --- meaning storing safely, recoverably, for long periods of time with rare access and minimal maintenance requirements --- of digital information is taking place.

Recent advances in genome science led us to look at the possibility of using DNA as a digital archive medium.  In a `proof of principle' experiment, we used standard DNA technologies to store and recover approximately 750kb of digital information in a manner that could be extrapolated to global data scales, incorporating modern methods such as error correcting codes for data integrity.  DNA has numerous properties that are ideal for archiving, and this talk will describe our current experiments and will speculate on the
future of DNA as a digital storage medium.

Speakers
avatar for Nick Goldman

Nick Goldman

Research Group Leader, The Goldman Group
Nick Goldman has a first degree in mathematics and received his PhD in molecular evolution from the Department of Zoology, University of Cambridge, in 1992.  He has worked at the Natural History Museum (London), the MRC-National Institute for Medical Research (London) and the University... Read More →


Wednesday April 20, 2016 9:35am - 9:55am PDT
State Ballroom CD

10:00am PDT

Break
Wednesday April 20, 2016 10:00am - 10:30am PDT
State Ballroom Foyer

10:30am PDT

Managing Persistent Memory - Dan Williams & Tiffany Kasanicky, Intel
With the release of v4.5, the Linux kernel has gained support for a wide range of persistent memory storage applications. From the basics of treating a persistent memory range like a disk, to the more advanced configurations of enabling direct byte-addressable access to applications (DAX) or 3rd-party device access (DMA/RDMA), there is a range of configuration options to consider. This talk combines an overview of the Linux kernel persistent memory access capabilities with an introduction to IXPDIMM, management tooling and libraries for administering the persistent memory capabilities of a platform.

Speakers
TK

Tiffany Kasanicky

Software Architect, Intel Corporation
avatar for Dan Williams

Dan Williams

Software Engineer, Intel
Dan is a Linux kernel developer in Intel Open Source Technology Center primarily focused on storage technologies. Most recently he has been involved in persistent memory enabling as a maintainer of the Linux kernel libnvdimm sub-system. He led the Persistent Memory micro-conference... Read More →


Wednesday April 20, 2016 10:30am - 11:20am PDT
State Ballroom B

10:30am PDT

Open-Channel Solid State Drives - Matias Bjørling CNEX Labs
This presentation gives an introduction to Open-Channels SSDs and LightNVM - the kernel subsystem for Open-Channel SSDs. Open-Channel SSDs allow applications to directly control data placement and garbage collection within an SSD. The benefits are consistent latency and throughput in high-performance workloads and extending SSD lifetime. The primary objective of this talk is to show the possibilities of this new storage interface, its architecture, the current status and roadmap for LightNVM for both upcoming kernel functionality and user-space support.

Speakers
MB

Matias Bjørling

Director, Western Digital
Matias Bjørling researchs emerging storage architectures Western Digital. Before joining the industry, he obtained a Ph.D. in operating systems, and non-volatile storage by doing performance characterization of flash-based SSDs, working on thr linux kernel blk-mq, and its associated... Read More →


Wednesday April 20, 2016 10:30am - 11:20am PDT
State Ballroom A

10:30am PDT

Facebook Warm Storage – Next Generation Storage for Data Warehouse in Hadoop Ecosystem - Kestutis Patiejunas, Facebook
Introduction Very few companies run 24/7 clusters at the scale like Facebook using Hadoop/HDFS. As scale increases HDFS shows its limitations in scalability, performance and availability. HDFS was first designed 10+ years ago based on the assumptions that were valid in 2005. Over the years, numerous significant improvements have been made to the system. However storage, network and compute technology changed at different ratios during last decade. In this paper we describe Warm Storage - grounds up solution that exploits technology inflection points that exist in 2015 and we believe will continue into the future. We describe architecture and some of the implementation details of the novel and more efficient and available storage system optimized for DW workloads.

Speakers
KP

Kestutis Patiejunas

Kestutis Patiejunas is a software engineer at Facebook. He works with a team of engineers on Facebook Warm and Hot Storage – in internal replacement for HDFS. Before he worked on Cold Storage – system for long term data archival where he created hard drive based cold storage and... Read More →


Wednesday April 20, 2016 10:30am - 11:20am PDT
Congressional Room AB

10:30am PDT

GlusterFS and its Distribution Model - Sakshi Bansal, Red Hat
GlusterFS is a scalable network filesystem to create large, distributed storage solutions. Sakshi Bansal and Susant Palai will talk about the need for a distributed file system and give an introduction to GlusterFS. The talk will mainly focus on how GlusterFS manages distributions of files and directories through Distributed Hash Table (DHT). They will also review on some of the recently introduced key features to DHT that improve performance. Finally they will give a short demo on the working of GlusterFS.

Speakers
SB

Sakshi Bansal

Sakshi Bansal is currently working as a software developer at Red Hat for the GlusterFS project. Her main contribution pertain to Distributive Hash Table (DHT) component. She is a FOSS enthusiast and has previously given a lightning talk at DebConf on Python Requests. She has also... Read More →


Wednesday April 20, 2016 10:30am - 11:20am PDT
State Ballroom E

10:30am PDT

What's new in RADOS for Jewel? - Samuel Just, Red Hat
RADOS is the layer of the Ceph storage system responsible for data storage and replication. In this presentation, Sam Just, the PTL of the Ceph core team, will discuss recent developments in RADOS new in the recent Jewel release and exciting upcoming work.

Speakers
SJ

Samuel Just

Engineer, IBM
Sam began working on the Ceph project in 2011. Most of his time currently is spent working on crimson, the next generation ceph-osd implementation.


Wednesday April 20, 2016 10:30am - 11:20am PDT
State Ballroom F

11:30am PDT

Managing Fabric-Attached Memory in The Machine - Rocky Craig, Hewlett Packard Enterprise
Fabric-Attached Memory (Rocky Craig, Hewlett Packard Enterprise) - The Machine from HPE is the prototoype of a novel architecture featuring memory-centric computing. Multiple Linux nodes see fabric-attached NVM in their physical address space as a shared global resource. This presentation introduces the hardware architecture as it relates to the NVM and the challenges in shared management. While NVM presents a key-value paradigm in data retrieval, we chose the route of a global file system. The bulk of the talk will focus on this "Librarian File System" and the benefits of a well-known API versus a proprietary solution.

Speakers
avatar for Rocky Craig

Rocky Craig

Principal Lead, Hewlett Packard Enterprise
Rocky is employed by Hewlett Packard Enterprise, a spinoff of HP where he started in 1978. He has served many roles in the technical space of technical workstations and HP-UX and Linux servers. Rocky is currently co-architect of a Debian-based distribution for The Machine, a new architecture... Read More →


Wednesday April 20, 2016 11:30am - 12:20pm PDT
State Ballroom A

11:30am PDT

Solving the Linux Storage Scalability Bottlenecks - Jens Axboe, Facebook
Flash devices introduced a sudden shift in the performance profile of direct attached storage. With IOPS rates orders of magnitude higher than rotating storage, it became clear that Linux needed a re-design of its storage stack to properly support and get the most out of these new devices.

This talk will detail the architecture of blk-mq, the redesign of the core of the Linux storage stack, and the later set of changes made to adapt the SCSI stack to this new queuing model. Early results of running Facebook infrastructure production workloads on top of the new stack will also be shared.

Speakers
avatar for Jens Axboe

Jens Axboe

Facebook
Jens Axboe is Software Engineer at Facebook, formerly a Fellow at Fusion-io, and Consulting member of staff at Oracle. He also serves as the Linux block layer maintainer. Jens has worked on all things Linux IO related, such as data writeback, IO scheduling, SATA/SCSI, and others... Read More →


Wednesday April 20, 2016 11:30am - 12:20pm PDT
State Ballroom B

11:30am PDT

BoF: Logical DePop - Damien Le Moal, WDC
Some block storage device standards are working on a “logical depop" function which allows a system to decommission a defective physical element, e.g. a disk head or an SSD die or channel, reformat the device and continue using it with a reduced capacity. Such feature can allow reduced operation costs (delayed device replacement) but has the drawback of data loss (data under the remaining valid physical elements) and device downtime during re-formatting.

Online logical depop is another proposed new feature allowing retaining the device valid data and eliminating the need for a re-format. The basic idea is to introduce new commands for the host to discover the ranges of LBAs impacted by a defective element. Using this information, the host can take actions when an element failure event is suspected or reported by the device: deallocate the LBAs served by the defective element and operate the device in a thin-provisioned mode, amputate the LBAs or truncate the device LBA range to restore operation as a fully provisioned device with a lower capacity.

The goal of this BoF is to discuss the usefulness of such features and gather feedback from different application and system point of views for drafting a standard minimizing the impact of this feature on existing systems as well as enabling enhanced functionality for file systems,  device mapper drivers (including logical volume manager) and application level distributed storage systems (key value stores, object stores).

Speakers

Wednesday April 20, 2016 11:30am - 12:20pm PDT
State Ballroom E

11:30am PDT

Ceph and Flash - Brian O’Krafka, SanDisk
Ceph is the leading open-source storage management platform for private cloud and large-scale clustered systems. As flash-based storage has come into the mainstream many of the industry best-practices must be re-examined to properly realize the full value of flash while simultaneously being cost-effective.

Ceph has been rapidly evolving to support large-scale deployment of flash. This presentation will examine the history and current best-practices for deploying flash with Ceph. Future developments in the Ceph platform will also be described and their impact on flash deployments.

Speakers

Wednesday April 20, 2016 11:30am - 12:20pm PDT
State Ballroom F

11:30am PDT

Froyo: Hassle-free Personal Array Management using XFS and DM - Andy Grover, Red Hat
LVM is a powerful, enterprise tool that uses Linux's Devicemapper subsystem to provide a wealth of possibilities for organizing and optimizing attached block storage. But, with power comes complexity. Dealing with errors, or even just making changes may cause infrequent users of the LVM tools to feel uneasy. Froyo provides a more managed storage solution -- feed it disks, and it gives you "a filesystem where your data is safe forever that never runs out of space".

Of course, under the covers are familiar technologies -- devicemapper and XFS, with Froyo pulling the strings. Andy Grover will talk about the development and implementation of Froyo, how it manages seamless online reconfiguration and monitoring of its member disks, and look forward to possible future improvements that leverage existing DM capabilities in new ways.

Speakers

Wednesday April 20, 2016 11:30am - 12:20pm PDT
Congressional Room AB

11:30am PDT

Performance improvements in Replication and Erasure coding in GlusterFS - Anuradha Talur, Red Hat
GlusterFS is a scalable network filesystem that enables users to deploy large distributed storage solution using common off-the-shelf hardware. It provides high availability and redundancy through Automatic File Replication(AFR) and Erasure Coding(EC). In this presentation Anuradha Talur will be talking about improvements put forth in GlusterFS around AFR and EC to effectively tackle some of the long standing performance issues faced by users. These enhancements are targeted towards improving performance of small file workloads, and automatic repair/self-heal so that storage is not in degraded state for longer time. The talk also focus on improvements for scaling up self-heal/repair load when hardware can offer more iops, scaling self-heal load down during heavy traffic on storage from applications, and addressing the high consumption of CPU due to non systematic erasure coding.

Speakers
AT

Anuradha Talur

Red Hat
Anuradha Talur has been working at Red Hat as a software developer for close to three years. She has been an active contributor to GlusterFS, an open source, highly scalable distributed filesystem. She has been predominantly involved in the development of Automatic File Replication... Read More →


Wednesday April 20, 2016 11:30am - 12:20pm PDT
State Ballroom E

12:20pm PDT

Lunch Break (Attendees on Own)
Wednesday April 20, 2016 12:20pm - 2:00pm PDT
TBA

2:00pm PDT

A Small Case Study: Lessons Learned at Facebook - Chris Mason, Facebook
Facebook uses Linux everywhere, and our wide variety of workloads often expose problems in the kernel's handling of storage and memory management.

This talk will focus on issues we've encountered, how we fix them in production, and the projects we have underway to improve the kernel.

Speakers
avatar for Chris Mason

Chris Mason

Software Engineer, Facebook
Chris is a Software Engineer on the kernel team at Facebook, and the maintainer of the Btrfs filesystem. He has been working full time on the kernel for over 16 years, and lives in Rochester New York.


Wednesday April 20, 2016 2:00pm - 2:50pm PDT
State Ballroom B

2:00pm PDT

SMB3 in Samba - Multi-Channel and Beyond - Michael Adam, Red Hat
Samba is the most important open source SMB file server, and arguably, one of the most important SMB
implementations out there. While the Active Directory Server has been the hip topic recently, the file server is really cooking. One of the most active areas of development is SMB version 3, introduced by Microsoft to move focus from pure workstation workload to server workload. SMB3 adds an abundance of new features to the protocol. In particular, Microsoft caught up with Samba/CTDB to offer all-active clustering.

After an overview of the state of Samba's file server, this talk will present the progress of SMB3 in Samba. It will focus on Multi-Channel, the core of which has just been added to Samba 4.4, including a demo. Afterwards an outlook will be given on other active areas like RDMA, persistent file handles, and scale-out SMB clustering, reporting on status and challenges.

Speakers
avatar for Michael Adam

Michael Adam

Principal Software Engineer, Red Hat
Michael Adam is an enthusiastic open source software developer, interested in all things about storage and containers. One of the main developers of Samba since more than a decade, Michael is an engineering manager at Red Hat, leading two worldwide teams: The Samba team for Gluster... Read More →


Wednesday April 20, 2016 2:00pm - 2:50pm PDT
State Ballroom A

2:00pm PDT

GlusterFS @ Facebook - Richard Wareing, Facebook
GlusterFS is an open-source (mostly) POSIX compliant distributed filesystem originally written by Gluster Inc and now maintained by RedHat Inc. Here at Facebook it had humble beginings: a single rack of machines, serving only a single use-case. Over the next 4 years it grew to thousands of machines, hosting 10s of petabytes of data. This presentation is a story of how this transformation occurred and the things we did to make it happen.

I'll cover how we manage & automate 1000's of GlusterFS bricks as well as dive into some of the patches we've contributed to the community to make GlusterFS easier to manage, monitor and scale.

Speakers
RW

Richard Wareing

Production Engineer, Facebook Inc.
Richard Wareing has been a Production Engineer for over 7 years at Facebook, with a passion for Storage Engineering. During the course of his career there he helped scale their GlusterFS (POSIX) install base from nothing to one of the largest install bases in the world.From there... Read More →


Wednesday April 20, 2016 2:00pm - 2:50pm PDT
State Ballroom E

2:00pm PDT

Linux NTB - Allen Hubbe, EMC & Dave Jiang, Intel
After 10+ years of NTB in specialized hardware, PCI-express Non-Transparent Bridge technology is making its entrance into retail off the shelf server solutions. Linux, with its selection of open source drivers for NTB, is strategically positioned to unlock the value of this low cost, low latency, high bandwidth interconnect.

Speakers
AH

Allen Hubbe

EMC
Allen Hubbe is a software engineer in the Core Technologies Division at EMC Corporation. He is currently based out of Durham, North Carolina. Allen is currently focused on developing system level software for storage and networking products which include Hyper Converged Infrastructure... Read More →
avatar for Dave Jiang

Dave Jiang

Software Engineer, Intel Corp


Wednesday April 20, 2016 2:00pm - 2:50pm PDT
Congressional Room AB

2:00pm PDT

Stable CephFS: A Development Perspective - Gregory Farnum, Red Hat
The Ceph upstream community is declaring CephFS stable for the first time in the recent Jewel release, but that declaration comes with caveats: while we have filesystem repair tools and a distributed POSIX filesystem, we have disabled exciting features like horizontally-scalable metadata servers and snapshots. This talk will present — from a developer’s perspective — exactly what features you can expect to see, what blocked the inclusion of other features, and an outline of what work we’ve done to reach this point and what work is required moving forward.

Speakers
GF

Gregory Farnum

Principal Software Engineer, Ceph, Red Hat
Greg Farnum is a long-standing member of the core Ceph development group, having joined the project as the third full-time engineer after graduating from Harvey Mudd College in 2009. Now a Red Hat employee, Greg has done major work on all components of the Ceph ecosystem and currently... Read More →


Wednesday April 20, 2016 2:00pm - 2:50pm PDT
State Ballroom F

3:00pm PDT

Building Ceph Reference Architectures for the Real World - Alejandro Bonilla, SUSE
SUSE will talk about multi-petabyte storage deployments.  The easiest adoption is with new projects that are typically in the low 100s of TBs.  This session will talk about the process of identifying the use cases and architectures that represent a solution the customer will actually buy while minimizing their risk.

Speakers

Wednesday April 20, 2016 3:00pm - 3:50pm PDT
State Ballroom A

3:00pm PDT

MD/RAID-456 Write Journal and Cache - Song Liu, Facebook
Without NVRAM and proper software support, MD/RAID-456 has not been very competitive against hardware RAID in enterprise use cases. Particularly, MD/RAID-456 suffers from potential data loss during unexpected shut down (the write hole); and long write latency from read-modify-write updates (slow fsync). With recent development in NVRAM (NVMe SSD, NVDIMM), it is now possible to bring enterprise level reliability and performance to MD/RAID-456. In this presentation, Song Liu will present joint work with Shaohua Li in MD/RAID-456 write journal and write back cache.

Speakers
avatar for Song Liu

Song Liu

Software Engineer, Facebook
Song Liu is a software engineer in Facebook, working on storage software and hardware design. His interest is in storage and distributed systems.


Wednesday April 20, 2016 3:00pm - 3:50pm PDT
State Ballroom B

3:00pm PDT

CephFS as a Service with OpenStack Manila - John Spray, Red Hat
Ceph is a popular storage backend for OpenStack deployments, in use for block devices via RBD/Cinder and for object storage with RGW. Ceph also includes a filesystem, CephFS which is suitable for integration with Manila, the OpenStack shared filesystem service.

This presentation introduces the new CephFS Native driver for Manila, describing its the implementation of the driver, and how to deploy and use it. Areas of particular interest for Ceph users will include how CephFS snapshots map to Manila's snapshot API, how Ceph's authentication scheme can be used with Manila.

In addition, the direction of future work will be described, including how the driver can be extended to provide an NFS share service based on CephFS, and how this work would integrate with VSOCK-based hypervisor-mediated filesystem access.

Speakers
avatar for John Spray

John Spray

Red Hat
John is a Ceph developer at Red Hat, working from his home office in Edinburgh, Scotland. He works mainly on the Ceph filesystem layer (CephFS).


Wednesday April 20, 2016 3:00pm - 3:50pm PDT
State Ballroom F

3:00pm PDT

pNFS for Block Storage - Sridhar Balasubramanian, NetApp
An integrated solution with pNFS block server and backend block storage comprises of highly optimized layout driver that exposes set of block volume LUNs that hosts the filesystem partition to pNFS clients with appropriate locking/release mechanism for concurrent client access to a file. The implementation includes new SCSI layout type and PR based client fencing for SCSI block storage. The solution is intended to address latency sensitive and scalable bandwidth requirements for high performance data processing workloads through cost optimized storage configurations and performance tiers for productization, and complements NetApp’s open source contributions pertaining to pNFS for file. The project is sponsored as an open source initiative under NetApp Hyperscalar Storage Group Advanced Development organization by closely collaborating with Linux NFS and kernel development community.

Speakers
SB

Sridhar Balasubramanian

NetApp
Speaker name: Sridhar Balasubramanian Bio: Sridhar is currently working as Sr. Engineer, Strategic Architecture @ NetApp RTP. With 25 years in software industry, Sridhar is inventor/co-inventor for 15 US Patents and published 4 Conference papers till date. Sridhar's area of expertise... Read More →


Wednesday April 20, 2016 3:00pm - 3:50pm PDT
Congressional Room AB

3:00pm PDT

Tiering in GlusterFS: Hardware Config Considerations - Veda Shankar, Red Hat
Glusterfs's new tiering feature helps assure that data is available and accessible at the correct performance level thus satisfying both cost and performance concerns. This presentation will explain the Glusterfs's automated tiering feature followed by a live demonstration, and provide performance benchmark results from different hardware configurations. Glusterfs tiering monitors and identifies the activity level of the data and automatically promotes active data to hot SSD based storage and demotes inactive data to cold SATA based storage. Hence, the data movement can happen in either direction, based on the access frequency. The demonstration from Quanta/QCT labs (commodity x86 hardware) will show how to implement a hot tier based on SSD drives (optimized for performance) and attach it to an Erasure coded Glusterfs volume that is based on SATA drives (optimized for cost/capacity).

Speakers
avatar for veda shankar

veda shankar

Technical Marketing, Storage, Red Hat Inc.
I am a Senior Storage Architect at Red Hat and have extensive experience with Red Hat's software defined storage products. I help the BU to create reference architectures on the storage products with various software and hardware partners. I have architected and launched the test... Read More →



Wednesday April 20, 2016 3:00pm - 3:50pm PDT
State Ballroom E

4:00pm PDT

NOVA: A Log-structured File System for Hybrid Volatile/Non-volatile Main Memories - Andiry Xu, UC San Diego
Fast non-volatile memories (NVMs) present challenges to
file system designers. Existing file systems built for HDD
and SSD introduce software overheads that would obscure
the performance that NVMs should provide, but proposed
file systems for NVMs either incur similar overheads or fail
to provide the strong consistency guarantees that applications
require.

NOVA is a file system that adapts conventional
LFS techniques to exploit the NVM characteristics
and provides strong consistency guarantee. NOVA maintains
separate logs for each inode to improve concurrency, and
stores file data outside the log to minimize log size and reduce
garbage collection costs. NOVA’s logs provide metadata,
data, and mmap atomicity, and put indexes in DRAM to
accelerate search operations. Experimental results show that
NOVA outperforms existing file systems by a wide margin.

Speakers
AX

Andiry Xu

UC San Diego
I am a PhD student of Non-Volatile Systems Lab, University of California, San Diego. I'm working under the guidance of Professor Steven Swanson. My research involves operating systems and software optimizations to fully exploit the performance potential of next-generation storage... Read More →


Wednesday April 20, 2016 4:00pm - 4:50pm PDT
State Ballroom B

4:00pm PDT

Performance Analysis of SCSI-mq on 16G FC - Hannes Reinecke, SUSE Labs
With recent kernels block and SCSI multiqueue support has been integrated, showing a big performance boost on high-performance devices. At the same time, there are some features which a missing from multiqueue. Most notably multiqueue does not have any I/O scheduler, and it doesn't support I/O merging.

With high-speed storage these features should matter less, as the hardware is supposed to be fast enough to offset any impact here. However, numbers supporting this are hard to come by.

To validate this we have been running performance tests with Emulex 16G FC cards and NetApp EF-650 storage.

In this presentation we will show the performance results when running a 4.4 kernel. We will be giving a comparison with the original (3.12-based) SCSI implementation as well as the current implementation, with both multiqueue enabled and disabled.

Speakers
avatar for Hannes Reinecke

Hannes Reinecke

Kernel Storage Architect, SUSE Labs
Studied Physics with main focus image processing in Heidelberg from 1990 until 1997, followed by a PhD in Edinburgh 's Heriot-Watt University in 2000. Worked as sysadmin during the studies, mainly in the Mathematical Institute in Heidelberg. Now working at SUSE Labs as Kernel Storage... Read More →


Wednesday April 20, 2016 4:00pm - 4:50pm PDT
State Ballroom A

4:00pm PDT

Arbiter based Replication in Gluster- without 3x Storage Cost and Zero Split-Brains! - Ravishankar N., Red Hat
Replication is an important aspect of any storage solution for backing up data and ensuring High Availability. This is non-trivial in case of distributed systems because achieving consistency and HA with only 2x replication is difficult as there is no notion of quorum (majority voting). But not everyone can afford to store 3x copies just to achieve consensus in case of a disagreement. So they end up using 2x replication but face frequent split-brains due to flaky networks etc. - a common complaint on the gluster-users mailing list.

The presentation will focus on the arbiter configuration in glusterFS replica volumes and explain how it provides the same consistency guarantees as that of a full blown 3 way replica but without the 3x storage cost. It describes how the arbitration logic works to prevents files from ending up in split-brain, how to deploy and monitor them.

Speakers
avatar for Ravishankar N

Ravishankar N

Senior Software Engineer, Red Hat
Ravishankar a.k.a. Ravi is a believer of Linux and OSS. He started out as a Linux user circa 2004 when he got his hands on a Knoppix Live CD after buying a PC and shortly thereafter the Red Hat 9 three-CD pack. Since 2009, he has been working as a developer on Linux on multiple domains... Read More →


Wednesday April 20, 2016 4:00pm - 4:50pm PDT
State Ballroom E

4:00pm PDT

New Ceph Configurations - High Performance Without High Costs - Allen Samuels, SanDisk
Erasure Coding is traditionally limited to Archival workloads due to its performance and computational requirements. All Flash Storage changes that and enables Ceph with Erasure Coding to be a viable solution for active workloads as well. You will hear about how Erasure coding and other techniques can be used with Ceph on All Flash Storage to provide the benefits of Flash at affordable costs. We will discuss recent improvements in Ceph to making it a high performance Cinder block Storage solution on Flash, while lowering the overall storage costs.

Speakers
avatar for Allen Samuels

Allen Samuels

Engineering Fellow, Western Digital
Allen joined SanDisk in 2013 as an Engineering Fellow, he is responsible for directing software development for SanDisk’s system level products. He has previously served as Chief Architect at Weitek Corp. and Citrix, and founded several companies including AMKAR Consulting, Orbital... Read More →


Wednesday April 20, 2016 4:00pm - 4:50pm PDT
State Ballroom F

4:00pm PDT

SMB2 and Linux - A Seamless File Serving Protocol - Jeremy Allison, Google
SMB2 is the default Windows and MacOS X file sharing protocol, but what about making it the default on Linux ? After developing the UNIX extensions to the SMB1 protocol, the Samba developers are planning to add UNIX extensions to SMB2 also. Co-creator of Samba Jeremy Allison will discuss the technical challenges faced in making SMB2 into a seamless file sharing protocol between Linux clients and Samba servers, and how Samba plans to address them. Come learn how Samba plans to make NFS obsolete (again :-) !

Speakers
JA

Jeremy Allison

Samba Team/Engineer, Google
Jeremy Allison is one of the lead developers on the Samba Team, a group of programmers developing an Open Source Windows compatible file and print server product for UNIX systems. Developed over the Internet in a distributed manner similar to the Linux system, Samba is used by all... Read More →


Wednesday April 20, 2016 4:00pm - 4:50pm PDT
Congressional Room AB

5:00pm PDT

On-site Attendee Reception & Sponsor Showcase
Wednesday April 20, 2016 5:00pm - 6:30pm PDT
State Ballroom Foyer
 
Thursday, April 21
 

8:00am PDT

Registration & Breakfast
Thursday April 21, 2016 8:00am - 9:00am PDT
State Ballroom Foyer

9:00am PDT

Keynote: The Future for Linux Storage, Filesystems and Memory Management - James Bottomley, IBM; Mel Gorman, SUSE; Jeff Layton, Primary Data
The three track leaders from the recent LSF/MM Summit will present the outcomes from the summit and some indications of the future directions the Linux ecosystem is taking.
  • James Bottomley, IBM - IO Track Leader
  • Jeff Layton, Primary Data - FS Track Leader
  • Mel Gorman, SUSE Labs - MM Track Leader 

Moderators
avatar for Martin K. Petersen

Martin K. Petersen

Architect, Oracle
Martin K. Petersen has contributed to the Linux Kernel since the early nineties. As Linux Storage Architect at Oracle he focuses on future I/O and storage technologies. Martin also participates in various industry standardization efforts and co-maintains the Linux SCSI stack.

Speakers
avatar for James Bottomley

James Bottomley

Distinguished Engineer, IBM
James Bottomley is a Distinguished Engineer at IBM Research where he works on Cloud and Container technology. He is also Linux Kernel maintainer of the SCSI subsystem. He has been a Director on the Board of the Linux Foundation and Chair of its Technical Advisory Board. He went to... Read More →
MG

Mel Gorman

Kernel Engineer, SUSE Labs
Mel Gorman is a Linux kernel developer specialising in virtual memory management and has been active in the community since 2005. He is employed by SUSE Labs since 2011 and is currently the engineering lead for the SUSE Performance Team. His primary responsibility is leading a... Read More →
avatar for Jeff Layton

Jeff Layton

Sr. Principal Software Engineer, Red Hat
Jeff Layton is a longtime Linux kernel and Ceph developer who specializes in network filesystems. He maintains the Linux kernel CephFS client and is a contributor to many other open source projects.


Thursday April 21, 2016 9:00am - 9:40am PDT
State Ballroom CD

9:40am PDT

Keynote: Application Implications of Persistent Memory - Matthew Wilcox, Intel
Persistent Memory is coming to the enterprise.  Existing software will continue to work as expected, but software that’s open to change will take better advantage of it.  From performance to error handling, Matthew will discuss considerations that storage architects, programmers and system administrators should take into account when planning for this glorious new future.

Speakers

Thursday April 21, 2016 9:40am - 10:00am PDT
State Ballroom CD

10:00am PDT

Keynote: The Consequences of Infinite Storage Bandwidth - Allen Samuels, Software Architect, Fellow, Systems and Software Solutions, SanDisk
Overall increases in CPU and DRAM processing power are falling behind the massive acceleration in available storage and network bandwidth.  Storage management services are emerging as a serious bottleneck.  What does this imply for the datacenter of the future?  How will it affect the physical network and storage topologies?  And how will storage software need to change to meet these new realities?

Speakers
avatar for Allen Samuels

Allen Samuels

Engineering Fellow, Western Digital
Allen joined SanDisk in 2013 as an Engineering Fellow, he is responsible for directing software development for SanDisk’s system level products. He has previously served as Chief Architect at Weitek Corp. and Citrix, and founded several companies including AMKAR Consulting, Orbital... Read More →


Thursday April 21, 2016 10:00am - 10:15am PDT
State Ballroom CD

10:15am PDT

Break
Thursday April 21, 2016 10:15am - 10:30am PDT
State Ballroom Foyer

10:30am PDT

BlueStore: A New, Faster Storage Backend for Ceph - Sage Weil, Red Hat
Traditionally Ceph has made use of local file systems like XFS or btrfs to store its data. However, the mismatch between the OSD's requirements and the POSIX interface provided by kernel file systems has a huge performance cost and requires a lot of complexity. BlueStore, an entirely new OSD storage backend, utilizes block devices directly, doubling performance for most workloads. This talk will cover the motivation a new backend, the design and implementation, the improved performance on HDDs, SSDs, and NVMe, and discuss some of the thornier issues we had to overcome when replacing tried and true kernel file systems with entirely new code running in userspace.

Speakers
avatar for Sage Weil

Sage Weil

Ceph Project Leader, Red Hat
Sage Weil helped developed Ceph as part of his graduate studies at the University of California, Santa Cruz. After graduating he continued to develop the system and build an open source community around Ceph with the support of DreamHost. In 2012, he co-founded Inktank to productize... Read More →


Thursday April 21, 2016 10:30am - 11:20am PDT
State Ballroom E

10:30am PDT

Filesystem Fuzzing with American Fuzzy Lop (AFL) - Vegard Nossum & Quentin Casasnovas, Oracle
American Fuzzy Lop (AFL) is an open source fuzzing framework that relies on code instrumentation of a target program to find inputs that will cause the program to take new (and hopefully interesting) code paths. We have applied AFL to a range of Linux filesystem drivers and have quickly found a multitude of new bugs not found with regular ("dumb") fuzzers. Additionally, the testcases found by AFL can be used as a regression test suite that will help increase the confidence that any future change to the filesystem driver does not accidentally introduce bugs.

Through our presentation we share our techniques and tools directly with filesystem developers and make filesystem fuzzing with AFL more accessible to a wider audience. In this way, bugs can be found (and fixed) faster by the people who already know the filesystem code intimately.

Speakers
QC

Quentin Casasnovas

Senior SW engineer, Oracle
Quentin graduated from EPITA, a french engineering school, in 2010.He's started working as an embedded engineer for MathEmbedded, a Britishstart-up, then as a freelancer for Intel.  He's now working for Oracle inthe Ksplice tight knit team where he prepares Ksplice updates and try... Read More →
VN

Vegard Nossum

Senior Developer, Oracle
Vegard graduated from the University of Oslo in 2012 with a thesis on the topic of SAT solving and has been working for Oracle on Ksplice kernel updates and infrastructure ever since. His experience with the Linux kernel includes writing kmemcheck, a tool for detecting use of uninitialized... Read More →


Thursday April 21, 2016 10:30am - 11:20am PDT
State Ballroom B

10:30am PDT

Persistent Memory and the Handling of Media Errors: How to have your Poison and (not) Consume it too - Vishal Verma, Intel
2016 will be the year of Persistent Memory, and these high speed/capacity, non-volatile memory devices are just around the corner, we’re told. Similar to both current storage and memory devices, NVDIMMs will also be susceptible to developing ‘bad’ locations. Unlike DRAM, these errors will persist across reboots, and have to be handled like we do traditional storage.

With DRAM, if a poison location is read, the CPU takes a machine check exception, and typically reboots, and the poison is cleared. With NVDIMMs, the poison won’t be cleared, and it is likely that the application accessing that location will do so again, causing a reboot loop.

This talk details how we solve this problem and its different manifestations in the Linux kernel, and how we enable userspace applications to become aware of poison blocks and deal with them if they so choose.

Speakers
VV

Vishal Verma

Vishal is a software engineer working for Intel's Open Source Technology Center (OTC). He works on Linux enabling for upcoming technologies. He has been working on enabling for Persistent memory (NVDIMMs) of late, and has spoken at Vault (2015) and Linux Plumbers' Conference (2015... Read More →


Thursday April 21, 2016 10:30am - 11:20am PDT
State Ballroom A

10:30am PDT

Ganesha + Gluster scale out NFSv4 - Kaleb Keithley, Red Hat Gluster Storage
Many enterprises still heavily depend on NFS to access their data from different operating systems and applications. NFS-Ganesha is a user-space NFS server that supports NFSv3, NFSv4, NFSv4.1, as well as pNFS.

GlusterFS has now added NFS-Ganesha server to its NFS stack to eventually replace the 'native' Gluster-NFS server which supports only NFSv3. The integration with NFS-Ganesha now means additional protocol support for NFSv4+, and better security and authentication mechanisms for enterprise use. The recent release of GlusterFS-3.7 introduced clustered, multi-head, active/active NFS support using Pacemaker and Corosync for High Availability. There is also tighter integration with Gluster CLI to manage NFS-Ganesha exports. This presentation is aimed at providing a basic overview of the entire solution and step-by-step configuration.

Speakers
KS

Kaleb S. Keithley

Sr. Principal Software Engineer, Red Hat
Kaleb currently works on Red Hat Gluster Storage and Community GlusterFS. Prior to the Gluster acquisition Kaleb worked on HekaFS, a multi-tenant storage proof-of-concept based on GlusterFS. Prior to coming to Red Hat he worked for EMC's Centera and Atmos divisions on a portable... Read More →


Thursday April 21, 2016 10:30am - 11:20am PDT
State Ballroom CD

10:30am PDT

The Challenges and Opportunities of Open Source Storage - Lessons Learned from Real Life Ceph Engagement with 30+ China Customers - Jian Zhang, Intel
The increasing popularity of cloud storage is attracting more and more companies move their data from their traditional datacenters to cloud and build their storage solutions with Ceph. It is becoming more and more popular building scale out storage solutions in China storage market recently, ranging from IPDC, OEM/ODM to research insistutes. In this session we will presents the challenges and opportunities Ceph based storage solutions faced in those customers. We will first introduce the advantages Ceph demonstrated in their real life production cluster; then shift to the common issues they encountered in different scnearios, including stability concerns like OSD flapping and slow requests, feature missing - high peroformance caching, performance problem. At the last, we will focus on the areas we can improve or optimize for more opportunities like smart failure report and detection.

Speakers
avatar for Jian Zhang

Jian Zhang

Intel
Jian is a senior software engineer of the Cloud Storage Engineer group from Intel Asia Pacific Research & Development Ltd. He is responsible for developmign software storage architecture and solutions based on Intel platforms. He has 7 years of software development, performance analysis... Read More →


Thursday April 21, 2016 10:30am - 11:20am PDT
State Ballroom F

11:30am PDT

Huge Indexes: Algorithms to Track Objects in Cache Tiers - Dan Lambright, Red Hat
A storage cache must implement an index to quickly locate the objects it holds. The index’s design is impacted by the storage medium. For example, a memory cache’s requirements differ from a cache built using storage tiers. In the former, an in-memory hash table or balanced tree may suffice. But in the later, those structures may stumble. The metadata required to track such a large number of objects won’t fit in memory. In such cases, the challenge is to find an index that scales. A further consideration is wether to track elements in LRU order, in which case a sorting mechanism is called for. This talk contrasts 3 cache tiering implementations in Linux that have tackled this problem from GlusterFS, Ceph, and DMcache. Solutions vary from bloom filters to sqlite databases. We will explore their relative pros and cons along the dimensions of performance, overhead, complexity, and more.

Speakers
avatar for Dan Lambright

Dan Lambright

Software Engineer, Red Hat
Dan Lambright is a principal software engineer at Red Hat, where he works on distributed storage systems. Prior to Red Hat is worked at EMC, DELL, and several storage startups. He also teaches as an adjunct professor at the University of Massachusetts, Lowell.


Thursday April 21, 2016 11:30am - 12:20pm PDT
State Ballroom A

11:30am PDT

Scaling the Btrfs Free Space Cache - Omar Sandoval, Facebook
Btrfs is being deployed in more and more production systems, and as a result it has matured in stability and performance. In particular, Facebook's production systems push Btrfs especially hard, which has uncovered some unanticipated scalability bottlenecks. In this presentation, Omar Sandoval will discuss the shortcomings of the original Btrfs free space cache and explain the design of its replacement, the free space tree, which is Btrfs' latest solution to the classic filesystem problem of tracking unallocated space.

Speakers
OS

Omar Sandoval

Omar Sandoval is employed by Facebook where he works on the Linux kernel team. He has made numerous contributions to Btrfs, the most notable being the free space tree.


Thursday April 21, 2016 11:30am - 12:20pm PDT
State Ballroom B

11:30am PDT

GlusterD 2.0 - Managing Distributed File System Using a Centralized Store - Atin Mukherjee, Red Hat
In a typical distributed system managing n number of nodes in an effective way is always a challenge. Distributed systems always demands to meet CAP theorem which is really hard to meet with a good performance numbers. All the nodes participating in the cluster should have the consistent data which is one of the criteria of the CAP theorem. This could be achieved by keeping the configuration details across all the nodes in the cluster,
however this algorithm doesn’t scale (considering n X n number of exchange of information) and can end up having split brain situations. This could be avoided having distributed consistent store across m number of nodes (considered) as leaders where m < n. There are some existing technologies like etcd/consul which provides good abstraction and APIs for centralized store which can be consumed here.

Speakers
avatar for Atin Mukherjee

Atin Mukherjee

Senior S/W Engineer, Red Hat
Atin has been working with Red Hat India Pvt Ltd for its storage business unit as a senior s/w engineer. His key responsibility is to maintain high quality code for GlusterD, the management daemon of Gluster. Prior to storage, Atin has worked in different domains like telecom, BFS... Read More →



Thursday April 21, 2016 11:30am - 12:20pm PDT
State Ballroom CD

11:30am PDT

openATTIC as a Ceph Management System - Lenz Grimmer, openATTIC & Lars Marowsky-Brée, SUSE
Distributed storage solutions such as Ceph pose unique challenges in Enterprise environments. Currently, there is no comprehensive Open Source management system available that integrates Ceph with traditional storage approaches. To remedy this, it-novum and SUSE have partnered to collaborate to bring Ceph support to the openATTIC project.
openATTIC is a comprehensive storage management solution with a clean and intuitive web-based user interface. In development since 5 years, it is based on a modern and extensible architecture built with proven web technologies (AngularJS, Bootstrap) and frameworks (Django). It follows a completely Open Source development process. It supports a wide range of storage technologies, both file- and block-based, e.g. NFS, CIFS, iSCSI and FibreChannel (LIO). All functionality is available via the web, CLI, and RESTful APIs through a common backend.

Speakers
LG

Lenz Grimmer

openATTIC
Lenz Grimmer is product manager for the openATTIC open source storage management system at it-novum. Doing Linux and OSS since 1995, he worked at SUSE, MySQL AB/Sun Microsystems/Oracle in engineering, community relations and product management roles. Frequent speaker about Linux/OSS... Read More →
LM

Lars Marowsky-Brée

SUSE
Lars Marowsky-Brée is a SUSE Distinguished Engineer and the architect for SUSE's Ceph-based SDS products. In his 16 years at SUSE, he has been most well-known for his work on HA, storage, clustering, and distributed systems. He is a frequent speaker and author.


Thursday April 21, 2016 11:30am - 12:20pm PDT
State Ballroom F

12:20pm PDT

Lunch Break (Attendees on Own)
Thursday April 21, 2016 12:20pm - 2:00pm PDT
TBA

2:00pm PDT

Burst Buffer High Performance Storage with 2 Tiers, OrangeFS and Object Stores in the Cloud and on Premise - Mike Marshall, Clemson University
Over the past decade object stores have become more and more prevalent. They provide infinitely addressable storage but cannot be accessed by millions of lines of currently written IO code. The 2-Tier project is integrating the concepts of distributed OrangeFS burst buffer storage optimized for flash with the ability to tier data seamlessly to an object store. This talk will give an overview of the project, its status and provide attendees a peek into recent technology demonstrations on premise and in the AWS cloud with CloudyCluster.

Speakers
avatar for Mike Marshall

Mike Marshall

Omnibond
Mike Marshall was introduced to Unix while a Computer Science student at Clemson University in South Carolina around 1982. Clemson students used an IBM Mainframe at that time, but Mike had a part-time job in the Forestry department where they were using a Radio Shack model 16B running... Read More →


Thursday April 21, 2016 2:00pm - 2:50pm PDT
State Ballroom E

2:00pm PDT

Toward A Unified Block IO Controller - Shaohua Li, Facebook
Linux block cgroup currently has two controllers. blk-throttle is bandwidth/IOPS based. CFQ cgroup is proportion based. Having two separate IO controllers is painful. And CFQ is known not performing optimal for high end storage. Also block multiqueue doesn't support CFQ, leaving block-mq only supports blk-throttle. It's time to have a unified IO controller supporting both bandwidth/IOPS and proportion based control and performing well with high end storage. In this presentation, Shaohua Li will discuss current status of IO controller, challenges of a unified IO controller, proposed solution and development status.

Speakers
SL

Shaohua Li

Software Engineer, Facebook
Shaohua Li is a software engineer in Facebook, working on storage and performance tuning. He has more than 10 years experience in Linux kernel. His interest is storage, memory management and performance tuning.


Thursday April 21, 2016 2:00pm - 2:50pm PDT
State Ballroom A

2:00pm PDT

Understanding Client Side Shared Cache with Pblcacle - Luis Pabon, Red Hat
Cloud data centers deploy a large number of compute nodes to meet increasing application needs. The computes nodes introduce storage cluster contention, resulting in increased application response time. At last year's Linux Vault, we introduced Pblcache[1], a persistent, write-through, user space application block cache based on NetApp’s Mercury paper. It was demonstrated that a it not only reduced application response time, but also dramatically increases the IOPS capacity of a backend storage system. In this presentation we discuss the benefits and disadvantages of a shared client side block cache architecture based on Pblcache. We also discuss the possible methods of integrating Pblcache with Ceph libRBD and QEMU.

[1] https://github.com/pblcache/pblcache

Speakers
avatar for Luis Pabon

Luis Pabon

Principal Software Engineer, CoreOS
Luis Pabón is a software engineer at CoreOS. Prior to joining CoreOS in November of 2016, he worked at Red Hat Storage, NetApp Advanced Technology Group, and at EMC on various storage products. He also previously presented at Vault in 2016 and 2015.


Thursday April 21, 2016 2:00pm - 2:50pm PDT
State Ballroom B

2:00pm PDT

Deploying pNFS over Distributed File Storage - Jiffin Tony Thottan, Red Hat
pNFS is the clustered solution provided by standard NFS protocol which allow NFS clients to access directly and parallelly the Storage device. This is achieved by the separation of metadata from the datapath. Therefore pNFS solution provides better bandwidth utilization, loading balancing across storage devices and significant performance improvement for I/O's. Now consider applying the pNFS over a Distributed Storage instead of native NFS. Here instead of talking to single server, pNFS client can interact directly with all the Storage Servers , i.e I/O distribution become much more effective and also avoids performance bottleneck with in a single server. In this session, Jiffin Tony Thottan is planning to give an overview about pNFS, deploying simple pNFS solution on a Distribute File Storage by taking example of glusterfs + NFS-ganesha and challenges involved in while doing so.

Speakers
avatar for Jiffin Tony Thottan

Jiffin Tony Thottan

Senior Software Engineer, Red Hat
Jiffin Tony Thottan is part of the Red Hat Storage Team working as a Senior Software Engineer. Initially part of the NFS team and contributed to GlusterFS , NFS-Ganesha projects. He was an active contributor for both projects worked on implementing different features like nfsv4acls... Read More →


Thursday April 21, 2016 2:00pm - 2:50pm PDT
State Ballroom CD

2:00pm PDT

Open Source Software-defined Storage Management with CoprHD - John Mark Walker, EMC
CoprHD is an open source software-defined storage controller and API platform. It enables policy-based management and cloud automation of storage resources for block, object and file storage providers, for both traditional storage arrays as well as scale-out storage software. In this talk, we will walk through scenarios where CoprHD can be useful, giving storage admins the ability to add new life to their legacy storage systems, as well as bridging the divide between old and new:

- You have client-server applications running on large servers that connect to centralized storage services. Your developers have created services running on OpenStack infrastructure, and you want to utilize your centralized storage with the same control plane
- You have several tiers of storage, with various parts being scale-out filesystems or object storage and the rest of it SAN arrays over iSCSI. You want to be able to set policies such that your applications utilize the correct tier of storage depending on the use case.

For each use case, we'll demonstrate a solution utilizing CoprHD, and then we'll walk through some best practices:

- When do you need an SDS controller, and when do you not
- I'm using Cinder, Manila, and/or Swift with OpenStack. When should I consider adding CoprHD to the mix?
- I have an orchestration framework and don't need yet another dashboard - how do I use a REST API?
- How do I add my storage system if it's not yet supported?

Speakers

Thursday April 21, 2016 2:00pm - 2:50pm PDT
State Ballroom F

3:00pm PDT

BoF: NVMe over Fabrics - Christoph Hellwig
This BOF session will provide an introduction to the NVMe over Fabrics protocol and the Linux reference implementation from some of the authors of the specification and the Linux implementation.

Speakers
CH

Christoph Hellwig

Christoph Hellwig has been working on Linux Storage and File system projects for 15 years. He works all the way up and down the Storage and File system stack, and runs a business focused on Linux Storage architecture and training.


Thursday April 21, 2016 3:00pm - 3:50pm PDT
State Ballroom E

3:00pm PDT

Building a File Sync and Share Mesh Network with Federation - Frank Karlitschek, ownCloud
The future of global file sync and share is federation of independent cloud servers instead of centralized and monolithic services. It should be possible that dentralized and distributed cloud server work together and behave from a user point of view as one big virtual service. Similar to Email where different organizations can host their own mail server but they exchange mails as if it one big centralized service.

This talk will cover how a global federated mesh network can be build on top of Linux, cluster file systems, virtualization cloud services, http an open APIs, giving an outlook on the upcoming strategical and architectural decisions of the open source community to move the internet to the next level in the years to come.



Speakers
FK

Frank Karlitschek

ownCloud
Frank Karlitschek is a long time open source contributor and former board member of the KDE e.V. In 2010 he started the ownCloud project and is leading the community project since then. In 2011 he co-founded ownCloud Inc. to offer commercial services around ownCloud. Frank has spoken... Read More →


Thursday April 21, 2016 3:00pm - 3:50pm PDT
State Ballroom B

3:00pm PDT

Efficient Compression Hardware Acceleration with Intel® Quickassist Batch and Pack Feature
Bringing down storage cost is the key to any data storage solution. With big Data, IoT growth and social medias, pressure is put on data volume as information being stored grows exponentially. To help this evolving market, Intel is introducing the Quickassist technology.
The Intel® Quickassist Technology provides security and compression acceleration capabilities to improve performance and efficiency on Intel Architecture platforms. Server, networking, big data, and storage applications use Intel QuickAssist Technology to offload servers from handling compute-intensive operations, such as:
 Symmetric cryptography functions including cipher operations and authentication operations
 Public key functions including RSA, Diffie-Hellman, and elliptic curve cryptography
 Compression and decompression functions including DEFLATE
Ultimately, Intel® Quickassist Technology enables users to meet the demands of ever-increasing amounts of data, especially data with the need for encryption and compression. This helps users to ensure applications are fast, secure and available.
Lately Intel released a new feature for the compression service called Batch and Pack. This new feature enables the application that uses the Quickassist API to improve the system performance by reducing not only the compute cycles needed, but also reducing IoPs, DRAM bandwidth and cache utilization. Furthermore it enables fast and efficient reads performance by providing metadata to the application.

Thursday April 21, 2016 3:00pm - 3:50pm PDT
State Ballroom A

3:00pm PDT

Accelerating Ceph performance Profiling and Tuning with CeTune - Jian Zhang, Intel
Ceph is an open-source, massively scalable, software-defined storage system, the demand to benchmarking, profiling and tuning Ceph in an easier and efficient way is increasing. However in production environment, customers still face numerous challenges to drive best performance, including how to troubleshooting the performance issues, identify the best tunings and handle the unexpected performance regression between frequent releases.
In this session, Chendi will introduce the components and capabilities of CeTune; present how to use CeTune to identify software stack bottlenecks and analyze unexpected software behavior; and at the end of the presentation, Chendi will present two case studies on how we improve Ceph cluster performance with CeTune.

Speakers
avatar for Jian Zhang

Jian Zhang

Intel
Jian is a senior software engineer of the Cloud Storage Engineer group from Intel Asia Pacific Research & Development Ltd. He is responsible for developmign software storage architecture and solutions based on Intel platforms. He has 7 years of software development, performance analysis... Read More →


Thursday April 21, 2016 3:00pm - 3:50pm PDT
State Ballroom F

3:00pm PDT

Storage as a Service with Gluster - Vijay Bellur, Red Hat
Gluster is a scale-out storage system that works on commodity hardware. Storage as a Service is an evolving paradigm that enables organizations to provision, consume and de-provision storage on demand. In this session, Vijay will provide an introduction to the architecture of Gluster, Storage as a Service paradigm and recent changes in Gluster that enable Storage as a Service better. Vijay will also detail integrations with projects like Docker, Kubernetes, OpenShift etc. that allows Gluster storage to be consumed as a service.

Speakers
VB

Vijay Bellur

Senior Principal Software Engineer, Red Hat
Vijay Bellur is a co-maintainer for the upstream GlusterFS project and was an architect at Gluster before its acquisition by Red Hat in 2011. He has been involved with building enterprise storage and scalable, distributed systems for the past decade. Vijay works out of the Red Hat... Read More →


Thursday April 21, 2016 3:00pm - 3:50pm PDT
State Ballroom CD

4:00pm PDT

BoF: NVMe over Fabrics - Christoph Hellwig (cont'd)
This BOF session will provide an introduction to the NVMe over Fabrics protocol and the Linux reference implementation from some of the authors of the specification and the Linux implementation.

Speakers
CH

Christoph Hellwig

Christoph Hellwig has been working on Linux Storage and File system projects for 15 years. He works all the way up and down the Storage and File system stack, and runs a business focused on Linux Storage architecture and training.


Thursday April 21, 2016 4:00pm - 4:50pm PDT
State Ballroom E

4:00pm PDT

Lessons Learned Containerizing GlusterFS and Ceph with Docker and Kubernetes - Huamin Chen, Red Hat
With the influx of cutting edge technologies around Linux Containers, contemporary Software Defined Storage (SDS) platforms face significant opportunities as well as challenges. Containers can overcome some of the differences between Linux distributions which can make SDS platforms easier to deploy. For example, a Ceph installation has to be aware of the differences between Linux distributions and carefully pick various dependent software packages. With the introduction of the ceph-docker project, Ceph daemons are put into Linux Containers and started on the Linux host using docker run, therebysimplifying the deployment and administration overhead. An in-depth look of containerizing SDS storage servers is provided in the talk, using lessons learned with Ceph and GlusterFS as a reference.

Likewise, containers can also make SDS platforms easy to consume for the clients. Putting client software into Linux Containers can eliminate package dependency issues which is especially useful when containerized applications are scheduled on random hosts using container orchestrators. However, there are several non-trivial technical issues associated with filesystem client containerization such as setting proper mount propagation modes so that mount points created by containers are visible by the host and other containers and lifecycle management of long running daemon process (as often seen in FUSE based filesystems). This talk examines these challenges in detail.

Lastly, this talk will explore what SDS platforms can offer to provide a better fit for Linux Containers. For example, Docker relies on SELinux labels to permit containers access filesystems. While SELinux is supported by filesystems such as GlusterFS and NFSv4.2, there are still many filesystems that don't support SELinux yet. This point will be illustrated in the presentation using Kubernetes. 

Speakers
avatar for Huamin Chen

Huamin Chen

Engineer, Red Hat
Huamin Chen is a principal software engineer within Red Hat's Emerging Technologies Group. He contributes to Kubernetes and Ceph via his github account rootfs.


Thursday April 21, 2016 4:00pm - 4:50pm PDT
State Ballroom A

4:00pm PDT

Panel Discussion: Large-scale Enterprise Automation of Open Source File Systems at Clemson University
Clemson is migrating from large proprietary file system technologies to open source software. We are mid­deployment of over 10 petabytes of SAS­attached JBOD storage for multiple use cases including traditional file system backup solutions, database backups, user/group file services, and HPC file services.

From the start, deployment and operation of these environments presented challenges; from hardware decisions to performance profiling/benchmarking, from storage pool creation to day­to­day user and dataset management. We’ll discuss these real world pain points and how we solved them (or plan to solve them). Born of our frustrations, what started out as a few basic administration scripts has become a multifunctional automated administration utility whose purpose is to make large­scale deployment and operation of open technologies realistic and feasible.

Speakers
MC

Mike Cannon

Clemson University
Mike Cannon has been with Clemson University since December 2006. Prior to Clemson, Mike was the Storage Manager for NASA. Mike is the director of CIS.
KS

Kevin Stone

Technical Lead & Design Architect, Clemson University
Kevin Stone has held a variety of roles combining systems, programming, andnetworking. Prior to coming to Clemson in 2008 he built backend infrastructure for a USregional ISP. He is currently the technical lead and design architect for Clemson’s coreinfrastructure team.


Thursday April 21, 2016 4:00pm - 4:50pm PDT
State Ballroom B

4:00pm PDT

Big Data Analytics on Object Storage - Hadoop Over Ceph Object Storage with SSD Cache - Jian Zhang, Intel
The Massachusetts Open Cloud (MOC) initiative is a consortium of private and public institutions across Massachusetts dedicated to the creation of publicly available cloud computing that will drive Big Data innovation. In this section, we look at running Hadoop over Ceph Object Storage on a large MOC type cloud environment and several common problems are identified. In the Storage deployment part, we are using a new mid-tier cache architecture with Intel NVMe SSDs as the warm-tier to speed up the performance. We will show the HW configurations and performance results of the reference architecture.

Speakers
avatar for Jian Zhang

Jian Zhang

Intel
Jian is a senior software engineer of the Cloud Storage Engineer group from Intel Asia Pacific Research & Development Ltd. He is responsible for developmign software storage architecture and solutions based on Intel platforms. He has 7 years of software development, performance analysis... Read More →


Thursday April 21, 2016 4:00pm - 4:50pm PDT
State Ballroom F

4:00pm PDT

Bringing GFS2 Clusters to IBM z Systems - Steven Whitehouse, Red Hat
IBM mainframes can be a mystery to those more familiar with x86 Linux clusters, and high availability Linux clustering can be a mystery to those more familiar with mainframe systems. In this presentation, Steven Whitehouse gives an overview of how the pieces fit together to enable an s390x HA Linux cluster for a shared GFS2 filesystem.

Speakers
SW

Steven Whitehouse

Senior Manager, RHEL Filesystems, Red Hat
Steven Whitehouse currently manages the RHEL Filesystems team at Red Hat. His introduction to Linux kernel development came in 1993 when he wrote a small patch for AX.25, he is also the previous maintainer of Linux DECnet and the GFS2 Filesystem. Steven has spoken at a number of conferences... Read More →


Thursday April 21, 2016 4:00pm - 4:50pm PDT
State Ballroom CD
 
Filter sessions
Apply filters to sessions.