GreyBeards talk HPC storage with Molly Rector, CMO & EVP, DDN

oIn our 27th episode we talk with Molly Rector (@MollyRector), CMO & EVP of Product Management/Worldwide Marketing for DDN.  Howard and I have known Molly since her days at Spectra Logic. Molly is also on the BoD of SNIA and Active Archive Alliance (AAA), so she’s very active in the storage industry, on multiple dimensions and a very busy lady.

We (or maybe just I) didn’t know that DDN has a 20 year history in storage and in servicing high performance computing (HPC) customers. It turns out that more enterprise IT organizations are starting to take on workloads that look like HPC activity.

In HPC there are 1000s of compute cores that are crunching on PB of data. For Oil&Gas companies, it’s seismic and wellhead analysis; with bio-informatics it’s genomic/proteomic analysis; and with financial services, it’s economic modeling/backtesting trading strategies. For today’s enterprises such as retailers, it’s customer activity analytics; for manufacturers, it’s machine sensor/log analysis;  and for banks/financial institutions, it’s credit/financial viability assessments. Enterprise IT might not have 1000s of cores at their disposal just yet, but it’s not far off. Molly thinks one way to help enterprise IT is to provide a SuperComputer as a service (ScaaS?) offering, where top 10 supercomputers can be rented out by the hour, sort of like a supercomputing compute/data cloud.

We start early talking about DDN WOS: object store, which can handle archive to cloud or backend tape libraries. Later we discuss DDN ExaScaler and GridScaler, which are NAS appliances for Lustre and massively scale out, parallel file system storage, respectively.

Another key supercomputing storage requirement is  predictable performance. Aside from sophisticated QoS offerings across their products, DDN also offers the IME solution, a bump in the cable, caching system, that can optimize large and small file IO activity for backend DDN NAS scalers. DDN IME is stateless and can be removed from the data path while still allowing IT access  to all their data.

While we were discussing DDN storage interfaces, Molly mentioned they were working on an Omni Path Fabric.  Intel’s new Omni Path Fabric is intended to replace rack scale PCIe networks for HPC.

This months edition is not too technical and runs just over 45 minutes. We only got to SNIA and AAA at the tail end and just for a minute or two. Molly’s always fun to talk to, with enough technical smarts to keep Howard and I at bay, at least for awhile :). Listen to the podcast to learn more.

HeadshotMolly Rector, CMO and EVP Product Management & Worldwide Marketing,  DDN

With 15 years of experience working in the HPC, Media and Entertainment, and Enterprise IT industries running global marketing programs, Molly Rector serves as DDN’s Chief Marketing Officer (CMO) responsible for product management and worldwide marketing. Rector’s role includes providing customer and market input into the company’s product roadmap, raising the Corporate brand visibility outside traditional markets, expanding the partner ecosystem and driving the end-to-end customer experience from definition to delivery.

Rector is a founding member and currently serves as Chairman of the Board for the Active Archive Alliance. She is also the Storage Networking Industry Association’s (SNIA) Vice Chairman of the Board and the Analytics and Big Data committee Vice Chairman. Prior to joining DDN, Rector was responsible for product management and worldwide marketing as CMO at Spectra Logic. During her tenure at Spectra Logic, the company grew revenues consistently by double digits year-over-year, while also maintaining profitability. Rector holds certifications as CommVault Certified System Administrator; Veritas Certified Data Protection Administrator; and Oracle Certified Enterprise DBA: Backup and Recovery. She earned a Bachelor’s of Science degree in biology and chemistry.

Graybeards talk object storage with Russ Kennedy, Sr. VP Prod. Strategy & Cust. Solutions Cleversafe

In our 15th podcast we talk object storage with Russ Kennedy Senior V.P. Product Strategy and Customer Solutions, Cleversafe. Cleversafe is a 10 year old  company sellinge scale out, object storage solutions with a number of interesting characteristics. Howard and I had the chance to talk with Cleversafe at SFD4 (we suggest you view the video if you want to learn more), just about a year ago. But we have both known Russ for a number of years and Ray has done work for Cleversafe in the past.

We haven’t talked about objects storage in the past so this podcast goes over some foundational information about it. Object storage is starting to become more mainstream and general purpose as more interfaces become available and as the amount of data being stored grows out of sight.  Object storage has a flat name space, rich metadata, and relatively rudimentary, native storage access methods. But on top of this one can build sophisticated PB storage environments that can handle high amounts of data throughput, spread this data across multiple sites, and provide highly fault tolerant/highly available storage environments. Object storage will never replace OLTP block oriented storage but for environments with massive unstructured data repositories, it’s probably the best solution out there today.

Cleversafe has some unique characteristics namely their ability to split object storage elements over multiple disparate locations and use erase coding to supply data availability in the event of storage, server, or site failures. Some other object storage systems use 2- or 3-way replication to protect against data loss. But Russ makes the apt comment that when you are talking about PBs of data, replication can cause your storage costs to go up quite fast. Someone mentioned that there are Cleversafe customers that have 15-9’s data availability using erasure coding with only 150% of the original capacity. This is significantly more reliability than what could be obtained by dual or even triple redundancy alone. However, I always find that the weak link in data reliability  discussions such as these is always the software that implements the solution, not the data integrity architecture of the system.

Currently, Cleversafe has many multi-PB installations some of which span continents and others of which are looking to breach an EB (10**21 bytes of storage) of object data. We asked what these customers look like and Russ said lots of Accessors®  (stateless on- and off-ramps for object data) and a lots more Slicestors® (servers holding the statefull storage).

One of the significant barriers to higher object storage adoption has always been their unique, native object storage access protocols. But these days, it turns out that Amazon’s S3 protocol has become the defacto standard for object storage and this is helping accelerate object storage adoption.  In the podcast we discuss how historically, defacto standards have been a successful approach used to introduce new storage access protocols. Cleversafe offers its native RESTful access protocol, S3 and a smattering of others but you can also use other partner solutions if you need standard file access to the object store.

Cleversafe also offers HDFS as another access protocol. With Cleversafe HDFS, Hadoop can access all of it’s data from the Cleversafe object repository. In addition, you can run Hadoop MapReduce on its Slicestor nodes, if you want. Apparently, moving PB of data to analyze it and then deleting it is an expensive and very time consuming proposition, and of course native HDFS uses triple redundancy…

In the podcast, we get into object storage, some of Cleversafe’s advanced functionality, access protocol evolution and more. Listen to the podcast to learn more…

This months episode comes in at a little more than 47 minutes.

Russ Kennedy

Russ Kennedy, Sr VP Product Strategy & Customer Solutions

Russ Kennedy brings more than 20 years experience in the storage industry to Cleversafe as the company’s Senior Vice President of Product Strategy and Customer Solutions. Having rolled up his sleeves working on automated tape libraries, Russ is still attracted to the technological challenges that have shaped the industry and particularly to the innovative approach that Cleversafe delivers to storage.

Russ joined the company initially in 2007 and left in 2009, staying on in an advisory role. In 2011, Russ rejoined the company seeing a clear opportunity to solve the storage needs surrounding the exponential growth of big data and the unique impact that Cleversafe delivers over traditional systems.

Previously, Russ served as the Vice President of Competitive Intelligence at CA Technologies, and was the Senior Director of Engineering and Product Management at Thin Identity Corporation. Russ has an MBA from the University of Colorado at Denver and a bachelor’s degree in Computer Science from Colorado State University.

Greybeards video interview with Marc Farley, Marketing Quaddra Software

This is our first GBoS video interview, sorry about the quality of the video but the audio came through just fine.

Quaddra Software provides a file analytics solution which supplies information and analytical data about massive file stores.This allows customers to get a better handle on just what’s in their file stores.

Please see https://quaddra-sw.com or https://greybeardsonstorage.com/ for more information.

GreyBeards talk enterprise Hadoop with Jack Norris, CMO MapR Technologies

Welcome to our third episode. In this podcast we take a step up from the technical depths to talk with Jack Norris, Chief Marketing Officer of MapR Technologies about their enterprise class Hadoop distribution that customers far and wide are finding a viable solution to today’s Hadoop problems.

This months podcast runs a little over 37 minutes (still trying to get this down – but obviously not succeeding). It seems just when I think we’re close to the end, Howard or I jump in with yet another question that takes us down a different tack.

Back to the episode, the advantages of big data are obvious to many. Hadoop and its ecosystem allow IT to tackle jobs previously impossible to perform, clustering together 1000s of servers, and co-locating compute with data. All this provides an immense platform to perform data analytics on a scale never before possible.

Nonetheless, Hadoop has some weaknesses based on its heritage which make it less than it could be and that’s where MapR Technologies steps in. MapR has taken the Hadoop distribution and yanked out HDSF, replacing it with a completely re-architected/reimplemented enterprise class, fault tolerant storage service. But MapR is more than just better and faster storage, so listen to our talk with Jack to find out more about it. I am really having trouble getting my head around snapshotting a PB of Hadoop data at a whack …

 


Jack Norris, CMO, MapR

Jack leads worldwide marketing efforts for MapR. Jack has over 20 years of enterprise software marketing and product management experience in defining and delivering analytics, storage, and information delivery products. Jack has also held senior executive roles with EMC, Rainfinity, Brio Technology, SQRIBE, and Bain and Company.