Filesystems – Page 10

My first TechFieldDay

By cfheoh | October 16, 2018 - 1:05 am |October 16, 2018 Cisco, Cloud, DellEMC, Drivescale, Filesystems, High Performance Computing, Linux, NetApp, NFS, Oracle, Performance Caching, Scale-out architecture, Storage Field Day, Tech Field Day

3 Comments

[Preamble: I have been invited by GestaltIT as a delegate to their TechFieldDay from Oct 17-19, 2018 in the Silicon Valley USA. My expenses, travel and accommodation are paid by GestaltIT, the organizer and I was not obligated to blog or promote their technologies presented at this event. The content of this blog is of my own opinions and views]

I have attended a bunch of Storage Field Days over the years but I have never attended a Tech Field Day. This coming week, I will be attending their 17th edition, TechFieldDay 17, but my first. I have always enjoyed Storage Field Days. Everytime I joined as a delegate, there were new things to discover but almost always, serendipity happened.

Continue reading →

The Malaysian Openstack storage conundrum

By cfheoh | October 9, 2018 - 8:24 pm |October 9, 2018 CIFS, Cloud, Data Availability, Data Management, Disaster Recovery, Excelero, Fibre Channel, Filesystems, High Performance Computing, HP, Infiniband, Linux, Mellanox, NAS, NFS, NVMe, Openstack, RDMA, Server SAN, SMB, Snapshots, Software Defined Storage, Storage Optimization, Virtualization

Huawei Dorado – All about Speed

By cfheoh | April 1, 2018 - 11:18 am |April 1, 2018 Analytics, Appliance, Big Data, Data, Data Fabric, Data Management, Deduplication, Disks, Filesystems, Flash, High Performance Computing, Hitachi Vantara, Huawei, Hyperconvergence, Machine Learning, NetApp, Performance Benchmark, Performance Caching, Reliability, Scale-out architecture, Snapshots, Solid State Devices, Storage Field Day, Storage Optimization, Virtualization

3 Comments

[Preamble: I was a delegate of Storage Field Day 15 from Mar 7-9, 2018. My expenses, travel and accommodation were paid for by GestaltIT, the organizer and I was not obligated to blog or promote the technologies presented at this event. The content of this blog is of my own opinions and views]

Since Storage Field Day 15 3 weeks ago, the thoughts of the session with Huawei lingered. And one word came to describe Huawei Dorado V3, their flagship All-Flash storage platform is SPEED.

My conversation with Huawei actually started the night before our planned session at their Santa Clara facility the next day. We had a evening get-together at Bourbon Levi’s Stadium. I was with my buddy, Ammar Zolkipli, who coincidentally was in the Silicon Valley for work. Ammar is from Hitachi Vantara Japan, and has been a good friend of mine for over 17 years now.

Shortly, the Huawei team arrived to join the camaraderie. And we introduced ourselves to Chun Liu, not knowing that he is the Chief Architect at Huawei. A big part of that evening was our conversation with him. Ammar and I have immersed in the Oil & Gas EP (Exploration & Production) data management and petrotechnical applications when he was in Schlumberger and after that a reseller of NetApp. I was a Consulting Engineer with NetApp back then. So, the 2 of us started blabbering (yeah, that would be us when we get together to talk technology).

I observed that Chun was very interested to find learn about real world application use cases that would push storage performance to its limits. And I guessed that the best type of I/O characteristics would be small block, random I/O and billions of them, with near-real time latency. After that evening I did some research and could only think of a few, such as deep analytics or some applications with needs for Monte Carlo simulations. Oh, well, maybe I would share that with Chun the following day.

The moment the session started, it was already about the speed prowess of Huawei Storage. It was like the greyhounds unleashed going after the rabbit. In the lineup, the Dorado series stood out.

Continue reading →

Own the Data Pipeline

By cfheoh | March 27, 2018 - 9:50 am |March 27, 2018 Analytics, API, Backup, Big Data, Cloud, Data, Data Archiving, Data Availability, Data Fabric, Data Management, Disaster Recovery, Filesystems, FreeNAS, Hadoop, Hadoop Clusters, HDS, High Performance Computing, Hitachi Vantara, Hyperconvergence, Machine Learning, NAS, NetApp, NFS, Reliability, ROBO, Software Defined Storage, Software-defined Datacenter, Storage Field Day, Storage Tiering, Virtualization

2 Comments

I am a big proponent of Go-to-Market (GTM) solutions. Technology does not stand alone. It must be in an ecosystem, and in each industry, in each segment of each respective industry, every ecosystem is unique. And when we amalgamate data, the storage infrastructure technologies and the data management into the ecosystem, we reap the benefits in that ecosystem.

Data moves in the ecosystem, from system to system, north to south, east to west and vice versa, random, sequential, ad-hoc. Data acquires different statuses, different roles, different relevances in its lifecycle through the ecosystem. From it, we derive the flow, a workflow of data creating a data pipeline. The Data Pipeline concept has been around since the inception of data.

To illustrate my point, I created one for the Oil & Gas – Exploration & Production (EP) upstream some years ago.

Continue reading →

The leapfrog game in Asia with HPC

By cfheoh | March 22, 2018 - 12:52 pm |March 22, 2018 Analytics, Artificial Intelligence, Cloud, Data Management, Deep Learning, Filesystems, High Performance Computing, Infiniband, Katana Logic, Machine Learning, Mellanox, Performance Benchmark, RDMA, ThinkParq

Cohesity SpanFS – a foundational shift

By cfheoh | March 11, 2018 - 1:26 am |March 11, 2018 Analytics, API, Appliance, Big Data, Business Continuity, Cloud, Cohesity, Data, Data Archiving, Data Availability, Data Management, Deduplication, Disaster Recovery, Filesystems, High Performance Computing, Hyperconvergence, MapReduce, Nutanix, Performance Benchmark, Performance Caching, Reliability, ROBO, Scale-out architecture, Snapshots, Software Defined Storage, Software-defined Datacenter, Storage Field Day, Storage Optimization, Storage Tiering, Uncategorized, Virtualization

3 Comments

Cohesity SpanFS impressed me. Their filesystem was designed from ground up to meet the demands of the voluminous cloud-scale data, and yes, the sheer magnitude of data everywhere needs to be managed.

We all know that primary data is always the more important piece of data landscape but there is a growing need to address the secondary data segment as well.

Like a floating iceberg, the piece that is sticking out is the more important primary data but the larger piece beneath the surface of the water, which is the secondary data, is becoming more valuable. Applications such as file shares, archiving, backup, test and development, and analytics and insights are maturing as the foundational data management frameworks and fast becoming the bedrock of businesses.

The ability of businesses to bounce back after a disaster; the relentless testing of large data sets to develop new competitive advantage for businesses; the affirmations and the insights of analyzing data to reduce risks in decision making; all these are the powerful back engine applicability that thrust businesses forward. Even the ability to search for the right information in a sea of data for regulatory and compliance reasons is part of the organization’s data management application.

Continue reading →

Magic happening

By cfheoh | March 8, 2018 - 1:30 pm |March 8, 2018 Amazon, Apple, Backup, BYOD, Cloud, Data Management, Deduplication, Disks, Dropbox, Filesystems, Object Storage, Reliability, Scale-out architecture, Security, Software Defined Storage, Storage Field Day, Uncategorized

2 Comments

[Preamble: I am a delegate of Storage Field Day 15 from Mar 7-9, 2018. My expenses, travel and accommodation are paid for by GestaltIT, the organizer and I am not obligated to blog or promote the technologies presented at this event. The content of this blog is of my own opinions and views]

The magic is happening.

Dropbox, the magical disruptor, is going IPO.

When Dropbox first entered into the market which eventually termed as BYOD (Bring your Own Device), it was a phenomenon. There was nothing else that matched its simplicity and ease-of-use. A file uploaded into the cloud was instantaneously available on the tablets and smart phones. It was on every storage vendor’s presentation slides, using Dropbox as the perennial name dropping tactic to get end users buy-in.

Dropbox was more than that, and it went on to define a whole new market segment known as Enterprise File Synchronization and Sharing (EFSS), together with everybody else such as Box, Easishare (they are here in South East Asia), and just about everybody else. And the executive team at Dropbox knew they were special too, so much so that they rejected a buyout attempt by Apple in 2011.

Today, Dropbox is beyond BYOD and EFSS. They are a full fledged collaboration platform that includes project management, project workflow, file versioning, secure file transfer, smart file synchronization and Dropbox Paper. And they offer comprehensive plans from Basic, Plus and Professional to Business and Enterprise. Their upcoming IPO, I am sure, will give them far greater capital to expand, and realize their full potential as the foremost content-based collaboration platform in the world.

Dropbox began their exodus from AWS a couple of years ago. They wanted to control their destiny and have moved more than 500PB into their own private data center for their customer data. That was half-an-exabyte, people! And two years later, they saved $75million of operating costs after they exited AWS. Today, they have more than 1 Exabyte of customer data! That is just incredible.

And Dropbox’s storage architecture started with a simple foundational design called “Magic Pocket“. Magic Pocket is a “fixed-length, immutable” block storage layer.

The block size is fixed at 4MB chunks (for parallel performance and service resumption reasons), compressed and deduped (for capacity savings reasons), encrypted (for security reasons) and replicated (for high availability reasons).

Continue reading →

My dilemma of stateful storage marriage

By cfheoh | February 17, 2018 - 1:28 pm |February 17, 2018 API, Cloud, Data Management, Datera, Elastifile, Filesystems, Hedvig, High Performance Computing, Hyperconvergence, NFS, Object Storage, OpenIO, Scality, Storage Field Day, Uncategorized, Virtualization, VMware, WekaIO

2 Comments

I should be a love match maker.

I have been spending much hours in the past few months, thinking of stateful data in stateful storage containers and how they would consummate with distributed applications containers and functions-as-a-service (aka serverless, aka Lambda). It still hasn’t made much sense, and I have not solved this problem yet. Although there were bits and pieces that coming together and the jigsaw looked well enough to give a cackled reply, what I have now is still not good enough for me. I am still searching for answers, better than the ones I have now.

The CAP theorem is in center of my mind. Distributed data, distributed states of data are on my mind. And by the looks of things, the computing world is heading towards containers and serverless computing too. Both distributed applications containers and serverless computing make a lot of sense. If we were to engage a whole new world of fog computing, edge computing, IoT, autonomous systems, AI, and other real-time computing, I would say that the future belongs to decentralization. Cloud Computing and having edge systems and devices getting back to the cloud for data is too slow. The latency of micro- or even nano-seconds is just not good enough. If we rely on the present methods to access the most relevant data, we are too late.

Continue reading →

Of Object Storage, Filesystems and Multi-Cloud

By cfheoh | November 22, 2017 - 12:25 pm |November 22, 2017 Amazon, CIFS, Cloud, Cloudian, Data Availability, Data Fabric, Data Management, Elastifile, Filesystems, High Performance Computing, Hyperconvergence, Nasuni, NFS, Object Storage, OpenIO, Openstack, Performance Benchmark, Performance Caching, Reliability, Scale-out architecture, Scality, Server SAN, SMB, Software Defined Storage, Software-defined Datacenter, Storage Optimization, swiftstack, Uncategorized, Virtualization

1 Comment

Data storage silos everywhere. The early clarion call was to eliminate IT data storage silos by moving to the cloud. Fast forward to the present. Data storage silos are still everywhere, but this time, they are in the clouds. I blogged about this.

Object Storage was all the rage when it first started. AWS, with its S3 (Simple Storage Service) offering, started the cloud storage frenzy. Highly available, globally distributed, simple to access, and fitted superbly into the entire AWS ecosystem. Quickly, a smorgasbord of S3-compatible, S3-like object-based storage emerged. OpenStack Swift, HDS HCP, EMC Atmos, Cleversafe (which became IBM SpectrumScale), Inktank Ceph (which became RedHat Ceph), Bycast (acquired by NetApp to be StorageGrid), Quantum Lattus, Amplidata, and many more. For a period of a few years prior, it looked to me that the popularity of object storage with an S3 compatible front has overtaken distributed file systems.

What’s not to like? Object storage are distributed, they are metadata rich (at a certain structural level), they are immutable (hence secure from a certain point of view), and some even claim self-healing (depending on data protection policies). But one thing that object storage rarely touted dominance was high performance I/O. There were some cases, but they were either fronted by a file system (eg. NFSv4.1 with pNFS extensions), or using some host-based, SAN-client agent (eg. StorNext or Intel Lustre). Object-based storage, in its native form, has not been positioned as high performance I/O storage.

A few weeks ago, I read an article from Storage Soup, Dave Raffo. When I read it, it felt oxymoronic. SwiftStack was just nominated as a visionary in the Gartner Magic Quadrant for Distributed File Systems and Object Storage. But according to Dave’s article, Swiftstack did not want to be “associated” with object storage that much, even though Swiftstack’s technology underpinning was all object storage. Strange.

Continue reading →

The power of E8

By cfheoh | November 21, 2017 - 4:22 pm |November 21, 2017 Analytics, API, Big Data, Data Availability, Data Fabric, Data Management, E8 Storage, Filesystems, High Performance Computing, Hyperconvergence, Infiniband, NVMe, PCIe, Performance Benchmark, Performance Caching, RDMA, Scale-out architecture, Server SAN, Software Defined Storage, Solid State Devices, Storage Optimization

2 Comments

[Preamble: I was a delegate of Storage Field Day 14 from Nov 8-10, 2017. My expenses, travel and accommodation were paid for by GestaltIT, the organizer and I was not obligated to blog or promote the technologies presented at this event. The content of this blog is of my own opinions and views]

E8 Storage technology update at Storage Field Day 14 was impressive. Out of the several next generation NVMe storage technologies I have explored so far, E8 came out as the most complete. It was no surprise that they won the “Best of Show” in the Flash Memory Summits for the “Most Innovative Flash Memory Technology” in 2016 and “Most Innovative Flash Memory Enterprise Business Application” for 2017.

Who is E8 Storage?

They came out of stealth in August 2016 and have been making waves with very impressive stats. When E8 was announced, their numbers were more than 10 million IOPS, with 100µsecs for reads and 40µsecs for writes. And in the SFD14 demo, they reached and past the 10 million IOPS numbers.

The design philosophy of E8 Storage is different than the traditional dual controller scale-up storage architecture design or the multi-node scale-out cluster design. In fact, from a 30,000 feet view, it is quite similar to a “SAN-client” design advocated by Lustre, leveraging a very high throughput, low latency network.

Continue reading →

Category Archives: Filesystems

The Malaysian Openstack storage conundrum

Huawei Dorado – All about Speed

Own the Data Pipeline

The leapfrog game in Asia with HPC

Cohesity SpanFS – a foundational shift

My dilemma of stateful storage marriage

Of Object Storage, Filesystems and Multi-Cloud

The power of E8

Recent Posts

Sponsored Ads

Google Adsense

Recent Comments

Google Adsense

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

The magic is happening.

Share this:

Share this:

Share this:

Share this:

Recent Posts

Sponsored Ads

Google Adsense

Recent Comments

Google Adsense