Analytics – Page 5 – Storage Gaga

Storage in a shiny multi-cloud space

By cfheoh | September 14, 2020 - 6:22 pm

The multi-cloud era for infrastructure-as-a-service (IaaS) is not here (yet), whatever the technology marketers want you to think. The hype, the vapourware, the frenzy. It is what they do. The same goes for technology analysts, who describe visions and futures, and the high-level constructs and strategies to get there. Multi-cloud is often hyped as running applications and infrastructure services seamlessly across several public clouds such as Amazon AWS, Microsoft® Azure and Google Cloud Platform, and linking them to on-premises data centers and private clouds. Hybrid is the new black.

Multicloud connectivity to public cloud providers and on-premises private cloud

Multi-Cloud, on-premises, public and hybrid clouds

And the aspiration of multi-cloud is the right one, when it is truly ready. Gartner® wrote a high-level article titled “Why Organizations Choose a Multicloud Strategy”. Taking advantage of each individual cloud’s strengths and resiliency in its respective geographies makes good business sense, but there are many other considerations that cannot be an afterthought. In this blog, we look at a few of them from a data storage perspective.

In the beginning there was …

For this storage dinosaur, data storage and compute have always been coupled as one. In the mainframe DASD days, these two were together. Even with the rise of networking architectures and protocols, from IBM SNA, DECnet, Ethernet & TCP/IP, and Token Ring FC-SAN (sorry, this is just a joke), the SANs and the filers stayed close to the servers, albeit buffered by a network layer.

A decade ago, when the public clouds started appearing, data storage and compute were mostly inseparable. There was a demarcation between public clouds and private clouds. The notion of hybrid clouds meant public clouds and private clouds could intermix with on-premises computing and data storage, but in almost all cases this was confined to a single public cloud provider. That held until the public cloud providers realized they could not convincingly entice the larger enterprises to move their IT out of their on-premises data centers to the cloud. So, these public cloud providers reversed their strategy and peddled their cloud services back to on-prem. Today, Amazon AWS has Outposts; Microsoft® Azure has Arc; and Google Cloud Platform launched Anthos.

Intel is still a formidable force

By cfheoh | August 17, 2020 - 9:15 am

It is easy to kick someone who is down. Bad news has stronger ripple effects than good news. Intel® is going through a rough patch, perhaps its worst so far. They delayed their 7nm manufacturing process, one which could have given Intel® breathing room in the CPU war with rival AMD. The process has now been pushed back to 2021, possibly 2022.

Intel Apple Collaboration and Partnership started in 2005

Their association with Apple® is coming to an end after 15 years, and more security flaws surfaced after the Spectre and Meltdown debacle. Extremetech probably said it best (or worst) last month:

We’ve never seen Intel® struggle like this

If we look deeper (and I am sure you have), all of this negative news was related to their processors. Intel® is much, much more than that.

Their Optane™ storage prowess

I have years of association with the folks at Intel® here in Malaysia, dating back 20 years. I hardly ever see Intel® beating its own drum when it comes to storage technologies, but they are beginning to. The Optane™ revolution in storage has been a game changer. Optane™ enables the implementation of persistent memory or storage class memory, a performance tier that sits between DRAM and the SSD. The speed and, more notably, the latency of Optane™ are several times better than those of enterprise SSDs.

Intel pyramid of tiers of storage medium

If you want to know more about Optane™’s latency and speed, here is a very geeky article from Intel®:

Restoring the Balance between Bandwidth and Latency

The list of storage vendors who have embedded Intel® Optane™ into their gear is long. Vast Data, StorOne™, NetApp® MAX Data, Pure Storage® DirectMemory Modules, HPE 3PAR and Nimble Storage, Dell Technologies PowerMax and PowerScale, and many more cement Intel®’s storage prowess with Optane™.

3D XPoint, the Phase Change Memory technology behind Optane™, came from the joint venture between Intel® and Micron®. That partnership was dissolved in 2019, but it has not diminished the momentum of next generation Optane™. Alder Stream and Barlow Pass are going to be the Gen-2 SSD and Persistent Memory DC DIMM respectively. A screenshot of the Optane™ roadmap appeared in Blocks & Files last week.

Intel next generation Optane roadmap

Valuing the security value of NAS storage

By cfheoh | August 10, 2020 - 8:22 pm

Garmin paid, reportedly millions. Do you sleep well at night knowing that the scourge of ransomware is rampant and ever threatening your business? Is your storage safe enough, or have you invested in storage that was merely economical (also known as cheap) for your pocket?

Garmin was hacked by ransomware

I have highlighted this before. NAS (Network Attached Storage) has become a goldmine for ransomware. And in the mire of this COVID-19 pandemic, the lackadaisical attitude towards securing NAS storage remains. More often than not, end users and customers, especially in the small and medium enterprise segment, continue to search for the most economical NAS storage to use in their business.

Is price the only factor?

Why do customers and end users like to look at the price? Is the economical capital outlay of a cheap NAS storage with 3-year hardware and shallow technical support that significant to appease the pocket gods? Some end users might decide to rent cloud file storage, Hotel California style, until they count the 3-year “rental” price.

Persistent Storage could stifle Google Anthos multi-cloud ambitions

By cfheoh | July 27, 2020 - 9:15 am | July 26, 2020

To win in the multi-cloud game, you have to be in your competitors’ cloud. Google Cloud has been doing that since they announced Google Anthos just over a year ago. They have been crafting their “assault”, starting with on-premises, and Anthos on AWS. Anthos on Microsoft® Azure is coming, currently in preview mode.

Google CEO Sundar Pichai announcing Google Anthos at Next ’19

BigQuery Omni conversation starter

Two weeks ago, whilst the Google Cloud BigQuery Omni announcement was still under wraps, local Malaysian IT portal Enterprise IT News sent me the embargoed article to seek my views and opinions. I have to admit that I was ignorant about the deeper workings of BigQuery, and had not fully gone through the workings of Google Anthos either. So I researched them.

Having done some small works on Qubida (defunct) and Talend several years ago, I have grasped useful data analytics and data enablement concepts, and so BigQuery and BigQuery Omni fitted into my understanding quite well. That triggered my interest in writing this blog and meshing the persistent storage conundrum (at least for me it is something to be untangled) with Kubernetes, with GKE (Google Kubernetes Engine), and thus with Anthos as well.

For discussion’s sake, here is an overview of BigQuery Omni.

An overview of Google Cloud BigQuery Omni on multiple cloud providers

My comments and views are in this EITN article “Google Cloud’s BigQuery Omni for Multi-cloud Analytics”.

Resilient Integrated Data Protection against Ransomware

By cfheoh | June 8, 2020 - 10:37 am

Early in the year, I wrote about NAS systems being a high impact target for ransomware. I called NAS a goldmine for ransomware. This is still very true because NAS systems are the workhorses of many organizations. They serve files and folders, and from that comes the sharing and collaboration of work.

Another common function for NAS systems is being a target for backups. In small and medium organizations, backup software often directs its backups to a network drive. Even for larger enterprise customers, NAS is a common destination for backups.

Typical NAS backup for small medium organizations.

Backup to Data Domain with NAS (NFS, CIFS) Protocols

Ransomware is obviously targeting the backup as another high impact target, with the potential to disrupt the rescue and the restoration of the work files and folders.

Down the rabbit hole with Kubernetes Storage

By cfheoh | May 19, 2020 - 9:30 am | May 16, 2020

Kubernetes is on fire. Last week VMware® released the State of Kubernetes 2020 report, which surveyed companies with 1,000 employees and above. The results were not surprising, as adoption of this nascent technology is booming. But persistent storage remains the nagging concern when Kubernetes serves infrastructure resources to application instances running in the containers of a pod in a cluster.

The standardization of storage resources has settled on CSI (Container Storage Interface). Storage vendors have almost, kind of, sort of agreed that API objects such as PersistentVolumes, PersistentVolumeClaims and StorageClasses, along with their parameters, would be the way to request storage resources from the Pre-provisioned Volumes via the CSI driver plug-in. There are already more than 50 vendor-specific CSI drivers on GitHub.
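To make those API objects a little more concrete, here is a minimal Python sketch that builds a StorageClass and a PersistentVolumeClaim as plain dictionaries and prints them as a JSON list. The field names follow the Kubernetes API as I understand it, but the driver name `csi.example.com`, the `tier` parameter, and the object names are made up for illustration and do not belong to any real vendor. A StorageClass is normally the dynamic provisioning path, so treat this as one possible shape rather than the only way to request storage.

```python
import json


def storage_class(name, provisioner, parameters):
    """Build a StorageClass manifest that points at a CSI driver."""
    return {
        "apiVersion": "storage.k8s.io/v1",
        "kind": "StorageClass",
        "metadata": {"name": name},
        "provisioner": provisioner,  # CSI driver name (hypothetical in this sketch)
        "parameters": parameters,    # opaque key/values interpreted by the driver
    }


def volume_claim(name, class_name, size):
    """Build a PersistentVolumeClaim that requests storage from a StorageClass."""
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolumeClaim",
        "metadata": {"name": name},
        "spec": {
            "accessModes": ["ReadWriteOnce"],
            "storageClassName": class_name,
            "resources": {"requests": {"storage": size}},
        },
    }


if __name__ == "__main__":
    # All names and parameters below are illustrative assumptions.
    sc = storage_class("fast-tier", "csi.example.com", {"tier": "flash"})
    pvc = volume_claim("app-data", sc["metadata"]["name"], "10Gi")
    manifest = {"apiVersion": "v1", "kind": "List", "items": [sc, pvc]}
    print(json.dumps(manifest, indent=2))
```

The point of the sketch is the indirection: the application only names a class and a size in its claim, and the vendor-specific behaviour hides behind the `provisioner` and `parameters` of the StorageClass.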

Kubernetes and the CSI (Container Storage Interface) logos

The CSI plug-in method is the only way for Kubernetes to scale and keep its dynamic, loadable storage resource integration with external 3rd party vendors, all clamouring to grab a piece of this burgeoning demand, both in the cloud and in the enterprise.

Falconstor Software Defined Data Preservation for the Next Generation

By cfheoh | April 27, 2020 - 9:56 am

Falconstor® Software is gaining momentum. Given its arduous climb back to the fore, it is beginning to soar again.

Tape technology and Digital Data Preservation

I mentioned that long term digital data preservation is a segment within the data lifecycle which has merits and prominence. SNIA® has shown that this is a strongly growing market segment through its 2007 and 2017 “100 Year Archive” surveys. Three critical challenges of this long, long-term digital data preservation are to keep the archives

Accessible
Undamaged
Usable

For the longest time, tape technology has been the king of the hill for digital data preservation. The technology is cheap and mature, and many enterprises have built their long term strategy around it. And the pulse in the tape technology market is still very healthy.

The challenges of tape remain. Every 5 years or so, companies have to consider moving the data on the existing tape technology to the next generation. It is widely known that LTO can read tapes of the previous 2 generations, and write to tapes of the generation before. The tape transcription process of migrating digital data for the sake of data preservation is bad because it affects the structural integrity and quality of the content of the data.
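The compatibility rule of thumb above can be sketched in a few lines of Python. This is only an illustration of the "read two generations back, write one generation back" rule as stated here; the function names and the `read_back` and `write_back` parameters are my own. My understanding is that newer generations, from LTO-8 onwards, narrowed read compatibility to one generation back, which is why the window is a parameter rather than a constant.

```python
def can_read(drive_gen, tape_gen, read_back=2):
    """True if a drive of generation drive_gen can read a tape of tape_gen."""
    return drive_gen - read_back <= tape_gen <= drive_gen


def can_write(drive_gen, tape_gen, write_back=1):
    """True if a drive of generation drive_gen can write a tape of tape_gen."""
    return drive_gen - write_back <= tape_gen <= drive_gen


def readable_by(tape_gen, newest_drive_gen, read_back=2):
    """List the drive generations that can still read a tape of tape_gen."""
    return [
        drive_gen
        for drive_gen in range(tape_gen, newest_drive_gen + 1)
        if can_read(drive_gen, tape_gen, read_back)
    ]


if __name__ == "__main__":
    # Under the two-back rule, an LTO-5 tape falls out of reach after LTO-7.
    print(readable_by(tape_gen=5, newest_drive_gen=8))
    # With a one-back window the tape becomes unreadable a generation sooner.
    print(readable_by(tape_gen=5, newest_drive_gen=8, read_back=1))
```

Seen this way, the migration pressure is easy to explain: every tape has a short window of drive generations that can still read it, and the data has to be transcribed before that window closes.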

In my time covering Oil & Gas subsurface data management, I have seen NOCs (national oil companies) with 500,000 tapes of all generations, from 1/2″ to DDS, DAT to SDLT, 3590 to LTO 1-7. Millions are spent to transcribe these tapes every few years, and we have folks like Katalyst DM, Troika and more hovering over this landscape for their fill.

The Falcon to soar again

By cfheoh | April 20, 2020 - 8:30 am | April 18, 2020

One of the historical feats which had me mesmerized for a long time was the 14-year journey China’s imperial treasures took to escape the Japanese invasion in the early 1930s, sandwiched between rebellions and civil wars in China. More than 20,000 pieces of the imperial treasures took a perilous journey to the west and back again. Divided into 3 routes over a decade and four years, not a single piece of treasure was broken or lost. All in the name of preservation.

Today, those 20,000-plus pieces live in perpetuity in 2 palaces – Beijing Palace Museum in China and National Palace Museum Taipei in Taiwan.

Digital data preservation

Digital data preservation is on another end of the data lifecycle spectrum. More often than not, it is not the part that many pay attention to. In the past 2 decades, digital data has grown so much that it is now paramount to keep the data forever. Mind you, this is not the data hoarding kind but to preserve the knowledge and wisdom which is in the digital content of the data.

[ Note: If you are interested to know more about Data -> Information -> Knowledge -> Wisdom, check out my 2015 article on LinkedIn ]

SNIA (Storage Networking Industry Association) conducted 2 surveys – one in 2007 and another in 2017 – called the 100 Year Archive, and found that the requirement for preserving digital data has grown manyfold over the 10 years. In the end, the final goal is to ensure that the perpetual digital contents are

Accessible
Undamaged
Usable

All at an affordable cost. Therefore, SNIA has the vision that the digital content must transcend the storage medium, the storage system and the technology that holds it.

The Falcon reemerges

A few weeks ago, I had the privilege to speak with Falconstor® Software’s David Morris (VP of Global Product Strategy & Marketing) and Mark Delsman (CTO). It was my first engagement with Falconstor® in almost 9 years! I wrote a piece on Falconstor® in my blog in 2011.

Dell EMC Isilon is an Emmy winner!

By cfheoh | March 16, 2020 - 7:41 am | March 17, 2020

[ Disclosure: I was invited by GestaltIT as a delegate to their Storage Field Day 19 event from Jan 22-24, 2020 in the Silicon Valley USA. My expenses, travel, accommodation and conference fees were covered by GestaltIT, the organizer and I was not obligated to blog or promote the vendors’ technologies presented at this event. The content of this blog is of my own opinions and views ]

And the Emmy® goes to …

Yes, the Emmy® goes to Dell EMC Isilon! It was indeed a well deserved accolade and an honour!

Dell EMC Isilon had just won a Technology & Engineering Emmy® Award a week before Storage Field Day 19, for their outstanding pioneering work on the NAS platform tiering technology of media and broadcasting content according to business value.

A lasting true clustered NAS

This is not a blog to praise Isilon but one that instills respect for a true clustered, scale-out file system. I have known of OneFS for a long time, but never really took the opportunity to put my hands on it since 2006 (there is a story). So here is a look at history …

Back in the early to mid-2000s, there was a lot of talk about large scale NAS. There were several players in the nascent scaling NAS market. NetApp was the filer king, with several competitors such as Polyserve, Ibrix, Spinnaker, Panasas and the young upstart Isilon. There were also Procom, BlueArc and NetApp’s predecessor Auspex. By the second half of the 2000s, the market consolidated and most of these NAS players were acquired.

NetApp acquired Spinnaker in 2003
Part of Auspex was acquired by NetApp in 2003; the other part by Glasshouse Technologies
Procom was picked up by Sun Microsystems in 2005
Polyserve went to HP in 2007
Ibrix joined HP as well in 2009
Isilon got acquired by EMC in 2010
BlueArc was gobbled up by HDS in 2011

Will there be Trust at Digital Events?

By cfheoh | March 13, 2020 - 5:07 am

[ This article was published on LinkedIn on March 8, 2020. The original article link is here ]

關係 (Guan Xi) is ingrained into the psyche of many Asian cultures and businesses. It is fundamental to building connections and relationships, and consequently forging trust in those relationships. And it is best when it involves face-to-face communication and building a common foundational belief in one another.

The COVID-19 outbreak is wreaking havoc and may become a global pandemic if the situation continues unabated in the coming months. In light of safety, many vendors are either canceling their physical events or switching to digital or virtual events. On my radar this past week, there are Dell Tech World, AWS Singapore Summit and Google Cloud Next, to name a few. How do we build trust from these digital and virtual events?

All about the experience

The experience to engage at physical technology events is priceless. Putting the face to the name, to shake the hand and rub shoulders to connect cannot be quantified by just being present. Sharing war stories over coffee or beer, and exchanging good jokes and bad ones over dinner, are experiences which cannot be taken away in our lifetime. That is why I have always thoroughly enjoyed my Field Day experiences since 2014.

I am old school. I believe in 關係, because the kind of camaraderie, the fellowship, the brotherhood or sisterhood built from trust is immeasurable. The chemistry mix of experience would be hard to reproduce. An old hand at EMC once said to my team and me, “I would go to war with you guys any day!”.

The question today is “Can Digital or Virtual Events replicate that experience and build trust?”.
