Navigating the cost of cloud storage in the public sector

Tags: ceph , Storage


Like many other industries, organisations in the public sector have been keen to make use of the flexibility offered by cloud computing, but are now observing unpredictable and rising costs. Much of which can be mitigated through careful planning and on-premise infrastructure. 

Government guidance now recommends switching to a strategy of the most appropriate solution for a problem, rather than a one-size-fits-all or carte blanche approach of shifting all applications to the cloud. 

In this blog we will explore some of the challenges encountered by public sector organisations, and the steps they can take to ensure cost-effectiveness, scalability and compliance.

Cloud storage

Cloud computing has taken the world by storm over the past two decades. The scalability, flexibility, and on-demand nature of public clouds is unmatched, but at what cost?

Public clouds provide an easy way to match inconsistent computing demands and usage, yes, but what about storage?  Storage doesn’t ebb and flow like the demands placed on CPUs and memory by high traffic events or end-of-month billing runs. Storage is persistent, needs to be retained and can’t just be shut off when not used. Even when it is not being accessed, there are still charges for holding onto that data.

All public cloud storage offerings have multiple tiers and methods of moving data between them, so what’s the catch?  Whilst putting data into cloud storage is largely free, making use of it, or getting it back out, has variable costs which are hard to predict.

Like many other industries, organisations in the public sector have been keen to make use of the flexibility offered by cloud computing, but are now observing unpredictable and rising costs. Much of which could have been mitigated through careful planning and on-premise infrastructure. 

Cost challenges

Predictability is king when it comes to running IT infrastructure, but with ever tightening budgets, unexpected bills can cause significant headaches for any organisation. This is felt quite acutely in the public sector, where budgets are tightly controlled and there is a greater sense of responsibility to those groups that they are serving. 

On top of that there are features and functionalities needed to ensure data is stored safely and for the correct amount of time, which can drive the cost of cloud computing higher still. It is important to find a balance between what is required and how it can be delivered in the most cost effective way. A self-hosted or co-located system can provide all of these features in a much more predictable fashion than a public cloud.

Reliable data protection

Public sector branches, including government agencies and the healthcare system, have to safely store data for many, many years. In fact, the data needs to be stored well beyond the lifetime of some of the hardware that they will initially deploy. That’s why it is important to select a storage system that not only ensures that data is safely stored using replication or erasure coding, but can also go through hardware refreshes multiple times over its lifetime and not risk the safety or availability of data.

Simplicity

The more complex any system is to manage, the higher the operational costs will be. Administrators will need specialised training and have to be on-call to handle any issues. Therefore, users should look for systems that are highly-available and self-healing, so that inevitable hardware failures are dealt with transparently and things like failed disks can be replaced in batches, rather than in immediate reactive maintenance.

Scalability

As the population grows, the number of people that a public sector organisation has to service will increase with it.  It is therefore important that a storage system allows organisations to transparently grow their storage capacity without the need for downtime or disruption.  

The opposite is also useful too, for example when a project finishes, it can be useful to scale down a cluster, again non-disruptively to allow the hardware to be reused somewhere else.

Compliance

Some of the datasets held by public sector companies also have requirements around proving that the data store is in fact still original and has not been tampered with. For example, imagine the records of births and deaths held by a government. This data should never change, and we need to ensure that when recalled, the data is as it was originally recorded. Similarly in law enforcement scenarios, digital evidence in the form of body camera footage, or crime scene evidence, need to be preserved until the trial and post-trial.

Snapshots, where a point in time version of a volume is created using metadata, is one way to ensure that original data is able to be read as it was originally written. For object storage, object versioning is a similar feature. When an object is overwritten, the older copy is transparently retained, so that it can also be retrieved at a later date.

Traditionally, tape backups have been used as a way of isolating an immutable copy of any data set, but with increasing use of digital evidence tape, recall times are becoming challenging.

Learn more

All of these challenges can be addressed in the public cloud, but is that the most cost effective approach?  Our recent whitepaper shows that for certain use cases, a co-located storage solution adjacent to a public cloud can provide savings of over 2-3x, even outsourced as a fully managed service. FInd out more below:

Additional resources

ceph logo

What is Ceph?

Ceph is a software-defined storage (SDS) solution designed to address the object, block, and file storage needs of both small and large data centres.

It's an optimised and easy-to-integrate solution for companies adopting open source as the new norm for high-growth block storage, object stores and data lakes.

Learn more about Ceph ›

ceph logo

How to optimise your cloud storage costs

Cloud storage is amazing, it's on demand, click click ready to go, but is it the most cost effective approach for large, predictable data sets?

In our white paper learn how to understand the true costs of storing data in a public cloud, and how open source Ceph can provide a cost effective alternative!

Access the whitepaper ›


Interested in running Ubuntu in your organisation? Talk to us today

ceph logo

A guide to software-defined storage for enterprises

Ceph is a software-defined storage (SDS) solution designed to address the object, block, and file storage needs of both small and large data centres.

In our whitepaper explore how Ceph can replace proprietary storage systems in the enterprise.

Access the whitepaper ›


Interested in running Ubuntu in your organisation? Talk to us today

Newsletter signup

Get the latest Ubuntu news and updates in your inbox.

By submitting this form, I confirm that I have read and agree to Canonical's Privacy Policy.

Related posts

Meet the Canonical Ceph team at Cephalocon 2024

Date: December 4-5th, 2024 Location: Geneva, Switzerland In just a few weeks, Cephalocon will be held at CERN in Geneva. After last year’s successful...

Managed storage with Ceph

Treat your open source storage infrastructure as a service What if storage was like coffee: menu driven and truly service oriented? Everyone knows how quick...

How do you select the best enterprise data storage solution for your business?

The choices you make around IT infrastructure have great impact for both business cost and performance, across areas as diverse as operations, finance, data...