Skip to main content
IMHCloud Logo
Back to glossary

Backup

A backup is a recoverable copy of data stored separately from the original source, used to restore files, volumes, or entire systems after data loss, corruption, or accidental deletion.

What is a Backup in cloud hosting?

A backup is a recoverable copy of data stored separately from the original source. In cloud hosting, backups capture the contents of volumes (persistent storage), instances (virtual machines), databases, or configuration files at a specific moment. If the original data becomes corrupted, deleted, or inaccessible, you restore from the backup to recover.

Backups differ from snapshots in scope and purpose. A snapshot captures the exact state of a volume or instance at a point in time and remains tied to the original storage system. A backup is typically exported to a separate storage location, often in a different region or on different hardware, providing protection even if the primary infrastructure fails completely.

Related Terms

  • Volume: A persistent block storage device that attaches to an instance, such as the boot disk or a data disk containing application files that backups commonly protect.
  • Instance: A virtual machine running in the cloud, such as a web server or database server, whose configuration and attached volumes are often included in backup routines.
  • Snapshot: A point-in-time capture of a volume's state, such as a quick checkpoint before applying updates, that remains stored on the same infrastructure as the original.

Why Backups Exist

Cloud infrastructure provides high availability, but it does not protect against all forms of data loss. Hardware failures, software bugs, human error, ransomware attacks, and accidental deletions can destroy data that cannot be recovered from the running system. Without backups, losing critical data means losing days, weeks, or months of work.

Backups exist to provide a safety net. They allow organizations to recover to a known good state after any event that compromises data integrity. Regulatory requirements in healthcare, finance, and other industries often mandate backup retention for compliance purposes. Disaster recovery plans depend on backups being available, tested, and restorable.

What Do Backups Actually Do?

  • Copy data from production volumes, instances, or databases to a separate storage location
  • Store multiple recovery points based on a retention policy, such as daily backups kept for 30 days
  • Enable restoration of individual files, entire volumes, or complete systems
  • Provide version history that allows recovery from before a corruption event occurred
  • Support disaster recovery by storing copies in geographically separate regions

When Would I Use a Backup?

Use backups when you need to protect data that cannot be easily recreated. Production databases, application configurations, user uploads, and business-critical files all require backup protection.

You would use backups before making major system changes, such as upgrading software or migrating to a new infrastructure. They provide rollback capability if the change introduces problems.

Backups are essential when compliance requirements mandate data retention. Financial records, healthcare data, and legal documents often require specific retention periods that backups fulfill.

When Would I NOT Use a Backup?

You might skip backups for ephemeral workloads where the instance and its data are recreated from automation scripts each time. If configuration management tools can rebuild a server from scratch in minutes, backing up its state may be unnecessary.

Backups may not be the right choice for extremely large datasets that change frequently, such as video processing pipelines with terabytes of temporary files. The cost and time to back up such data may exceed the value of recovery. In these cases, architecting the system to be recreatable from source data is more practical.

You should not rely on backups alone for high availability. Backups address data recovery after a failure, not continuous uptime. For zero-downtime requirements, use replication and redundancy instead of treating backups as the primary protection mechanism.

Real-World Example

Company A runs an e-commerce platform on cloud infrastructure. Their database volume stores customer orders, product inventory, and transaction history. They configure automated daily backups that run at 2:00 AM, retaining 30 daily backups and 12 monthly backups.

One morning, a developer accidentally runs a script that deletes six months of order records. The production database no longer contains the historical data. Company A restores the database from the backup taken the previous night, losing only orders placed between 2:00 AM and the time of the incident. Within two hours, operations resume normally. Without the backup, reconstructing the order history would have been impossible.

Frequently Asked Questions

What is the difference between a full, incremental, and differential backup?

A full backup copies all data every time it runs, providing a complete standalone recovery point but consuming more storage and time. An incremental backup copies only data that changed since the last backup of any type, making it fast and storage-efficient but requiring a chain of backups to restore. A differential backup copies all data changed since the last full backup, offering a middle ground between restore speed and storage consumption.

How often should I run backups?

Backup frequency depends on how much data loss your organization can tolerate. If losing one hour of data is acceptable, hourly backups are sufficient. If losing even minutes of transactions is unacceptable, you need more frequent backups combined with transaction logging or replication. Most organizations run daily backups with longer intervals for archival copies.

Where should I store my backups?

Store backups in a location separate from your production data. This typically means a different availability zone, a different region, or a dedicated backup storage service. Storing backups on the same hardware as production data defeats the purpose, since a hardware failure would destroy both.

How do I know if my backups are working?

Test your backups regularly by performing restore operations. A backup that cannot be restored is worthless. Many organizations schedule monthly or quarterly restore tests to verify backup integrity. Monitoring backup job completion status and checking backup sizes for anomalies also helps detect problems early.

How long should I keep backups?

Retention periods depend on business requirements and regulatory obligations. Common policies keep daily backups for 30 days, weekly backups for 3 months, and monthly backups for 1 year. Industries with compliance requirements may mandate specific retention periods, such as 7 years for financial records. Balance retention length against storage costs and the likelihood of needing older recovery points.

Summary

  • A backup is a recoverable copy of data stored separately from the original source
  • Backups protect against data loss from hardware failure, human error, corruption, and ransomware
  • Full backups copy everything, incremental backups copy changes since the last backup, and differential backups copy changes since the last full backup
  • Store backups in a separate location from production data for disaster protection
  • Test backups regularly to ensure they can actually be restored when needed