Please be aware that version 1.6.1 of this module may not restore your datastore correctly due to a bug that is corrected in version 2.0 ( MGNLBACKUP-99 - Refactor backup module to use RepositoryCopier API in order to achieve stability CLOSED ).

The Backup module allows you to take manual and scheduled backups of content and configuration. The module backs up Magnolia configuration and all versions of content in all workspaces.

Installing

Maven is the easiest way to install the module. Add the following dependency to your bundle:

<dependency>
  <groupId>info.magnolia</groupId>
  <artifactId>magnolia-module-backup</artifactId>
  <version>1.6.1</version>
</dependency>

Pre-built jars are also available for download. See Installing a module for help.

Taking a manual backup

  1. Go to Tools > Backup.
  2. In Manual backup enter the target path for backup.
  3. Click Backup. The target ID will be created if it doesn't already exist.

The backup consists of the following files:

  • repoConfig.zip - all configuration files necessary to restore the repositories.
  • history.gz - versions of all versionable content from all repositories.
  • <workspace_name>.gz - one file per workspace. Holds all content of given workspace.
  • blobs - folder with binary data from all the workspaces.

(warning) Backup files contain confidential information on how to connect to the database. Keep your backup files in a secure location.

Scheduling automatic backups

  1. Go to Tools > Backup.
  2. In Automatic backup set the time, frequency and target directory
  3. Click Schedule backup.

The tool registers an automatic backup task in the module configuration in /modules/backup/config

Node nameValue

 modules

 

 backup

 

 config

 

 tasks

 

 weeklyBackup

 

 automatedExecution

 

 cron

0 0 6 * * *

 enabled

true

 class

info.magnolia.module.backup.BackupRun

 name

weeklyBackup

 targetPath

 /tmp/backup

Viewing existing tasks

Scheduled automatic backups display in a list in Existing tasks.

Restoring a backup

To restore the previously backed up data, use the restore script in the Backup module bundle.

The restore script is meant to recreate an instance, not to patch an existing one. For it to work correctly you should clean your database schema before running it.

(warning) There is no UI for running the restore script. Run the script before starting Magnolia so that the script can re-create the repository.

To make sure the webapp directory is properly setup before running the restore, we recommend you take the WAR file or webapp from the bundle, add all your custom modules, startup the server once and perform the installation. Perform a backup of this new instance. After that, shutdown the server, delete the repositories directory and run the restore as specified below.

Usage:

restore [options]

Options:

  • -backup <dir>. Backup folder.
  • -help
  • -lib <dir>. Directory containing Magnolia's jar files. Default is the webapp's directory suffixed with WEB-INF/lib/.
  • -propertyfiles <arg>. List of property files to load. By default, files specified in web.xml are used. If the properties are not specified in web.xmldefault configuration properties are used.
  • -saveAfter <num>. Persistence counter. The lower the number, the less memory is used and the slower the restore runs. Default value is 100.
  • -servername <name>. Server name. Used for dynamic property file finding.
  • -webapp <dir>. Webapp's root folder. Default is the current directory.
  • -webappname <name>. Webapp name. Used for dynamic property file finding. Default is the directory name of the webapp.

Example:

./bin/restore -backup /path/to/backup/backup_0812305_1234 -webapp /path/to/magnolia/webapp

Tip

You should reindex your repository once the restore script is done since the Lucene indexer works asynchronously. Even though the restore script has completed indexing may be ongoing and incomplete.

Tip

Assign the restore script more memory by changing the EXTRA_JVM_ARGUMENTS variable in the script file. On a Linux server you need to grant the correct permissions to the script file in order to run it.

Best practice

For safety reasons, run the restore from time to time on a test system to verify the validity of the backup files. Depending on the amount of data changed over the time, perhaps do so every month or every quarter.

Deleting search indexes

Backup data does not contain search indexes. Indexes are created on the fly when running restore. As the indexer runs asynchronously, the indexing may still be in progress when restore is done. For this reason, we recommend that you remove all indexes before starting up a restored instance. This ensures that indexes are freshly generated on startup (Note that this process can take a couple of minutes). To remove indexes just delete all index folders from the workspaces. On unix systems you can do so by executing rm -rf repositories/magnolia/workspaces/*/index.

Database size

During restore all entries are reinserted into the database sequentially. This ensures that the restored database is not fragmented and is slightly smaller than the equivalent database before backup.

Changing persistence managers

You can use backup/restore to migrate data to another persistence manager and to another database.

To change the persistence manager configuration to which the restore script restores the database:

  1. Unzip all entries from repoConfig.zip. You can find this file inside each backup folder. 
  2. Change the persistence manager configuration in each file.
  3. Zip the files.
  4. Run the restore.

    This step is not necessary if you performed a backup of the new instance. Get the repoConfig.zip file from that backup and run the restore script

#trackbackRdf ($trackbackUtils.getContentIdentifier($page) $page.title $trackbackUtils.getPingUrl($page))