Apache Geode CHANGELOG

How Disk Stores Work

Overflow and persistence use disk stores individually or together to store data.

Disk storage is available for these items:

  • Regions. Persist and/or overflow data from regions.
  • Server’s client subscription queues. Overflow the messaging queues to control memory use.
  • Gateway sender queues. Persist these for high availability. These queues always overflow.
  • PDX serialization metadata. Persist metadata about objects you serialize using Geode PDX serialization.

Each member has its own set of disk stores, and they are completely separate from the disk stores of any other member. For each disk store, define where and how the data is stored to disk. You can store data from multiple regions and queues in a single disk store.

This figure shows a member with disk stores D through R defined. The member has two persistent regions using disk store D and an overflow region and an overflow queue using disk store R.

What Geode Writes to the Disk Store

This list describes the items that Geode comprise the disk store:

  • The members that host the store, and information on their status, such as which members are online and which members are offline and time stamps.
  • A disk store identifier.
  • Which regions are in the disk store, specified by region name.
  • Colocated regions that the regions in the disk store are dependent upon.
  • A set of files that specify all keys for the regions, as well as all operations on the regions. Given both keys and operations, a region can be recreated when a member is restarted.

Geode does not write indexes to disk.

Disk Store State

The files for a disk store are used by Geode as a group. Treat them as a single entity. If you copy them, copy them all together. Do not change the file names.

Disk store access and management differs according to whether the member is online or offline.

While a member is running, its disk stores are online. When the member exits and is not running, its disk stores are offline.

  • Online, a disk store is owned and managed by its member process. To run operations on an online disk store, use API calls in the member process, or use the gfsh command-line interface.
  • Offline, the disk store is just a collection of files in the host file system. The files are accessible based on file system permissions. You can copy the files for backup or to move the member’s disk store location. You can also run some maintenance operations, such as file compaction and validation, by using the gfsh command-line interface. When offline, the disk store’s information is unavailable to the distributed system. For partitioned regions, region data is split between multiple members, and therefore the start up of a member is dependent on and must wait for all members to be online. An attempt to access an entry that is stored on disk by an offline member results in a PartitionOfflineException.