Skip to main content

Monitoring a Redis cluster, nodes, and databases

Last update:

In Redis Managed Databases, you can monitor the cluster state.

To assess the general state of the cluster, check its status.

For a more detailed analysis, you can:

The time in the control panel matches the time set on your device and does not depend on the region where the cluster is hosted.

note

For example, you have created a cluster in Tashkent, in the uz-1 pool. On the device from which you logged into the control panel, the Moscow time zone is set. The time on the metrics charts will be displayed in the Moscow time zone.

View cluster status

  1. In the control panel top menu, click Products and select Managed Databases.

  2. Open the Active tab.

  3. In the cluster row, check the status.

    ACTIVECluster is available
    CREATINGCluster is being created
    UPDATINGChanges are being applied to the cluster
    RESIZINGCluster is scaling
    ERRORAn error occurred, create a ticket
    DISK FULL

    Disk is full; the cluster is read-only. To make the cluster read and write, scale the cluster and select a configuration with a larger disk size

    DEGRADEDSome cluster nodes are unavailable
    DELETINGCluster is being deleted

View cluster node status

  1. In the control panel top menu, click Products and select Managed Databases.
  2. Open the Active tab.
  3. Open the cluster page → Monitoring tab.
  4. In the Cluster monitoring block, click Cluster nodes.
  5. Select the nodes for which you want to view metrics.
  6. View the available cluster node metrics.

Cluster node metrics in the control panel

MemoryUsed memory excluding OS cache and buffers, in percent or gigabytes
vCPUvCPU usage of cluster nodes in percent
CPU iowaitPercentage of time the CPU spent waiting for I/O
Disk

Occupied disk space in percent or gigabytes. It takes into account the part of disk space reserved for service needs and unavailable for database placement. For more information about reserving disk space, see the Using disk space in a Redis cluster instructions

Load Average

Average system load over a period of time. Shows how many processes are being processed by cluster cores. The indicator is presented as three values — for one minute, five minutes, and 15 minutes. These values should not exceed the number of cores on the node

OOM

Number of processes that terminated with an Out of Memory error due to insufficient RAM

Disk load

Read and write speed in KB/s or the number of read and write operations per second

Network loadNumber of bits or packets sent and received via the network interface

View database status

  1. In the control panel top menu, click Products and select Managed Databases.
  2. Open the Active tab.
  3. Open the cluster page → Monitoring tab.
  4. In the Cluster monitoring block, click Databases.
  5. Select the nodes for which you want to view metrics.
  6. View the available database metrics.

Database metrics in the control panel

Evicted and expired keys

Two parameters are displayed:

  • Evicted — number of keys evicted due to memory limit exceeded;
  • Expired — number of expired keys
Number of keysNumber of keys in all databases and number of keys with a defined time-to-live (TTL)
ConnectionsNumber of connections to cluster databases
Slow queries

Number of slow queries per second. Queries taking longer than 0.01 seconds are considered slow

Queries

Number of successful and failed requests per second. Requests are considered failed if the key does not exist, was evicted due to memory limit exceeded, or its time-to-live (TTL) has expired

Export metrics in Prometheus format

  1. Get a token.
  2. Get metrics in Prometheus format.

1. Get a token

The token provides access to metrics for all clusters of a project in one pool.

  1. In the control panel top menu, click Products and select Managed Databases.

  2. Open the Active tab.

  3. Open the cluster page → Monitoring tab.

  4. In the Prometheus tokens block, click Create token. The token will be generated automatically.

  5. Copy the token. To do this, click in the token row.

2. Get metrics in Prometheus format

Historical information for clusters is unavailable; metrics are requested in real time only. A list of all supported metrics in Managed Databases and their descriptions can be viewed in the Metrics in Prometheus format table.

  1. Add the following to the Prometheus configuration file:

    scrape_configs:
    - job_name: get-metrics-from-dbaas
    scrape_interval: 1m
    static_configs:
    - targets:
    - '<domain>'
    scheme: https
    authorization:
    type: Bearer
    credentials: <token>

    Specify:

    • <domain> — a Managed Databases API domain. This is the URL part for accessing the API without https:// and /v1, for example ru-3.dbaas.selcloud.ru. The URL depends on the region and pool, and can be found in the list of URLs;
    • <token> — the token you copied when obtaining the token in step 5.
  2. Open the page in your browser where the metrics in Prometheus format will be available:

    http://<ip_address>:9090/targets

    Specify <ip_address> — the IP address where Prometheus is installed.

  3. Configure monitoring and alerts for database clusters yourself.

Metrics in Prometheus format

Metrics in Prometheus format are provided for all clusters. A specific cluster can be found by the database cluster ID in the ds_id label.

dbaas_memory_percentUsed memory excluding OS cache and buffers (RAM) in percent
dbaas_memory_bytesUsed memory excluding OS cache and buffers (RAM) in bytes
dbaas_oom_countNumber of processes that terminated with an Out of Memory error due to insufficient RAM
dbaas_cpuvCPU usage on database cluster nodes in percent
dbaas_cpu_iowaitI/O wait time in percent
dbaas_disk_percent

Occupied disk space in percent. It takes into account the part of disk space reserved for service needs and unavailable for database placement. For more information about reserving disk space, see the Using disk space in a Redis cluster instructions

dbaas_disk_bytes

Occupied disk space in bytes. It takes into account the part of disk space reserved for service needs and unavailable for database placement. For more information about reserving disk space, see the Using disk space in a Redis cluster instructions

dbaas_disk_read_iopsNumber of read operations per second
dbaas_disk_write_iopsNumber of write operations per second
dbaas_disk_read_bytesDisk read speed in bytes per second
dbaas_disk_write_bytesDisk write speed in bytes per second
dbaas_node_load1Average system load over one minute. Shows how many processes are being processed by cluster cores
dbaas_node_load5Average system load over five minutes. Shows how many processes are being processed by cluster cores
dbaas_node_load15Average system load over 15 minutes. Shows how many processes are being processed by cluster cores
dbaas_network_receive_bytesNumber of bytes received via the network interface
dbaas_network_transmit_bytesNumber of bytes sent via the network interface
dbaas_network_receive_packetsNumber of packets received via the network interface per second
dbaas_network_transmit_packetsNumber of packets sent via the network interface per second
dbaas_role

Node role:

  • 0 — role unknown;
  • 1 — master;
  • 2 — replica