Skip to main content
Monitoring of Redis cluster, nodes and databases
Last update:

Monitoring of Redis cluster, nodes and databases

In Redis cloud databases, you can monitor the health of the cluster.

To assess the overall state of the cluster check his status..

For a more detailed analysis, you can:

You can find out if your disk is about to run out of space by using disk fill notifications.

View cluster status

  1. In control panels go to Cloud platformDatabases.
  2. In the cluster row, look at the status.
ACTIVEThe cluster is available
CREATINGA cluster is created
UPDATINGChanges are applied to the cluster
RESIZINGThe cluster is scalable
ERRORThere's been a mistake, file a ticket
DISK FULL

The disk is full and the cluster is read-only. To make the cluster read and write, scale cluster and select a configuration with a larger disk size

DEGRADEDSome nodes in the cluster are unavailable
DELETINGThe cluster is being deleted

View the status of the node cluster

  1. In control panels go to Cloud platformDatabases.
  2. Open the cluster page → tab Monitoring.
  3. In the block Cluster monitoring click Nodes of the cluster.
  4. In the field Server select the node whose metrics you want to view.
  5. See what's available node cluster metrics.

Cluster node metrics in the control panel

vCPUHow many percent of the node cluster cores are utilized
CPU iowaitHow much percent of the processor's time was spent waiting for I/O
MemoryMemory utilization excluding cache and operating system buffers in percent or gigabytes
DiskUsed disk space in percent or gigabytes. The file system reserves 4% of the disk space for service needs, so the space occupied is greater than the actual space used by this 4%
Network load

The number of bits or packets sent and received over the network interface

Disk, read and write speed

Data read speed in B/s and data write speed in KB/s

Disk, number of IOPS

Number of read and write operations per second

OOM

Number of processes that ended with an error Out of Memory due to lack of RAM

Load Average

The average value of system load over a period of time. Shows how many processes are processed by the cluster cores. The indicator is presented in the form of three values — for one minute, five minutes and 15 minutes. These values should not be greater than the number of cores on the node

View the status of the databases

  1. In control panels go to Cloud platformDatabases.
  2. Open the cluster page → tab Monitoring.
  3. In the block Cluster monitoring click Databases.
  4. In the field Server select the node whose metrics you want to view.
  5. See what's available database metrics.

Database metrics in the dashboard

ConnectionsNumber of connections to the cluster databases
InquiriesNumber of successful and unsuccessful requests per second. Unsuccessful requests are those in which the key does not exist, is superseded due to exceeding the memory limit, or its lifetime (TTL) has expired
Number of keysNumber of keys in all databases and number of keys with a specified lifetime (TTL)
Preempted and obsolete keys

Two parameters are displayed:

  • Evicted — number of keys evicted due to exceeding the memory limit;
  • Expired — number of keys with expired lifetime.
Slow queries

Number of slow requests per second. Requests that take longer than 0.01 seconds to complete are considered slow requests

Export metrics in Prometheus format

  1. Get a token.
  2. Get metrics in Prometheus format.

Get a token

The token gives access to the metrics of all clusters project in one bullet.

  1. In control panels go to Cloud platformDatabases.

  2. Open the cluster page → tab Monitoring.

  3. In the block Tokens for Prometheus click Add token. The token will be generated automatically.

  4. Copy the token. To do this, in the token row, click .

Get metrics in Prometheus format

Historical information for clusters is not available — metrics are requested only in real time. The list of all metrics that are supported in cloud databases and their description can be seen in the following table Metrics in Prometheus format.

  1. Add to the Prometheus configuration file:

    scrape_configs:
    - job_name: get-metrics-from-dbaas
    scrape_interval: 1m
    static_configs:
    - targets:
    - '<pool>.dbaas.selcloud.ru'
    scheme: https
    authorization:
    type: Bearer
    credentials: <monitoring_token>

    Specify:

    • <pool> — pool in which the token is valid, e.g. ru-3. The address (URL) depends on the region and pool, you can look in the URL list;
    • <monitoring_token> — a monitoring token that you received Previously.
  2. Open a page in your browser where Prometheus-formatted metrics will be available:

    http://<ip_address>:9090/targets

    Specify <ip_address> — IP address where Prometheus is installed.

  3. Independently configure monitoring and alerts for database clusters.

Metrics in Prometheus format

Metrics in Prometheus format are provided for all clusters. A specific cluster can be found by the database cluster identifier in the label ds_id.

dbaas_memory_percentMemory utilization excluding cache and operating system buffers (RAM) in percent
dbaas_memory_bytesOccupied memory excluding cache and operating system buffers (RAM) in bytes
dbaas_oom_countNumber of processes that ended with an error Out of Memory due to lack of RAM
dbaas_cpuPercent vCPU utilization on database cluster nodes
dbaas_cpu_iowaitI/O waiting time in percent
dbaas_disk_percentPercentage of disk space occupied. The file system reserves 4% of disk space for service needs, so the occupied space is larger than the actual space used by this 4%
dbaas_disk_bytesOccupied disk space in bytes. The file system reserves 4% of the disk space for service needs, so the occupied space is larger than the actual space used by this 4%
dbaas_disk_read_iopsNumber of read operations per second
dbaas_disk_write_iopsNumber of recording operations per second
dbaas_disk_read_bytesDisk read speed in bytes per second
dbaas_disk_write_bytesData write speed to disk in bytes per second
dbaas_node_load1The average value of system load in one minute. Shows how many processes are processed by the cluster cores
dbaas_node_load5The average system utilization over five minutes. Shows how many processes are processed by the cluster cores
dbaas_node_load15The average value of system utilization over 15 minutes. Shows how many processes are processed by the cluster cores
dbaas_network_receive_bytesNumber of bytes received through the network interface
dbaas_network_transmit_bytesNumber of bytes sent through the network interface
dbaas_network_receive_packetsNumber of packets received through the network interface per second
dbaas_network_transmit_packetsNumber of packets sent through the network interface per second
dbaas_role

Role of the node:

  • 0 — role unknown;
  • 1 — master;
  • 2 — replica

Disk fullness notifications

If the cluster disk is 80% full, a notification will appear in the dashboard and will be sent to the email of the Account Owner and those users subscribed to the the "Services and services" notification category.

If the cluster disk is 95% full or more, the cluster will go into status DISK_FULL and will be read-only. To make the cluster read-write, scale the cluster and select a configuration with a larger disk size.