Cluster of three nodes with Apache Kafka

Apache Kafka is a distributed software message broker for processing real-time data streams from The Apache Software Foundation. The broker allows collecting, storing and processing large amounts of data from various sources and transferring it to data processing and analytics systems. Apache Kafka uses a distributed cluster architecture with three nodes for resilience to failures.

You can create a cluster of three nodes with a ready-made Apache Kafka application. In Russia, the cluster nodes work with the configured SelectOS 1.1 operating system . In other countries - Ubuntu 24.04.

In Selectel, to create a cluster, you must first create a temporary cloud server with Apache Kafka. Creating a temporary server initiates the creation of a cluster of three nodes. Each node is a cloud server with the configuration that was selected when the temporary cloud server was created.

The name of each server in the cluster will be in the format cs-<server_name>-<cluster_number>, where <server_name> is the server name and <cluster_number> is the number of the server in the cluster from 1 to 3.

The three cloud servers created are on the same subnet as the temporary cloud server.

Next, a Kafka cluster will be configured on the three cloud servers. If the cluster is configured, the temporary cloud server is removed. If the temporary cloud server is not removed, there was a problem when configuring the Kafka cluster. To identify the problem, look at the logs.

The cost of the cluster will be calculated as the cost of three cloud servers. Learn more about the cloud platform payment model in the article Payment Model and Pricing.

Create a cluster of three nodes with Apache Kafka

To configure Apache Kafka at creation, you must specify user data - user configuration parameters that are used to connect to the OpenStack API.

Create a service user.
Create a cloud server with Apache Kafka.

1. Create a service user

Create a service user In the permission, select the role member or reader and access area Projects. Users can be created by Account owner or users with the role iam_admin.

2. Create a cloud server with Apache Kafka

In the Dashboard, on the top menu, click Products and select Cloud Servers.
Click Create Server.
Fill in the blocks:
- Name and location
- Source
- Configuration
- Disks
- Network
- Safety
- Access
- Additional settings
Check the price of the cloud server. Note that once configured, the price for a three-node cluster will be calculated using the formula: cloud server price * 3.
Click Create.

Name and location

Enter the server name. This will be set as the host name in the operating system.
Select the region and pool segment in which the server will be created. The list of available server configurations and resource costs depends on the pool segment. Once the server is created, you cannot change the pool segment.

Source

Open the Applications tab.
Select Cloud Kafka.
Optional: if you need another current or archived version of the application, select the desired version in the Version field.

Configuration

The process of selecting a configuration depends on the pool.

ru-3b, ru-7a and ru-7b
Other pools

Choose a configuration of 2 vCPU and 8 GB RAM if the server is needed for development and testing. If the server is needed for moderate workloads in a productive environment, select a configuration of 8 vCPUs and 32 GB RAM or more.

Two types of server configurations are available for all rulers except Shared and Dedicated:

fixed configurations - configurations of lines with different technical characteristics in which the resource ratio is fixed;
arbitrary configurations - configurations in which any resource ratio can be specified.

The configurations use different processors depending on the line and pool segment. You can customize the selected configuration. After the server is created, you can change the configuration.

Fixed configuration
Arbitrary configuration

Open the tab with the ruler.
Click Fixed.
Optional: you can customize the configuration:

3.1 Expand the block with description of configuration settings.

3.2 Optionally: select the processor manufacturer. Manufacturer selection is not available in all pools.

3.3 Optional: If you do not want the vCPU of the cloud server to be assigned to physical CPU cores, clear the **Dedicated Co **res checkbox. See the Dedicated Cores instruction for more information.

3.4 Optional: If you want to disable Hyper-Threading for a server with dedicated cores, clear the Hyper-Threading (SMT) checkbox .

3.5 Optional: if you create a server with dedicated cores and want to place a multiprocessor server on one NUMA-node, check the checkbox Mandatory placement on one NUMA-node. You can host a server with 4 vCPUs or more on a single NUMA-node. If cloud server resources cannot be placed on a single node, it will not be created. For more information, see the Placement on a single NUMA node section of the Dedicated Cores tutorial.
Select a configuration.
If both local and network volumes are available in the selected configuration, select the disk to be used as the boot disk:
- local disk - check the box Local SSD NVMe disk. A server with a local disk can only be created from images and applications;
- network drive - do not check the checkbox.
The amount of RAM allocated to the server may be less than the amount specified in the configuration - the operating system kernel reserves some RAM depending on the kernel version and distribution. You can check the allocated amount on the server with sudo dmesg | grep Memory.

Open the tab with the ruler.
Click Arbitrary.
Optional: you can customize the configuration:

3.1 Expand the block with description of configuration settings.

3.2 Optionally: select the processor manufacturer. Manufacturer selection is not available in all pools.

3.3 Optional: If you do not want the vCPU of the cloud server to be assigned to physical CPU cores, clear the **Dedicated Co **res checkbox. See the Dedicated Cores instruction for more information.

3.4 Optional: If you want to disable Hyper-Threading for a cloud server with dedicated cores, clear the Hyper-Threading (SMT) checkbox .

3.5 Optional: If you create a cloud server with dedicated cores and want to host a multiprocessor server on a single NUMA node, check the **Mandatory Hosting on a single NUMA n **ode checkbox. You can host a cloud server with 4 vCPUs or more on a single NUMA-node. If the cloud server resources cannot be placed on one node, it will not be created. For more information, see the Placement on a single NUMA node section of the Dedicated Cores tutorial.
Specify the number of vCPUs and the size of RAM.
If you want to add GPUs to the server, click Add GPU, select the GPU type, and specify the number of GPUs. The selected lineup will change to a GPU lineup. If you selected a non-GPU optimized image in the Source block and you have the Auto Select GPU Image toggle switch enabled, the selected image will automatically change to a GPU optimized image.
If both local and network volumes are available in the selected configuration, select the disk to be used as the boot disk:
- local disk - check the box Local SSD NVMe disk. A server with a local disk can only be created from images and applications;
- network drive - do not check the checkbox.
The amount of RAM allocated to the server may be less than the amount specified in the configuration - the operating system kernel reserves some RAM depending on the kernel version and distribution. You can check the allocated amount on the server with sudo dmesg | grep Memory.

Two types of configurations are available for the server:

fixed configurations - Several ranges with different specifications in which the resource ratio is fixed;
arbitrary configurations where any resource ratio can be specified.

The configurations use different processors depending on the line and pool segment. After the server is created, you can change the configuration.

Fixed configuration
Arbitrary configuration

Click Fixed.
Open the tab with the desired ruler.
Select a configuration.
If both local and network volumes are available in the selected configuration, select the disk to be used as the boot disk:
- local disk - check the box Local SSD NVMe disk. A server with a local disk can only be created from images and applications;
- network drive - do not check the checkbox.
The amount of RAM allocated to the server may be less than the amount specified in the configuration - the operating system kernel reserves some RAM depending on the kernel version and distribution. You can check the allocated amount on the server with sudo dmesg | grep Memory.

Click Arbitrary.
Specify the number of vCPUs and the size of RAM.
If you want to add GPUs to the server, click Add GPU, select the GPU type and specify the number of GPUs. If in the Source block you have selected an image without GPU optimization and you have enabled the Auto Select GPU Image toggle switch, the selected image will automatically change to a GPU-optimized image.
If both local and network volumes are available in the selected configuration, select the disk to be used as the boot disk:
- local disk - check the box Local SSD NVMe disk. A server with a local disk can only be created from images and applications;
- network drive - do not check the checkbox.
The amount of RAM allocated to the server may be less than the amount specified in the configuration - the operating system kernel reserves some RAM depending on the kernel version and distribution. You can check the allocated amount on the server with sudo dmesg | grep Memory.

Disks

If you did not check the Local SSD NVMe disk checkbox when setting up the configuration, the first specified network volume will be used as the server boot disk. To configure it:

1.1 Select the type of network boot disk.

1.2 Specify the size of the network boot disk in GB or TB. Observe the maximum size limits for network volumes.

1.3 If you selected the Universal v2 disk type, specify the total number of read and write operations in IOPS. After creating the disk, you can change the number of IOPS - decrease or increase. There is no limit to the number of IOPS changes.
Optional: add an additional server network volume:

2.1 Click Add.

2.2 Select the type of network drive.

2.3 Specify the network volume size in GB or TB. Observe the maximum size limits of the network volumes.

2.4 If you selected the Universal v2 disk type, specify the total number of read and write operations in IOPS. After creating a disk, you can change the number of IOPS - decrease or increase. There is no limit to the number of IOPS changes.

After the server is created, you can connect new additional disks.

Network

A server can be added to a new private subnet or to an existing one.

Private - a subnet without access from the Internet or with a single static public IP address. The public IP address connects to a server that will be accessible from the Internet.

Existing private subnet
New private subnet

Click Private Subnet.
Expand the block with private subnet settings.
In the Subnet field, select an existing subnet that is connected to the cloud router with access to the external network.
In the Private IP field, specify the private IP address of the server.

Click Private Subnet.
Expand the block with private subnet settings.
If you have existing networks in the project, in the Subnet field, select New Subnet.
Optional: change the CIDR of the subnet.
Optional: enable the DHCP toggle switch.
Optional: in the Gateway field, change the IP address of the default gateway.
In the Network field, select the existing network where the subnet will be created or New network. For a private subnet with a public IP address, a router will be automatically created router-<network_name>, where <network_name> is the network name.
If you selected New Network, enter a name for the network.

Safety

Select security groups to filter traffic on server ports. Without security groups, traffic will not be allowed. If there is no block, port security is disabled on the server network . With traffic filtering disabled, all traffic will be allowed.

Access

Place an SSH key for the project on the server for secure connection:

1.1 If the SSH key for the project is not added to the cloud platform, click Add SSH Key, enter the key name, insert the public key in OpenSSH format, and click Add.

1.2 If an SSH key for the project is added to the cloud platform, select an existing key in the SSH key field. The SSH key is available only in the pool in which it is hosted.
Optional: in the Password field for "root":

2.1 Copy the password of the root user - a user with unlimited rights to all actions on the system.

2.2 Save the password in a safe place and do not transmit it in public.

Additional settings

If you plan to create multiple servers and want to improve the fault tolerance of your infrastructure, add the server to a placement group:

1.1 To create a new group, in the Placement Group field, click New.

1.2. Select New Group and enter a group name.

1.3 Select a policy for hosting on different hosts:
- preferably soft-anti-affinity. The system will try to place servers on different hosts. If there is no suitable host when creating a server, it will be created on the same host;
- anti-affinity is mandatory. Servers in a group must be located on different hosts. If there is no suitable host when creating a server, the server will not be created.
1.4 If a group has been created, select the placement group in the Placement Group field.
To add additional information or filter servers in the list, add server tags. Operating system and configuration tags are automatically added. To add a new tag, enter a tag in the Tags field.
To add a script that will be executed by the cloud-init agent when the operating system first starts up, in the Automation block in the User data field:
- open the Text tab and paste the script with text;
- or open the File tab and upload a file with the script.
```
#cloud-config

write_files:
- path: "/opt/gomplate/values/user-values.yaml"
  permissions: "0644"
  content: |
    ServiceUsername: <username>
    ServicePassword: <password>
    ProjectId: <project_id>
    CloudRegion: <pool>
    CloudZone: <pool_segment>
    AccountNumber: <account_id>
```
Specify:
- <username> - the name of the service user. You can look in the control panel: in the top menu, click Account and go to Users → tab Service Users (the section is available only to the Account Owner and the user with the role of iam_admin);
- <password> - service user password. You can view it when creating a user or change it to a new one.
- <project_id> - Project ID. Can be copied in control panel: in the top menu click Products → Cloud Servers → open the projects menu → in the project line, click ;
- <pool> - pool in which the cloud server will be created, e.g. ru-3;
- <pool_segment> - pool segment in which the cloud server will be created, e.g. ru-3b;
- <account_id> - account number in the control panel. You can look in control panel in the upper right corner.

View logs

If the temporary cloud server is not removed, there was a problem when configuring the Apache Kafka cluster. For example, the service user has insufficient permissions, an incorrect password was entered, or the settings in user data are incorrect.

Connect to the server that you you created earlier.
To see the error log, open the cloud-init-output.log file in a vi text editor:
```
vi /var/log/cloud-init-output.log
```
Depending on the error log, grant service user access or change the user data settings.
Re-create the server.
If you cannot fix the problem yourself, create a ticket and attach logs with errors.