General information about the Data Processing Platform
Big Data Platform is a solution based on the partner product ITS Data Processing Platform. The platform is designed to collect, store, prepare, and analyze large volumes of data.
You can build a platform from scratch on Selectel servers or modernize your existing data platform by migrating it to Selectel servers.
The solution includes:
- selection, provisioning, and maintenance of Selectel servers;
- deployment and customization of the platform to your requirements;
- platform maintenance and monitoring.
Connect the platform
To connect the platform, leave a request at selectel.ru. We will select a platform configuration and calculate the cost.
Platform configuration
Hosting the platform requires renting at least four dedicated servers. The operating system is CentOS 7.
The platform components use free software and technologies for working with big data and building analytical systems:
- Apache Kafka (version 2.4.1) is a message broker for real-time message delivery;
- Apache Spark (version 2.4.7) is a framework for distributed data processing, including real-time stream processing;
- Apache Airflow is software for creating, scheduling, and monitoring batch data processing workflows;
- Apache Hadoop (version 2.10.1) is a framework for distributed processing of large amounts of data in clusters of computers;
- Greenplum (version 6.9.0) is a relational DBMS with massively parallel architecture for storing structured data;
- ClickHouse is a columnar analytic DBMS for real-time big data processing;
- Apache Superset is a web application for creating reports and dashboards based on ready-made queries.
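The requirements above can be captured in a small pre-flight check before you submit a request. The following is a minimal sketch, not a Selectel tool or API: the names `PlannedPlatform` and `check_requirements` are hypothetical, and the only facts it encodes are the ones stated in this section (at least four dedicated servers, CentOS 7, and the listed component versions).

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

# Requirements stated in this section.
MIN_SERVERS = 4
REQUIRED_OS = "CentOS 7"

# Component versions as listed in this section.
# None means the section does not state a version for that component.
COMPONENT_VERSIONS: Dict[str, Optional[str]] = {
    "Apache Kafka": "2.4.1",
    "Apache Spark": "2.4.7",
    "Apache Airflow": None,
    "Apache Hadoop": "2.10.1",
    "Greenplum": "6.9.0",
    "ClickHouse": None,
    "Apache Superset": None,
}


@dataclass
class PlannedPlatform:
    """Hypothetical description of a planned installation (illustration only)."""
    server_count: int
    operating_system: str


def check_requirements(plan: PlannedPlatform) -> List[str]:
    """Return a list of problems; an empty list means the plan meets the stated minimums."""
    problems = []
    if plan.server_count < MIN_SERVERS:
        problems.append(
            f"need at least {MIN_SERVERS} dedicated servers, got {plan.server_count}"
        )
    if plan.operating_system != REQUIRED_OS:
        problems.append(
            f"operating system must be {REQUIRED_OS}, got {plan.operating_system}"
        )
    return problems
```

For example, `check_requirements(PlannedPlatform(server_count=3, operating_system="CentOS 7"))` returns a single problem about the server count, while a plan with four or more CentOS 7 servers returns an empty list. The final configuration is still agreed with Selectel when your request is processed.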
Cost
Payment for the platform is deducted from your balance every month. Top up your balance before you start using the platform.
The cost of the platform depends on the number of servers and is calculated after your request is processed and the configuration is selected.