当前位置:网站首页>General scheme for improving reading and writing ability of online es cluster
General scheme for improving reading and writing ability of online es cluster
2022-06-24 03:17:00 【house. zhang】
The problem background :
Business is using ES Cluster read ES data , If at the same time ES Cluster write task , Will meet RT The rising situation , There will be some jitter , Especially in the computing framework, the degree of concurrency is greatly increased ES When the cluster writes, jitter occurs , At present, the big data computing cluster reduces concurrent writes . In the future, we still hope to increase the degree of concurrency , Speed up writing , Expected to be right ES Cluster read performance challenges
The current situation :
At present, it is used online 5 platform 64C 128G 1THDD, The machine configuration is relatively high , The use is relatively stable , Some jitter occurs when the cluster reads and writes a lot at the same time , It didn't happen FGC Etc , The average latency is on the order of milliseconds . The amount of data occupied by the cluster index is about 300-500G. The default configuration is used for cluster construction , No, right ES Node roles are distinguished , That is to say 5 Two nodes can undertake Master / Data / Ingest / Coordinating / Machine Learning Responsibility for , With the increase of data and business volume, it is estimated that there will be challenges in the future .
According to the current monitoring data ES The overall situation is relatively stable , Whether to expand the capacity and adjust the deployment architecture, or make a comprehensive evaluation according to the business usage and cluster performance monitoring data .
As shown in the figure below, the data monitoring data :
chart : In the past 7 Days according to cloud query response time
Data access monitoring data P99 The response time is mostly in 30ms within , In rare cases, more than 100ms situation , combination ES Monitoring cluster data ( See the final reference links and data for details ) The existing cluster architecture can be maintained temporarily , In the later stage, continue to observe the monitoring data to split the role and expand the capacity of the cluster .
ES Cluster deployment :
Basic knowledge of
Usually ES Node types with the following roles in the cluster Master / Data / Ingest / Coordinating / Machine Learning
The roles of each role are as follows :
- Master node , In charge of the management of fragmentation 、 Cluster management , Cluster state management , If you leave the node alone Master, From the perspective of high availability and avoiding brain crack , Generally, three sets are configured in the production process , The cluster will automatically select 1 Taiwan is the main node .
- Data Node node : This node is mainly responsible for data storage , It plays a crucial role in data expansion . Reading and writing data will find the corresponding Data Node node .
- Coordinating Node node : The coordination node is mainly responsible for coordinating the requests of the client , Distribute the received request to the appropriate node , And put the results together . For example, the client requests to query the data of an index , The coordination node will distribute the request to the... That holds the relevant data DataNode node , Find the corresponding slice , The results of the query are collected and returned to . And each node plays a role by default Coordinating Node Responsibility for .
- Ingest Node: Ingest node Specially preprocess the indexed documents , Occurs before indexing real documents , Play the role of data processing .
A node will play these roles by default , In the development environment, the amount of data is usually small, and a node is usually deployed ES colony . In the production environment, it needs to be based on the amount of data , Throughput of writes and queries , Choose the right deployment method , Usually, if there are enough resources, the best practice is to set up a single role node , As shown in the figure below :
Node parameter configuration
Role configuration suggestions
role | cpu | Memory | disk |
|---|---|---|---|
Master | Low configuration | Low configuration | Low configuration |
Data Node | High configuration | High configuration | High configuration |
ingest | High configuration | Medium configuration | Low configuration |
Coordinating | in / High configuration | in / High configuration | Low configuration |
Future plans :
Separate several nodes and deploy them into ingest role Hang a... In the front LB It mainly undertakes some data access operations , Independent several coordinatiing node Hang a... In the front LB It is mainly used for data processing, query, aggregation and reading , When there are a lot of complex queries and aggregations in the system , increase Coordinating node , Easy to increase query performance .
summary
With the increase of business volume and data volume , At present ES The cluster uses the default configuration , No, right ES How to distinguish between node roles , In the future, it is estimated that it will be under certain pressure and challenges , Currently, according to the monitoring data and ES Cluster monitoring , Temporarily meet business needs , The subsequent cluster architecture needs to be adjusted , Split roles and responsibilities . Some nodes are deployed in a mixed manner , Or completely independent , When the amount of data is too large , Disk capacity cannot meet the demand , You can add data nodes , When there are a lot of complex queries and aggregations in the system , increase Coordinating node , Increase query performance , At the same time, it can be right Coordinating and ingest、Data Split nodes , Further reduce the pressure borne by the node .
Reference data :
Index creation and data reading latency in the past six hours ( Unit millisecond )
Search and write data delay in the past six hours ( Unit millisecond )
Index creation and write data latency in the past 24 hours ( Unit millisecond )
边栏推荐
- If the cloud knows that security is important
- What are the security guarantees for cloud desktop servers? What are the cloud desktop server platforms?
- Grpc: how to add API Prometheus monitoring interceptors / Middleware?
- The server size of the cloud desktop. The cloud desktop faces the server configuration requirements
- Cloud desktop server resource planning, what are the advantages of cloud desktop
- [hot] with a budget of only 100 yuan, how to build a 1-year web site on Tencent cloud??
- Principle of efficient animation Implementation-A preliminary exploration of jetpack compose
- RI Geng series: tricks of using function pointers
- [1024 programmers' day] Why do some programmers leave work earlier than you?
- 14. Tencent cloud IOT device side learning - data template application development
猜你喜欢

2022-2028 Global Industry Survey and trend analysis report on portable pressure monitors for wards
![[summary of interview questions] zj5](/img/d8/ece82f8b2479adb948ba706f6f5039.jpg)
[summary of interview questions] zj5
![[51nod] 2106 an odd number times](/img/af/59b441420aa4f12fd50f5062a83fae.jpg)
[51nod] 2106 an odd number times

Simple and beautiful weather code

2022-2028 global medical modified polypropylene industry research and trend analysis report
![[51nod] 3216 Awards](/img/94/fdb32434d1343040d711c76568b281.jpg)
[51nod] 3216 Awards

What is etcd and its application scenarios

2022-2028 global anti counterfeiting label industry research and trend analysis report
![[51nod] 3047 displacement operation](/img/cb/9380337adbc09c54a5b984cab7d3b8.jpg)
[51nod] 3047 displacement operation

Sorting out of key vulnerabilities identified by CMS in the peripheral management of red team (I)
随机推荐
Cp/rm/mv parameters
EIP maximum EIP EIP remote desktop access
Get to know MySQL database
Understanding Devops from the perspective of decision makers
What does elastic public IP mean? The advantages of elastic public IP
Can elastic public IP be bound to a home server? The difference between elastic public IP and fixed IP
[51nod] 3047 displacement operation
Why do cloud desktops use rack servers? Why choose cloud desktop?
Applicationclientprotocol of yarn source code
2022-2028 global indoor pressure monitor and environmental monitor industry research and trend analysis report
Tencent cloud CIF engineering efficiency summit ends perfectly
What technology does cloud computing elasticity scale? What are the advantages of elastic scaling in cloud computing?
2022-2028 global cancer biopsy instrument and kit industry research and trend analysis report
What are the advantages of EIP? What is the relationship between EIP and fixed IP?
Coding CD of Devops
Go program lifecycle
Tencent cloud CVM starts IPv6
Shopee Clickhouse cold and hot data separation storage architecture and Practice
RI Geng series: tricks of using function pointers
Actual combat | how to use micro build low code to realize tolerance application