当前位置:网站首页>Data distribution optimization: how to deal with data skew?
Data distribution optimization: how to deal with data skew?
2022-07-20 21:06:00 【Yixiaojun】
In a slice cluster , The data will be distributed to different instances according to certain distribution rules . such as , In the use of Redis Cluster or Codis when , The data will first follow CRC The calculated value of the algorithm is right Slot( Logic slot ) modulus , meanwhile , be-all Slot It will be assigned to different instances by the operation and maintenance administrator . such , The data is saved to the corresponding instance .
Although this method is relatively simple to implement , But it can easily lead to a problem : Data skew .
There are two types of data skew .
- The amount of data is skewed : In some cases , The data on the instance is unevenly distributed , There is a lot of data on an instance .
- Data access : Although the amount of data on each cluster instance varies little , But the data on an instance is hot data , Very frequently visited .
If data skew occurs , So a lot of data is saved , Or the processing pressure of instances with hot data will increase , Slow down , It may even cause the instance to run out of memory resources , To collapse . This is what we should avoid when applying slicing clusters .
Today's lesson , I'll talk to you , How do these two kinds of data skew happen , How should we deal with .
Causes and Countermeasures of data skew
First , Let's look at the causes of data skew and Countermeasures .
When data skew occurs , The data is unevenly distributed on multiple instances of the slice cluster , A large amount of data is concentrated on one or several instances , As shown in the figure below :
that , How does data skew come about ? There are three main reasons , They are saved on an instance bi
边栏推荐
猜你喜欢
5. Complex graph network
风格迁移篇--CCPL:Contrastive Coherence Preserving Loss for Versatile Style Transfer多功能风格转换的对比连贯保持损失
Where is the scan function of mobile browser and what is its function
QT下载安装教程
最新《神经数据压缩导论》综述
How to install the blue bookmark plug-in in the secure browser?
3. Introduction to 3D point cloud foundation - pointnet
Install the SVN tool TortoiseSVN
场馆系统好用吗
【组合逻辑电路】——通用译码器
随机推荐
面试官:解释一下 ThreadLocal 核心原理
Kernel Pwn 入门 (4)
How to install the blue bookmark plug-in in the secure browser?
【组合逻辑电路】——显示译码器
SQL注入基础详解
数据分布优化:如何应对数据倾斜?
157 million: Shenzhen smart · smart traffic management service application platform project
接口、压力测试工具入门指南
Collection of functions commonly used in vs + QT interface design
QT下载安装教程
scrapy的入门使用
Wsl2 installation tutorial and modifying the default installation directory to other disks
在线会议中人脸面部轮廓图像提取(三)——Dlib库人脸面部轮廓图像特征提取
各位大佬,请问MySQL CDC 对于无主键表 怎么切片?
How to meet the security needs of various cloud environments in the era of digital economy?
3. Introduction to 3D point cloud foundation - pointnet
"Xiaodeng in operation and maintenance" searches log data as network security intelligence
【微信小程序】picker滚动选择器(85/100)
Two ways to add background image in vs+qt interface (very practical)
Is the venue system easy to use