当前位置:网站首页>Data warehouse: Exploration and practice of integrating flow and batch
Data warehouse: Exploration and practice of integrating flow and batch
2022-07-23 06:45:00 【Freedom3568】
List of articles
brief introduction
Mention the integration of flow and batch , I have to mention the traditional big data platform —— Lambda framework . It can effectively support offline and real-time data development requirements , However, the high development and maintenance costs and inconsistent data caliber caused by the separation of the stream and batch data links cannot be ignored .
It is ideal to meet the data processing requirements of both streams and batches through a set of data links , That is, flow batch integration . In addition, we believe that there are still some intermediate stages in the integration of flow and batch , For example, it is of great significance to only realize the unification of computing or storage .
Take the example of computing unification only , Some data applications require high real-time performance , For example, we hope that the end-to-end data processing delay will not exceed one second , This is for the current open source 、 It is a great challenge for the unified storage of stream and batch . Take the data Lake as an example , Its data visibility is similar to commit Interval dependent , And then Flink do checkpoint Is related to the time interval , This feature is combined with the length of the data processing link , It can be seen that it is not easy to do end-to-end processing for one second . So for this kind of demand , It is also feasible to achieve only computational unification . Reduce the development and maintenance costs of users through unified calculation , Solve the problem of inconsistent data caliber .

In the process of the integration of flow and batch Technology , The challenges we face can
边栏推荐
- R语言绘制哑铃图
- Thread类中run和start的区别
- Jmeter-记一次自动化造数引发的BeanShell写入excel实例
- 英飞凌推出全球首款采用后量子加密技术进行固件更新的TPM安全芯片
- SFM与MVS区别
- 【机器学习基础】无监督学习(5)——生成模型
- selenium报错解决
- Zhang Li, President of China Electronic Information Industry Development Research Institute: building China's leading open source value chain
- China's open source is moving towards the second tier!
- 1. Summary of strengthening learning foundation
猜你喜欢
![[machine learning] principle and practice of model selection (performance measurement)](/img/1c/ef18230452c613aad67fcafc04e87d.png)
[machine learning] principle and practice of model selection (performance measurement)

Redis cluster setup

How to configure a cute little shark theme for typera?

ICML2022 | ROCK: 关于常识因果关系的因果推理原则

【机器学习】模型选择(性能度量)原理及实战

中年危机提前来临的一代人,还能够从容生活吗?

Thread类中run和start的区别
![Rllib学习 - [4] - AlgorithmConfig详细介绍 [Pytorch版]](/img/1e/95078ad64a17686463547e8ba87cb1.png)
Rllib学习 - [4] - AlgorithmConfig详细介绍 [Pytorch版]

图文并茂演示小程序movable-view的可移动范围

Globally and Locally Consistent Image Completion 论文笔记
随机推荐
数组-稀疏矩阵的转置
OWA邮件系统登录双因子认证(短信认证)方案
2022华为开发者大赛中国区开幕式重磅启动!
Properties of node process object
Simple face detection using mediapipe and opencv
Drawing lollipop chart with R language
XML架构
使用mediapipe和OpenCV实现摄像头实时人脸检测
R语言求导数
Difference between SFM and MVs
[机器学习] -[传统分类问题] - 朴素贝叶斯分类 + 逻辑回归分类
Dao智能合约Dapp系统开发技术
Urllib download (urlretrieve())
暑假笔记1
lasso回归结果美化
如何打包你的项目并且可以让别的用户通过pip安装
剑指offer专项突击版第7天
[JS] ES6 arrow function
【机器学习基础】无监督学习(5)——生成模型
JDBC MySQL基本操作