当前位置:网站首页>RDD的执行原理
RDD的执行原理
2022-06-24 06:44:00 【斯沃福德】
从计算的角度来讲,数据处理过程中需要计算资源(内存 & CPU)和计算模型(逻辑)。执行时,需要将计算资源和计算模型进行协调和整合。
流程概括:
①准备资源
②创建Driver和Executor节点
②然后将应用程序的数据处理逻辑分解成一个一个的计算任务task。
③然后将任务task发到已经分配资源的计算节点executor上, 按照指定的计算模型进行数据计算。最后得到计算结果
1. 启动 Yarn 集群环境(准备资源)

2. Spark 通过申请资源创建调度节点Driver和计算节点Executor

Driver和Executor都是运行在NodeManager上面的 !
ResourceManager是用于管理的,所以真正运行任务的是NodeManager
3. Spark 框架根据需求将计算逻辑根据分区划分成不同的task任务

Driver用于在Executor节点之间调度task任务
多个RDD会组合形成关联,再分解为多个Task任务,并放到TaskPool任务池中(因为需要调度task任务)
4. 调度节点Driver将任务根据计算节点状态发送到对应的计算节点进行计算

调度节点Driver会将Task从任务池中取出,然后根据节点状态、首选位置来发送到不同的Executor进行计算
从以上流程可以看出 RDD 在整个流程中主要用于将逻辑进行封装,并生成 Task 发送给Executor 节点执行计算
边栏推荐
- Alibaba cloud full link data governance
- [机缘参悟-29]:鬼谷子-内揵篇-与上司交往的五种层次
- 第三方软件测试公司如何选择?2022国内软件测试机构排名
- Actual target shooting - skillfully use SMB to take down the off-line host
- Learning to use BACnet gateway of building control system is not so difficult
- Étalonnage de la caméra (objectif et principe d'étalonnage)
- [pointnet] matlab simulation of 3D point cloud target classification and recognition based on pointnet
- 向量操作与坐标转换相关方法
- Deploy L2TP in VPN (Part 1)
- The first common node of two linked lists_ The entry of the link in the linked list (Sword finger offer)
猜你喜欢

Session & cookie details
![选择器(>,~,+,[])](/img/7e/2becfcf7a7b2e743772deee5916caf.png)
选择器(>,~,+,[])

buuctf misc 从娃娃抓起

与(&&)逻辑或(||),动态绑定结合三目运算

Ultra wideband pulse positioning scheme, UWB precise positioning technology, wireless indoor positioning application

Étalonnage de la caméra (objectif et principe d'étalonnage)
![buuctf misc [UTCTF2020]docx](/img/e4/e160f704d6aa754e85056840e14bd2.png)
buuctf misc [UTCTF2020]docx

Description of module data serial number positioning area code positioning refers to GBK code
![buuctf misc [UTCTF2020]docx](/img/e4/e160f704d6aa754e85056840e14bd2.png)
buuctf misc [UTCTF2020]docx

How can win11 set the CPU performance to be fully turned on? How does win11cpu set high performance mode?
随机推荐
Global and Chinese market of offshore furnaces 2022-2028: Research Report on technology, participants, trends, market size and share
bjdctf_ 2020_ babystack
Ultra wideband pulse positioning scheme, UWB precise positioning technology, wireless indoor positioning application
[mrctf2020] thousand layer routine
Fine! Storage knowledge is a must for network engineers!
【008】表格数据逐行筛选,跳出for循环及跳过本次循环思路_#VBA
图形技术之管线概念
Global and Chinese market of anion sanitary napkins 2022-2028: Research Report on technology, participants, trends, market size and share
相机标定(标定目的、原理)
20 not to be missed ES6 tips
Win10 build webservice
向量操作与坐标转换相关方法
What is the mentality of spot gold worth learning from
Blue Bridge Cup seven segment code (dfs/ shape pressing + parallel search)
L2TP connection failure guide in VPN
[equalizer] bit error rate performance comparison simulation of LS equalizer, def equalizer and LMMSE equalizer
PCL point cloud random sampling by ratio
C escape character
[pointnet] matlab simulation of 3D point cloud target classification and recognition based on pointnet
[WUSTCTF2020]爬