当前位置:网站首页>RDD的执行原理
RDD的执行原理
2022-06-24 06:44:00 【斯沃福德】
从计算的角度来讲,数据处理过程中需要计算资源(内存 & CPU)和计算模型(逻辑)。执行时,需要将计算资源和计算模型进行协调和整合。
流程概括:
①准备资源
②创建Driver和Executor节点
②然后将应用程序的数据处理逻辑分解成一个一个的计算任务task。
③然后将任务task发到已经分配资源的计算节点executor上, 按照指定的计算模型进行数据计算。最后得到计算结果
1. 启动 Yarn 集群环境(准备资源)

2. Spark 通过申请资源创建调度节点Driver和计算节点Executor

Driver和Executor都是运行在NodeManager上面的 !
ResourceManager是用于管理的,所以真正运行任务的是NodeManager
3. Spark 框架根据需求将计算逻辑根据分区划分成不同的task任务

Driver用于在Executor节点之间调度task任务
多个RDD会组合形成关联,再分解为多个Task任务,并放到TaskPool任务池中(因为需要调度task任务)
4. 调度节点Driver将任务根据计算节点状态发送到对应的计算节点进行计算

调度节点Driver会将Task从任务池中取出,然后根据节点状态、首选位置来发送到不同的Executor进行计算
从以上流程可以看出 RDD 在整个流程中主要用于将逻辑进行封装,并生成 Task 发送给Executor 节点执行计算
边栏推荐
- [learn FPGA programming from scratch -42]: Vision - technological evolution of chip design in the "post Moorish era" - 1 - current situation
- [MySQL usage Script] clone data tables, save query data to data tables, and create temporary tables
- (CVE-2020-11978)Airflow dag中的命令注入漏洞复现【vulhub靶场】
- [equalizer] bit error rate performance comparison simulation of LS equalizer, def equalizer and LMMSE equalizer
- What are the dazzling skills of spot gold?
- MySQL case: analysis of full-text indexing
- New ways to play web security [6] preventing repeated use of graphic verification codes
- 6000多万铲屎官,捧得出一个国产主粮的春天吗?
- How can win11 set the CPU performance to be fully turned on? How does win11cpu set high performance mode?
- 向量操作与坐标转换相关方法
猜你喜欢

How to connect the Bluetooth headset to the computer and how to connect the win10 computer to the Bluetooth headset

Camera calibration (calibration purpose and principle)

Prefix and topic training

Win10 build webservice

MaxCompute远程连接,上传、下载数据文件操作
![[equalizer] bit error rate performance comparison simulation of LS equalizer, def equalizer and LMMSE equalizer](/img/45/61258aa20cd287047c028f220b7f7a.png)
[equalizer] bit error rate performance comparison simulation of LS equalizer, def equalizer and LMMSE equalizer
![[MRCTF2020]千层套路](/img/8e/d7b6e7025b87ea0f43a6123760a113.png)
[MRCTF2020]千层套路

20 not to be missed ES6 tips
![[image fusion] image fusion based on directional discrete cosine transform and principal component analysis with matlab code](/img/21/a5a973f06ea002755a8a2a4431dcd8.png)
[image fusion] image fusion based on directional discrete cosine transform and principal component analysis with matlab code
![[image feature extraction] image feature extraction based on pulse coupled neural network (PCNN) including Matlab source code](/img/b3/26cfa385aa357c3a7a77e9db47e94c.png)
[image feature extraction] image feature extraction based on pulse coupled neural network (PCNN) including Matlab source code
随机推荐
How can genetic testing help patients fight disease?
Commandes de console communes UE
Shader 常用函数
Bjdctf 2020 Bar _ Babystack
The first common node of two linked lists_ The entry of the link in the linked list (Sword finger offer)
Alibaba cloud full link data governance
选择器(>,~,+,[])
RDD basic knowledge points
The latest crawler tutorial in 2021: video demonstration of web crawling
Deploy L2TP in VPN (Part 1)
The fund management of London gold is more important than others
[image fusion] image fusion based on NSST and PCNN with matlab code
Detailed explanation of C language compilation, link and operation
Software performance test analysis and tuning practice path - JMeter's performance pressure test analysis and tuning of RPC Services - manuscript excerpts
Global and Chinese market of anion sanitary napkins 2022-2028: Research Report on technology, participants, trends, market size and share
[image fusion] multi focus and multi spectral image fusion based on pixel saliency and wavelet transform with matlab code
[image fusion] image fusion based on directional discrete cosine transform and principal component analysis with matlab code
Face pincher: a hot meta universe stylist
【Vulhub靶场】】zabbix-SQL注入(CVE-2016-10134)漏洞复现
Win11 points how to divide disks? How to divide disks in win11 system?