SparkSQL Design and Getting Started, 220722
2022-07-23 06:41:00 【啊六六六】

DataFrame: a distributed data table, i.e. data + table structure (schema)




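One reason Spark is fast: it keeps intermediate results in memory where it can, whereas MapReduce (MR) is disk-based throughout.
Another reason Spark is fast: the shuffle.
Spark's shuffle is an optimized version of the MR shuffle. It performs only the steps the job actually asks for (some subset of grouping, sorting, and partitioning), which is probably why it is much more efficient than MR.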
ETL data cleaning: filter out invalid records, then convert RDD[String] to RDD[Tuple]
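
The cleaning step above can be sketched in plain Python. The record layout (comma-separated `id,name,age`) and the helper name `parse_line` are assumptions made for illustration; the notes do not say what the data looks like. In a Spark job the same function could be passed to `rdd.map(parse_line).filter(lambda t: t is not None)` to turn an RDD of strings into an RDD of tuples.

```python
from typing import Optional, Tuple


def parse_line(line: str) -> Optional[Tuple[int, str, int]]:
    """Parse one raw 'id,name,age' line; return None if the line is invalid.

    The three-field layout is a hypothetical example, not from the notes.
    """
    parts = [p.strip() for p in line.split(",")]
    if len(parts) != 3:
        return None  # wrong number of fields
    raw_id, name, raw_age = parts
    if not name:
        return None  # empty name
    try:
        return int(raw_id), name, int(raw_age)
    except ValueError:
        return None  # non-numeric id or age


raw_lines = ["1,alice,30", "2,bob", "x,carol,25", "3,dave,41"]

# Same shape as rdd.map(...).filter(...): parse first, then drop invalid rows.
cleaned = [t for t in map(parse_line, raw_lines) if t is not None]
print(cleaned)  # [(1, 'alice', 30), (3, 'dave', 41)]
```

Returning `None` for bad rows keeps parsing and filtering in one pass; a real job might instead count or log the rejected lines.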

70%

A SparkSession contains a SparkContext object
UDF: User-Defined Function
read: reads offline (batch) data
readStream: reads data from a real-time stream

RDD: print the contents with foreach
DataFrame: print the contents with show

select id, 'id'
id: a column name
'id': a constant (a literal value)
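
The difference between the two expressions can be seen by running the query. This sketch uses Python's built-in `sqlite3` module rather than Spark, purely so that it runs without a cluster; Spark SQL treats an unquoted `id` as a column reference and a single-quoted `'id'` as a string literal in the same way. The table name `t` and its rows are made up for the example.

```python
import sqlite3

# In-memory database standing in for a Spark table (an assumption for the demo).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER)")
conn.executemany("INSERT INTO t VALUES (?)", [(1,), (2,)])

# First column: the value stored in column id.
# Second column: the constant string 'id', repeated on every row.
rows = conn.execute("SELECT id, 'id' FROM t ORDER BY id").fetchall()
print(rows)  # [(1, 'id'), (2, 'id')]
conn.close()
```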

Dataset features: supports generics, supports Schema

RDD[Row]
Integer: int
Long integer: long == bigint (a Scala/Java long corresponds to the SQL type bigint)

review,,,
preview,,,
