Using Instance Tunnel
2022-06-27 06:23:00 【The south wind knows what I mean】
1. Run a SELECT query:
odps@ odps_test_project>select * from wc_in;
ID = 20170724071705393ge3csfb8
2. Use the Instance Tunnel download command to save the query result to a local file:
odps@ odps_test_project> tunnel download -h true -cf true instance://huorong_bigdata/20220329031048178gumx85t32 C:\Users\HR\Desktop\weekReport\Weekday Wednesday.csv;
2017-07-24 15:18:47 - new session: 2017072415184785b6516400090ca8 total lines: 8
2017-07-24 15:18:47 - file [0]: [0, 8), result
downloading 8 records into 1 file
2017-07-24 15:18:47 - file [0] start
2017-07-24 15:18:48 - file [0] OK. total: 44 bytes
download OK
// View results
cat result
slkdfj
hellp
apple
tea
peach
apple
tea
teaa
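Once the file is on disk, it can be read back for a quick check. The following is a minimal Python sketch, assuming the file was downloaded with -h true -cf true so that it is a CSV file whose first row is a header. The helper name read_downloaded_csv and the column name word are hypothetical and only serve the illustration.

```python
import csv
import io


def read_downloaded_csv(stream, has_header=True):
    """Split a CSV stream into (header, rows).

    has_header mirrors -h true: the first row holds column names.
    """
    rows = list(csv.reader(stream))
    if has_header and rows:
        return rows[0], rows[1:]
    return None, rows


# In-memory stand-in for open("result.csv", newline="", encoding="utf-8").
# The column name "word" is an assumption, not taken from the real table.
sample = io.StringIO("word\r\nslkdfj\r\nhellp\r\napple\r\n")
header, rows = read_downloaded_csv(sample)
print(header, len(rows))
```

For a real file, pass the opened file object instead of the in-memory sample.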
Command format:
tunnel download instance://<[project_name/]instance_id> <path>
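When the command is generated from a script, the string can be assembled from its parts. This is a small Python sketch of that idea. The function build_instance_download is a hypothetical helper and not part of any MaxCompute SDK. It only uses the -h and -cf flags and the instance:// form shown in this article.

```python
def build_instance_download(instance_id, path, project=None,
                            header=False, csv_file=False):
    """Return a 'tunnel download instance://...' command string.

    Hypothetical helper: it formats text only and does not run anything.
    """
    # The project name is optional, matching <[project_name/]instance_id>.
    target = f"{project}/{instance_id}" if project else instance_id
    parts = ["tunnel", "download"]
    if header:
        parts += ["-h", "true"]
    if csv_file:
        parts += ["-cf", "true"]
    parts += [f"instance://{target}", path]
    return " ".join(parts) + ";"


# Instance ID taken from the SELECT example above.
print(build_instance_download("20170724071705393ge3csfb8", "result"))
```

The resulting string is meant to be pasted into or piped to the odps client; the sketch itself does not contact MaxCompute.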
Note
By default, odps sets the following parameter:
instance_tunnel_max_record=10000
This caps a download at 10,000 rows. For large tables, lift the limit in the odps_config.ini file so that downloads are no longer restricted by row count.
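To check which limit is currently configured, the key=value lines of odps_config.ini can be parsed directly. The sketch below is a rough Python illustration under the assumption that the file is a flat list of key=value lines with optional # comments. The sample text, including the project_name line, is made up for the demonstration.

```python
def read_config(text):
    """Parse flat key=value lines, skipping blanks and # comments."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:
            settings[key.strip()] = value.strip()
    return settings


def record_limit(settings):
    """Return instance_tunnel_max_record as an int, or None if unset."""
    raw = settings.get("instance_tunnel_max_record")
    return int(raw) if raw else None


# Assumed sample content; a real file would be read with open(...).read().
sample_config = """
# sample only
project_name=odps_test_project
instance_tunnel_max_record=10000
"""
limit = record_limit(read_config(sample_config))
print(limit)
```

A result of None means the key is not present in the file, in which case the client's own default applies.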
Parameter description
Required parameters
path
The path and name of the data file to download.
The data file can be saved in one of two places. To save it directly in the bin directory of the MaxCompute client, set path to the file name with its extension, in the form filename.extension. To save it elsewhere, for example in the test folder on drive D, set path to D:\test\filename.extension.
[project.]table[/partition]
The name of the target table. For a partitioned table, specify the partition down to the last level. If the table is not in the current project, also specify the name of the project that contains it.
Optional parameters
-acp
Automatically creates the target partition if the specified partition does not exist. The default value is False.
-bs
Specifies the size of each data block uploaded to Tunnel. The default value is 100 MiB (1 MiB = 1024 × 1024 bytes).
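As a rough feel for what the block size means, the number of blocks for a file can be estimated by dividing its size by the -bs value. The Python sketch below assumes the file is cut purely by byte size; the real client may split on row boundaries, so treat the result as an estimate. The function name block_count is hypothetical.

```python
import math

MIB = 1024 * 1024  # 1 MiB = 1024 x 1024 bytes, as stated above


def block_count(file_size_bytes, block_size_mib=100):
    """Estimate how many blocks a file of the given size is split into."""
    if file_size_bytes <= 0:
        return 0
    return math.ceil(file_size_bytes / (block_size_mib * MIB))


# A 250 MiB file with the default 100 MiB block size.
print(block_count(250 * MIB))
```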
-c
Specifies the encoding of the local data file. Not set by default, in which case the source data is downloaded unchanged.
-cf
Specifies whether the file is a CSV file. The default value is False.
Note: Upload supports only TXT and CSV files and uploads TXT by default. To upload a CSV file, specify the -cf parameter and make sure you are using the latest version of the MaxCompute client.
-cp
Specifies whether to compress the data locally before uploading, which reduces network traffic. The default value is True.
-dbr
Specifies whether to ignore dirty data (extra columns, missing columns, or mismatched column data types). The default value is False.
True: Ignore all data that does not match the table definition.
False: Report an error when dirty data is encountered, so that the existing data in the target table is not contaminated.
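The two -dbr behaviours can be pictured with a small Python sketch. It is an illustration of the rule described above, not the client's actual implementation. The names is_dirty and load_rows and the two-column schema are assumptions made for the example.

```python
def is_dirty(row, column_types):
    """A row is dirty if its column count or any value type does not fit."""
    if len(row) != len(column_types):
        return True  # extra or missing columns
    for value, convert in zip(row, column_types):
        try:
            convert(value)
        except ValueError:
            return True  # value cannot be converted to the column type
    return False


def load_rows(rows, column_types, discard_bad_records=False):
    """Mimic -dbr: skip dirty rows when True, stop with an error when False."""
    accepted = []
    for number, row in enumerate(rows, start=1):
        if is_dirty(row, column_types):
            if discard_bad_records:
                continue
            raise ValueError(f"dirty data at row {number}: {row!r}")
        accepted.append(row)
    return accepted


schema = [str, int]  # assumed table definition: (string, bigint)
sample_rows = [["a", "1"], ["b"], ["c", "x"], ["d", "4", "extra"]]
print(load_rows(sample_rows, schema, discard_bad_records=True))
```

With discard_bad_records=False the same input stops at the first bad row, which corresponds to the error case described for False.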
-dfp
Specifies the format of DATETIME data. The default is yyyy-MM-dd HH:mm:ss. To specify a format with millisecond precision, use tunnel upload -dfp 'yyyy-MM-dd HH:mm:ss.SSS'. For details on the DATETIME data type, see the data type version description.
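Before uploading, it can help to confirm that the values in a local file really match the chosen pattern. The Python sketch below maps the two -dfp patterns mentioned above to their strptime equivalents. The mapping table and the function parse_datetime are assumptions for illustration; the client itself interprets the pattern, not Python.

```python
from datetime import datetime

# Hand-written mapping from the two -dfp patterns in this article
# to Python strptime formats (illustration only).
PATTERNS = {
    "yyyy-MM-dd HH:mm:ss": "%Y-%m-%d %H:%M:%S",
    "yyyy-MM-dd HH:mm:ss.SSS": "%Y-%m-%d %H:%M:%S.%f",
}


def parse_datetime(text, dfp="yyyy-MM-dd HH:mm:ss"):
    """Parse text with the given -dfp pattern; raises ValueError on mismatch."""
    return datetime.strptime(text, PATTERNS[dfp])


print(parse_datetime("2017-07-24 15:18:47"))
print(parse_datetime("2017-07-24 15:18:47.123", "yyyy-MM-dd HH:mm:ss.SSS"))
```

A value with milliseconds fails against the default pattern, which is the situation in which the millisecond pattern is needed.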
-fd
Specifies the column delimiter of the local data file. The default value is a comma (,).
-h
Specifies whether the data file includes a header. The default value is False. If True, dship skips the header row and uploads the data starting from the second row.
-mbr
Specifies the number of dirty records to tolerate. The upload is terminated once this number is exceeded. The default value is 1000 records.
-ni
Specifies the NULL data identifier. The default value is an empty string.
-ow
Specifies whether the uploaded data overwrites the table or partition. The default value is False, which appends the uploaded data. An example follows.
tunnel upload -overwrite true log.txt test_project.test_table/p1="b1",p2="b2";
-rd
Specifies the row delimiter of the local data file. The default value is \r\n.
-s
Specifies whether to scan the local data file. The default value is True.
True: Scan the data first, then import it if the format is correct.
False: Import the data directly without scanning.
Only: Scan the local data only, without importing it afterwards.
-sd
Specifies the session directory.
-ss
Specifies strict schema mode. The default value is True. If False, extra data is discarded and missing fields are filled with NULL.
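The effect of turning strict schema mode off can be sketched in a few lines of Python. This only illustrates the rule stated above (extra values dropped, missing values filled with NULL). The function fit_row is hypothetical, and None stands in for NULL.

```python
def fit_row(row, column_count, strict=True):
    """Fit a row to the table width, mimicking -ss true/false."""
    if len(row) == column_count:
        return list(row)
    if strict:
        raise ValueError(f"expected {column_count} columns, got {len(row)}")
    trimmed = list(row[:column_count])                 # drop extra values
    trimmed += [None] * (column_count - len(trimmed))  # pad with NULL
    return trimmed


print(fit_row(["a", "b", "c", "d"], 3, strict=False))
print(fit_row(["a"], 3, strict=False))
```

With strict=True, both of these rows raise an error instead of being adjusted.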
-t
Specifies the number of threads. The default value is 1.
-te
Specifies the Tunnel endpoint.
-time
Specifies whether to track the upload time. The default value is False.
-tz
Specifies the time zone. The default value is the local time zone, for example Asia/Shanghai.