当前位置:网站首页>Using restcloud ETL shell component to schedule dataX offline tasks
Using restcloud ETL shell component to schedule dataX offline tasks
2022-06-26 15:15:00 【RestCloud】

RestCloud ETL Community Edition is a data integration tool , Provide visual multi data pipeline construction 、 Data source management 、 Operation monitoring and authority management functions .
1. The scene that :
For some offline data synchronization tools that are already using Alibaba DataX Users of , Want to run regularly every day ( Or manual operation )job The task is very difficult , You need to enter the control interface and click the command line , Is it simpler 、 The quick way ? Yes ! Use RestCloud ETL Shell Component build process , Realize timing / Run the task manually .
2. Overall flow chart :

3. Implementation steps :
3.1 Create a process , Pull an execution from the script execution component category Shell Script components

3.2 To configure Shell The script content
Here, the remote server is called , Fill in the... Of the remote server IP、 The logged in user ID、 Password and what needs to be executed shell Script .
Shell The script can use variables here , Dynamically obtain the data in the process or http Request incoming data .
Shell The sample script code is as follows :
============
#!/bin/bash
#!/usr/bin/env python
source /etc/profile
python /usr/wgx/datax/bin/datax.py -p "-DlastSuccessEndTime='2022-05-12 18:37:27'" \
/usr/wgx/datax/job/mysql-job3.json
============
DataX Of Job The sample code is as follows :

3.3 Configure assertion execution logic
The running results will be stored in shellResult variable , Therefore, it can be obtained and judged in the flow line .
If you return 0 On behalf of success , Other returns are unsuccessful , If it is not successful, we can send a nailing message to alert the message .

Successful routing line configuration 
Failed routing line configuration
After the failure, it will be sent through the nailing message :


Nail configuration
such , The whole process has been configured .
3.4 Configure scheduled scheduling
We can return to the process interface , Modify process configuration .
Here we use Only the primary server can run that will do .
Operation mode : Here we choose the mode of scheduled scheduling to run
Scheduling strategy : Select the scheduling strategy according to your own needs
Task scheduling mode :
Task queue collection mode : Means to put the task in the queue , Run freely by the server
All cluster servers can run simultaneously : Indicates that all servers execute
Only the primary server can run : It is applicable when the performance of the primary server is higher than that of other servers , Or just one server Run time usage
It is uniformly dispatched by the dispatching server : It means that the dispatcher of the system uniformly schedules

4. Execution results
We can manually run and view the scheduling records

The specific process operation flow line can be viewed by clicking graphic control

We enter datax You can view the log files generated by the operation

When executed Shell When the failure , Then send a pinning message

Come here , Completed through RestCloud ETL Shell Component to realize remote scheduling datax Mission .

Thank you for your attention and support ! For more features , Please join RestCloud ETL Community free download experience , Download address :https://etl.restcloud.cn
边栏推荐
- One click analysis hardware /io/ national network performance script (strong push)
- SAP sales data actual shipment data export sales
- Document 1
- Seurat转h5ad总结
- Execution of commands in the cluster
- 10 minutes to understand bim+gis fusion, common BIM data formats and characteristics
- TCP/IP协议竟然有这么多漏洞?
- TCP 复位攻击原理
- vue中缓存页面 keepAlive使用
- R语言epiDisplay包的dotplot函数通过点图的形式可视化不同区间数据点的频率、使用by参数指定分组参数可视化不同分组的点图分布、使用cex.X.axis参数指定X轴轴刻度数值标签字体的大小
猜你喜欢

Smoothing data using convolution

About selenium common. exceptions. Webdriverexception: message: an unknown server side error solution (resolved)

【ceph】CephFS 内部实现(二):示例--未消化

TCP congestion control details | 1 summary

小程序:uniapp解决 vendor.js 体积过大的问题

BLE抓包调试信息分析

数据库-视图

Database - sequence
![[tcapulusdb knowledge base] Introduction to tcapulusdb general documents](/img/7b/8c4f1549054ee8c0184495d9e8e378.png)
[tcapulusdb knowledge base] Introduction to tcapulusdb general documents
![[CEPH] cephfs internal implementation (II): example -- undigested](/img/87/6eb214550faf1f0500565c1610ff3b.png)
[CEPH] cephfs internal implementation (II): example -- undigested
随机推荐
Pytorch深度学习代码技巧
Kubernetes的pod调度
重磅白皮书发布,华为持续引领未来智慧园区建设新模式
[CEPH] cephfs internal implementation (I): Concept -- undigested
Is it safe to open a stock account through the account opening link given by the broker manager? I want to open an account
Mongodb series window environment deployment configuration
redis集群的重新分片与ASK命令
English grammar_ Adjective / adverb Level 3 - original sentence pattern
R语言使用glm函数构建泊松对数线性回归模型处理三维列联表数据构建饱和模型、使用step函数基于AIC指标实现逐步回归筛选最佳模型、使用summary函数查看简单模型的汇总统计信息
【TcaplusDB知识库】TcaplusDB单据受理-创建业务介绍
Sikuli 基于图形识别的自动化测试技术
Halcon C# 设置窗体字体,自适应显示图片
5 figures illustrate the container network
Deployment of kubernetes' controller
【小程序实战系列】小程序框架 页面注册 生命周期 介绍
Talk about the recent situation of several students from Tsinghua University
Function: crypto JS encryption and decryption
R语言glm函数逻辑回归模型、使用epiDisplay包logistic.display函数获取模型汇总统计信息(自变量初始和调整后的优势比及置信区间,回归系数的Wald检验的p值)、结果保存到csv
shell脚本多进程并发写法实例(高阶修炼)
Optimizing for vectorization