Sqoop: Introduction, Installation, and Deployment

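Introduction:

Apache Sqoop is a tool designed for efficiently transferring data between Apache Hadoop and structured data stores such as relational databases. You can use Sqoop to import data from an external structured data store into the Hadoop Distributed File System (HDFS) or related systems such as Hive and HBase. Conversely, Sqoop can extract data from Hadoop and export it to external structured data stores such as relational databases and enterprise data warehouses.
Sqoop is built for bulk transfer of large data sets: it splits a data set into chunks and creates Hadoop tasks to process each chunk.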
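
To make the chunking idea concrete, here is a minimal sketch of dividing a numeric key range into contiguous pieces, one per task. It is a simplified illustration, not Sqoop's actual splitter code; the function name `split_ranges` and the example key range are assumptions made for this example.

```shell
# Simplified illustration only (not Sqoop's actual splitter code):
# divide the inclusive key range [min, max] into n contiguous chunks,
# one chunk per hypothetical map task.
split_ranges() (
  min=$1; max=$2; n=$3
  span=$(( max - min + 1 ))
  i=0
  while [ "$i" -lt "$n" ]; do
    lo=$(( min + span * i / n ))
    hi=$(( min + span * (i + 1) / n - 1 ))
    echo "$lo-$hi"
    i=$(( i + 1 ))
  done
)

# Example: split_ranges 1 100 4
```

Running `split_ranges 1 100 4` prints the four ranges 1-25, 26-50, 51-75, and 76-100. In Sqoop itself, the column used for splitting and the number of parallel tasks are typically controlled with the `--split-by` and `--num-mappers` import options.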

The installation and deployment steps are as follows:

1. Download and extract the installation package

The author uses sqoop-1.4.6-cdh5.7.0.tar.gz.

    # Download the package with wget (you can also download it locally and upload it to the Linux host)
    [hadoop@hadoop000 software]$ pwd
    /home/hadoop/software
    [hadoop@hadoop000 software]$ wget http://archive.cloudera.com/cdh5/cdh/5/sqoop-1.4.6-cdh5.7.0.tar.gz

    # Extract the archive
    [hadoop@hadoop000 software]$ tar -xzvf sqoop-1.4.6-cdh5.7.0.tar.gz -C /home/hadoop/app/

    # Inspect the extracted directory
    [hadoop@hadoop000 sqoop-1.4.6-cdh5.7.0]$ ls -lh
    total 1.9M
    drwxr-xr-x 2 hadoop hadoop  4.0K Jul  3 16:00 bin                             -- executable scripts
    -rw-rw-r-- 1 hadoop hadoop   60K Mar 24  2016 build.xml
    -rw-rw-r-- 1 hadoop hadoop  1.1K Mar 24  2016 cdh.build.properties
    -rw-rw-r-- 1 hadoop hadoop   35K Mar 24  2016 CHANGELOG.txt
    drwxr-xr-x 4 hadoop hadoop  4.0K Jul  3 16:00 cloudera
    -rw-rw-r-- 1 hadoop hadoop  6.8K Mar 24  2016 cloudera-pom.xml
    -rw-rw-r-- 1 hadoop hadoop  9.7K Mar 24  2016 COMPILING.txt
    drwxr-xr-x 2 hadoop hadoop  4.0K Jul  3 16:00 conf                            -- configuration files
    drwxr-xr-x 5 hadoop hadoop  4.0K Jul  3 16:00 docs                            -- documentation
    drwxr-xr-x 2 hadoop hadoop  4.0K Jul  3 16:00 ivy
    -rw-rw-r-- 1 hadoop hadoop   17K Mar 24  2016 ivy.xml
    drwxr-xr-x 2 hadoop hadoop  4.0K Jul  3 16:00 lib                             -- dependency JARs
    -rw-rw-r-- 1 hadoop hadoop   15K Mar 24  2016 LICENSE.txt
    -rw-rw-r-- 1 hadoop hadoop   505 Mar 24  2016 NOTICE.txt
    -rw-rw-r-- 1 hadoop hadoop   19K Mar 24  2016 pom-old.xml
    -rw-rw-r-- 1 hadoop hadoop  1.1K Mar 24  2016 README.txt
    -rw-rw-r-- 1 hadoop hadoop 1012K Mar 24  2016 sqoop-1.4.6-cdh5.7.0.jar        -- the complete Sqoop JAR
    -rw-rw-r-- 1 hadoop hadoop  6.5K Mar 24  2016 sqoop-patch-review.py
    -rw-rw-r-- 1 hadoop hadoop  641K Mar 24  2016 sqoop-test-1.4.6-cdh5.7.0.jar
    drwxr-xr-x 7 hadoop hadoop  4.0K Mar 24  2016 src                             -- source code
    drwxr-xr-x 4 hadoop hadoop  4.0K Jul  3 16:00 testdata
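
If you script this step, it can help to make the extraction safe to re-run. The following is a minimal sketch, assuming the archive's top-level directory has the same name as the file minus `.tar.gz` (which holds for the package above); the helper name `install_tarball` is hypothetical.

```shell
# Hypothetical helper: extract a .tar.gz into a destination directory,
# skipping the work if the expected top-level directory already exists.
# Assumes the archive's top-level directory matches the file name
# without the .tar.gz suffix.
install_tarball() (
  tarball=$1; dest=$2
  name=$(basename "$tarball" .tar.gz)
  if [ -d "$dest/$name" ]; then
    echo "already extracted: $dest/$name"
  else
    mkdir -p "$dest" &&
      tar -xzf "$tarball" -C "$dest" &&
      echo "extracted: $dest/$name"
  fi
)

# Example:
#   install_tarball /home/hadoop/software/sqoop-1.4.6-cdh5.7.0.tar.gz /home/hadoop/app
```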

2. Configure environment variables

    # Add the Sqoop environment variables, either globally or only in your own user profile
    [hadoop@hadoop000 ~]$ sudo vi /etc/profile
    export SQOOP_HOME=/home/hadoop/app/sqoop-1.4.6-cdh5.7.0
    export PATH=$SQOOP_HOME/bin:$PATH

    [hadoop@hadoop000 ~]$ source /etc/profile
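
For the per-user variant, one option is to append the two `export` lines to your own profile file without duplicating them on repeated runs. This is only a sketch: the helper name `add_line_once` is hypothetical, and `~/.bash_profile` is an assumed target file.

```shell
# Hypothetical helper: append a line to a file only if that exact line
# is not already present, so the setup can be re-run safely.
add_line_once() (
  file=$1; line=$2
  touch "$file"
  grep -qxF -e "$line" "$file" || printf '%s\n' "$line" >> "$file"
)

# Example (assumes ~/.bash_profile is the per-user profile):
#   add_line_once ~/.bash_profile 'export SQOOP_HOME=/home/hadoop/app/sqoop-1.4.6-cdh5.7.0'
#   add_line_once ~/.bash_profile 'export PATH=$SQOOP_HOME/bin:$PATH'
#   source ~/.bash_profile
```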

3. Edit the Sqoop configuration file

    [hadoop@hadoop000 conf]$ pwd
    /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/conf
    [hadoop@hadoop000 conf]$ cp sqoop-env-template.sh sqoop-env.sh

    # Add the Hadoop and Hive directories
    [hadoop@hadoop000 conf]$ vi sqoop-env.sh
    #Set path to where bin/hadoop is available
    export HADOOP_COMMON_HOME=/home/hadoop/app/hadoop-2.6.0-cdh5.7.0

    #Set path to where hadoop-*-core.jar is available
    export HADOOP_MAPRED_HOME=/home/hadoop/app/hadoop-2.6.0-cdh5.7.0

    #set the path to where bin/hbase is available
    #export HBASE_HOME=

    #Set the path to where bin/hive is available
    export HIVE_HOME=/home/hadoop/app/hive-1.1.0-cdh5.7.0

    #Set the path for where zookeper config dir is
    #export ZOOCFGDIR=
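
A mistyped path in `sqoop-env.sh` is easy to miss, so a quick check that each configured directory exists can be useful. The helper name `check_home` below is hypothetical; this is a sketch of one way to do the check, not part of Sqoop.

```shell
# Hypothetical helper: report whether the directory configured for a
# given variable exists. Returns non-zero when it is missing.
check_home() {
  if [ -d "$2" ]; then
    echo "$1 ok: $2"
  else
    echo "$1 missing: $2"
    return 1
  fi
}

# Example, after loading the file edited above:
#   . /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/conf/sqoop-env.sh
#   check_home HADOOP_COMMON_HOME "$HADOOP_COMMON_HOME"
#   check_home HADOOP_MAPRED_HOME "$HADOOP_MAPRED_HOME"
#   check_home HIVE_HOME "$HIVE_HOME"
```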

4. Copy the JDBC driver JAR into the sqoop/lib directory

    # Copy the MySQL driver JAR over from Hive's lib directory
    [hadoop@hadoop000 lib]$ pwd
    /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/lib
    [hadoop@hadoop000 lib]$ cp /home/hadoop/app/hive-1.1.0-cdh5.7.0/lib/mysql-connector-java-5.1.46.jar .
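
To confirm the copy worked, you can check that a MySQL connector JAR is now present in the lib directory. The helper name `has_mysql_driver` is hypothetical, and the file-name pattern is an assumption based on the JAR name used above.

```shell
# Hypothetical helper: succeed if at least one file matching
# mysql-connector-java-*.jar exists in the given directory.
has_mysql_driver() {
  for jar in "$1"/mysql-connector-java-*.jar; do
    # When nothing matches, the pattern stays literal and -e fails.
    [ -e "$jar" ] && return 0
  done
  return 1
}

# Example:
#   has_mysql_driver /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/lib && echo "driver found"
```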

5. A quick test of Sqoop

    # Show the command help
    [hadoop@hadoop000 ~]$ sqoop help
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../hbase does not exist! HBase imports will fail.
    Please set $HBASE_HOME to the root of your HBase installation.
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../hcatalog does not exist! HCatalog jobs will fail.
    Please set $HCAT_HOME to the root of your HCatalog installation.
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../accumulo does not exist! Accumulo imports will fail.
    Please set $ACCUMULO_HOME to the root of your Accumulo installation.
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../zookeeper does not exist! Accumulo imports will fail.
    Please set $ZOOKEEPER_HOME to the root of your Zookeeper installation.
    18/07/03 16:23:05 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6-cdh5.7.0
    usage: sqoop COMMAND [ARGS]

    Available commands:
      codegen            Generate code to interact with database records
      create-hive-table  Import a table definition into Hive
      eval               Evaluate a SQL statement and display the results
      export             Export an HDFS directory to a database table
      help               List available commands
      import             Import a table from a database to HDFS
      import-all-tables  Import tables from a database to HDFS
      import-mainframe   Import datasets from a mainframe server to HDFS
      job                Work with saved jobs
      list-databases     List available databases on a server
      list-tables        List available tables in a database
      merge              Merge results of incremental imports
      metastore          Run a standalone Sqoop metastore
      version            Display version information

    See 'sqoop help COMMAND' for information on a specific command.

    # Show the Sqoop version
    [hadoop@hadoop000 ~]$ sqoop version
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../hbase does not exist! HBase imports will fail.
    Please set $HBASE_HOME to the root of your HBase installation.
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../hcatalog does not exist! HCatalog jobs will fail.
    Please set $HCAT_HOME to the root of your HCatalog installation.
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../accumulo does not exist! Accumulo imports will fail.
    Please set $ACCUMULO_HOME to the root of your Accumulo installation.
    Warning: /home/hadoop/app/sqoop-1.4.6-cdh5.7.0/../zookeeper does not exist! Accumulo imports will fail.
    Please set $ZOOKEEPER_HOME to the root of your Zookeeper installation.
    18/07/03 16:23:30 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6-cdh5.7.0
    Sqoop 1.4.6-cdh5.7.0
    git commit id
    Compiled by jenkins on Wed Mar 23 11:30:51 PDT 2016

    # The warnings appear because I have not configured HBase, ZooKeeper, or HCatalog
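
Because the version line is buried among the warnings, a small filter can pull out just the version string, for example in an install-check script. This sketch relies only on the output format shown above (a line beginning with `Sqoop ` followed by the version); the function name `parse_sqoop_version` is hypothetical.

```shell
# Hypothetical helper: read `sqoop version` output on stdin and print
# only the version string from the line that starts with "Sqoop <digit>".
parse_sqoop_version() {
  awk '/^Sqoop [0-9]/ { print $2; exit }'
}

# Example:
#   sqoop version 2>/dev/null | parse_sqoop_version
```

With the output shown above, this would print `1.4.6-cdh5.7.0`.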
