Sqoop Command

Compress command:

To decrease the size of data after importing to HDFS, we can use the option –compress while executing the command.

sqoop import --connect jdbc:mysql://localhost:3306/retail_db --username root --password cloudera --table categories --hive-import --create-hive-table --hive-table vanshu.sqoop_retailer -m1 --compress;

 

direct command:

Sqoop can handle bulk transfers very well. We can speed up the transfers by using the –direct parameter. Sqoop can handle bulk transfers very well. You can speed up the transfers by using the –direct parameter.

sqoop import --connect jdbc:mysql://localhost:3306/retail_db --username root --password cloudera --table categories --direct --target-dir /user/cloudera/Retailer-sqoop--split-by category_id-m1;

 

 

Incremental Append in Sqoop:

1)Insert  new values in tables present in Mysql:

INSERT INTO categories VALUES(60,9,"Cricket12");

INSERT INTO categories VALUES(61,9,"Volley ball");

INSERT INTO categories VALUES(62,8,"Basket ball");

 

2) Sqoop import incremental append command:

sqoop import --connect jdbc:mysql://localhost:3306/retail_db --username root --password cloudera --table categories --target-dir /user/cloudera/sqoop-incre1 -m1--incremental append--check-column category_id--last-value 59;

 

 

Check-column: used as reference column

59: last value which is present in table before inserting new values.

Retrivesparticular columns.

 

sqoop import --connect jdbc:mysql://localhost:3306/retail_db --username root --password cloudera --table categories --columns "category_name, category_department_id" --split-by category_id --target-dir /user/cloudera/retailer-sqoop -m1

 

Incremental Last modified:

·        Works with records which are having time stamp.

·        We need to mention the last time stamp which is present other than which has been inserted newly.

1)Insert values into table orders.

insert into orders values(68884,"2018-05-30 00:00:04",5584,"ON_HOLD");

 

2)Syntax:

sqoop import --connect jdbc:mysql://localhost:3306/retail_db --username root --password cloudera --table orders --target-dir /user/cloudera/sqoop-inc123 -m1 --incremental lastmodified --check-column order_date --last-value "2014-07-23 00:00:00";

 

Sqoop-HBase Integration:

INTERNAL MYSQL

·       Create a table in Hbase:

·       Sqoop import command for importing data into hbase table:

sqoop import --connect jdbc:mysql://quickstart.cloudera:3306/gusto --username root --password cloudera--table visit123 --hbase-table visits --column-family v_details--hbase-row-key name -m 1;

Hbase-row-key name: row key  from table

v_details: column family from Hbase

visit123: table in mysql

·       Verify in Hbase :

scan ‘visits’;

 

EXTERNAL MYSQL:

·       Create a table in Hbase:

Create ‘data’,’data_details’

·       Sqoop import command for importing data into hbase table:

sqoop import --connect jdbc:mysql://192.168.56.1/vanshu --username root --password root --table data1--hbase-table data --column-familydata_details --hbase-row-key sno -m 1

 

·       Verify in Hbase :

Scan ‘data’

 


Comments

Popular posts from this blog

Hadoop

Problem Statement Of Real Estate Use Cases

Problem Statement Of Bank Marketing analysis