Apache Sqoop Cookbook


Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and time-consuming task. This handy cookbook provides dozens of ready-to-use recipes for using Apache Sqoop, the command-line interface application that optimizes data transfers between relational databases and Hadoop.

Sqoop is both powerful and bewildering, but with this cookbook’s problem-solution-discussion format, you’ll quickly learn how to deploy and then apply Sqoop in your environment. The authors provide MySQL, Oracle, and PostgreSQL database examples on GitHub that you can easily adapt for SQL Server, Netezza, Teradata, or other relational systems.

  • Transfer data from a single database table into your Hadoop ecosystem
  • Keep table data and Hadoop in sync by importing data incrementally
  • Import data from more than one database table
  • Customize transferred data by calling various database functions
  • Export generated, processed, or backed-up data from Hadoop to your database
  • Run Sqoop within Oozie, Hadoop’s specialized workflow scheduler
  • Load data into Hadoop’s data warehouse (Hive) or database (HBase)
  • Handle installation, connection, and syntax issues common to specific database vendors

Author: Kathleen Ting
Language: en
Release Date: 2013-07-02
Publisher: O'Reilly Media, Inc.


Apache Sqoop Cookbook



Author: Kathleen Ting
Language: en
Release Date: 2013
Publisher: O'Reilly & Associates Incorporated

Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and ti...

Hadoop Real World Solutions Cookbook Second Edition



Author: Tanmay Deshpande
Language: en
Release Date: 2016-03-29
Publisher: Packt Publishing

Over 90 hands-on recipes to help you learn and master the intricacies of Apache Hadoop 2.X, YARN, Hive, Pig, O...

Apache Hive Cookbook



Author: Hanish Bansal
Language: en
Release Date: 2016-04-29
Publisher: Packt Publishing Ltd

Easy, hands-on recipes to help you understand Hive and its integration with frameworks that are used widely in...

Instant Apache Sqoop



Author: Ankit Jain
Language: en
Release Date: 2013-01-01
Publisher: Packt Publishing Ltd

Filled with practical, step-by-step instructions and clear explanations for the most important and useful task...

Programming Pig



Author: Alan Gates
Language: en
Release Date: 2016-11-09
Publisher: O'Reilly Media, Inc.

For many organizations, Hadoop is the first step for dealing with massive amounts of data. The next step? Proc...
