Data Integration (DI) Basics

Talend Open Studio for Data Integration dramatically improves the efficiency of data integration job design through an easy-to-use graphical development environment. It enables rapid deployment and reduces maintenance costs with prebuilt connectors to all source and target systems, with support for all types of data integration, data migration, and data synchronization operations.

This course enables you to use Talend Open Studio for Data Integration for real work as quickly as possible. It focuses on the basic functionality of the Studio and how it can be used to build reliable, maintainable data integration tasks that solve practical problems: extracting data from common database and file formats, transforming it, and integrating it into targets. The skills learned in this course are applicable to the entire Talend product line. This course serves as the basis for all Talend training and is the prerequisite for many of the other courses.

Duration 2 days
Target Audience Anyone who wants to use Talend Studio to perform data integration and management tasks. Examples include Project Managers, Software Developers, Business Intelligence Experts, and Database Administrators.
Prerequisites Basic computer skills. Familiarity with Java or another programming language, as well as SQL and general database concepts is helpful.
Course Objectives
After completing this class, you will be able to:
  • Start Talend Open Studio for Data Integration
  • Link Talend Studio to your Talend account, registering a new account if necessary
  • Create a Talend project to contain tasks
  • Create a Talend Job to perform a specific task
  • Add and configure components to handle data input, data transformation, and data output
  • Run a Talend Job and examine the results
  • Build a visual model of a Talend Job or project
  • Copy an existing Job as the basis for a new Job
  • Store configuration information centrally for use in multiple components
  • Extend data from one source with data extracted from a second source
  • Log data rows in the console rather than storing them
  • Troubleshoot a join by examining failed lookups
  • Use components to filter data
  • Generate sample data rows
  • Execute Job sections conditionally
  • Duplicate output flows
  • Create a schema for use in multiple components
  • Create variables for component configuration parameters
  • Run a Job to access specific values for the variables
  • Employ mechanisms to kill a Job under specific circumstances
  • Include Job elements that change the behavior based on the success or failure of individual components or subjobs
  • Connect to a database from a Talend Job
  • Use a component to create a database table
  • Write to and read from a database table from a Talend Job
  • Filter unique data rows
  • Perform aggregate calculations on rows
  • Write data to an XML file from a Talend Job
  • Use components to create an archive and delete files
  • Use a Talend component to access a Web Service
  • Extract specific elements from a Web Service reply
  • Store Web Service access information for use in multiple Jobs
  • Write an XML document to a file
  • Add comments to document a Job and its components
  • Generate HTML documentation for a Job
  • Export a Job
  • Run an exported Job independently of Talend Open Studio
  • Create a new version of an existing Job
Course Agenda Day 1:
  • Start Talend Open Studio
  • Create a Job
  • Read Input
  • Transform Data
  • Run a Job
  • Combine Columns
  • Build a Business Model
  • Duplicate a Job
  • Create Metadata
  • Create a Join
  • Capture Join Failures
  • Correct the Lookup
  • Update the Business Model
  • Generate Rows
  • Create the Join
  • Add Constraints
  • Replicate Output
  • Use Context Variables
  • Use Repository Context Variables
  • Handle Errors

Day 2:

  • Work with Databases
  • Create a New Project
  • Create Customer Metadata
  • Create the Customer Table
  • Create the Product Table
  • Set up Sales Files
  • Set Up the Shop Stag Table
  • Join the Data
  • Finalize the Job
  • Work with Web Services
  • Access a Web Service
  • Use Advanced WSDL Process
  • Document the Job
  • Run Jobs Standalone
  • Export a Job