Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio Sat Apr 19, 2014 Brij Bhushan Sharma Sr. Software Engineer

Post on 21-Oct-2014

DESCRIPTION

Talend Open Studio is the most open, innovative and powerful data integration solution on the market today. Talend Open Studio for Data Integration allows you to create ETL (extract, transform, load) jobs.

TRANSCRIPT

Page 1: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Sat Apr 19, 2014
Brij Bhushan Sharma
Sr. Software Engineer

Page 2: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

What is Talend Open Studio?

Page 3: Talend Open Studio Introduction - OSSCamp 2014

What is Talend Open Studio?

● Talend Open Studio is the most open, innovative and powerful data integration solution on the market today.

● Talend Open Studio for Data Integration allows you to create ETL (extract, transform, load) jobs.

● A graphical integrated development environment with an intuitive Eclipse-based interface.

● Draw procedures by linking components; each component performs one operation.

● Produces fully editable Java (or Perl) code.

Page 4: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Main features and benefits of the solution:

● Business modeling

● Graphical development

● Drag-and-drop job design

● Metadata-driven design and execution

● Real-time debugging

● Robust execution

● A unified repository for storing and reusing metadata

Page 5: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

ETL is a common process in Data Integration

Extract: read data from different data sources (databases, flat files, spreadsheet files, web services, etc.).

Transform: convert the data into a form that can be placed in another container (databases, web services, files, etc.). Cleaning, computations and verifications are also performed at this stage.

Load: write the data to the target in the target format.
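The three stages above can be sketched in plain Java. This is only an illustration under stated assumptions: the class name `EtlSketch`, the sample rows and the semicolon-separated input format are all hypothetical, and this is not code generated by Talend Open Studio.

```java
import java.util.ArrayList;
import java.util.List;

// Minimal ETL sketch. All names and data are hypothetical illustrations.
class EtlSketch {

    // Extract: read raw lines from a source (an in-memory "flat file" here).
    static List<String> extract() {
        return List.of("1;alice ;30", "2;Bob;", "3; carol;41");
    }

    // Transform: verify each row, clean its fields, and reshape it for the target.
    static List<String[]> transform(List<String> lines) {
        List<String[]> rows = new ArrayList<>();
        for (String line : lines) {
            String[] f = line.split(";", -1); // -1 keeps trailing empty fields
            // Verification: skip rows with a missing name or age.
            if (f.length != 3 || f[1].trim().isEmpty() || f[2].trim().isEmpty()) {
                continue;
            }
            // Cleaning: trim whitespace and capitalise the name.
            String name = f[1].trim();
            name = name.substring(0, 1).toUpperCase() + name.substring(1);
            rows.add(new String[] { f[0].trim(), name, f[2].trim() });
        }
        return rows;
    }

    // Load: write the rows in the target format (comma-separated lines here).
    static List<String> load(List<String[]> rows) {
        List<String> out = new ArrayList<>();
        for (String[] r : rows) {
            out.add(String.join(",", r));
        }
        return out;
    }

    public static void main(String[] args) {
        load(transform(extract())).forEach(System.out::println);
    }
}
```

In a real job the extract and load steps would talk to files, databases or web services; the in-memory lists only keep the sketch self-contained.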

Page 6: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Important concepts in Talend Data Integration Studio

Page 7: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

What is a repository?

A repository is the storage location Talend Data Integration Studio uses to gather data related to all of the technical items that you use either to describe business models or to design Jobs.

Page 8: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

What is a project?

Projects are structured collections of technical items and their associated metadata.

All of the Jobs and business models you design are organized in Projects.

Page 9: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

What is a workspace?

A workspace is the directory where you store all your project folders. You need to have one workspace directory per connection (repository connection). Talend enables you to connect to different workspace directories if you do not want to use the default one.

Page 10: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

What is a component?

A component is a preconfigured connector used to perform a specific data integration operation, no matter what data sources you are integrating: databases, applications, flat files, Web services, etc.

A component can minimize the amount of hand-coding required to work on data from multiple, heterogeneous sources.
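One way to picture this is to model each component as a single reusable operation on a flow of rows, with a job being a chain of such operations. The sketch below is a conceptual analogy only: `ComponentSketch`, `Component`, `filterNonEmpty`, `upperCase` and `runJob` are hypothetical names, not Talend classes or APIs.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Conceptual analogy only: none of these names belong to Talend.
class ComponentSketch {

    // A "component" modelled as one operation applied to a flow of rows.
    interface Component extends Function<List<String>, List<String>> {}

    // Preconfigured operation: drop rows that are empty or whitespace-only.
    static Component filterNonEmpty() {
        return rows -> {
            List<String> out = new ArrayList<>();
            for (String r : rows) {
                if (!r.trim().isEmpty()) {
                    out.add(r);
                }
            }
            return out;
        };
    }

    // Preconfigured operation: convert every row to upper case.
    static Component upperCase() {
        return rows -> {
            List<String> out = new ArrayList<>();
            for (String r : rows) {
                out.add(r.toUpperCase());
            }
            return out;
        };
    }

    // A "job": components linked in order, each one feeding the next.
    static List<String> runJob(List<String> input, List<Component> chain) {
        List<String> rows = input;
        for (Component c : chain) {
            rows = c.apply(rows);
        }
        return rows;
    }

    public static void main(String[] args) {
        System.out.println(runJob(List.of("a", " ", "b"),
                List.of(filterNonEmpty(), upperCase())));
    }
}
```

The point of the analogy is reuse: the same two operations can be linked into any number of jobs without being rewritten by hand.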

Page 11: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

What is an item?

An item is the fundamental technical unit in a project. Items are grouped according to their types: Job Design, Business model, Context, Code, Metadata, etc. One item can include other items. For example, the business models and the Jobs you design are items; the metadata and routines you use inside your Jobs are items as well.

Page 12: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio - User Interface

Page 13: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

What is a repository?

A repository is the storage location Talend Data Integration Studio uses to gather data related to all of the technical items that you use either to describe business models or to design Jobs.

Page 14: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Small Demo on Talend Job

Page 15: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Source Data

Page 16: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Lookup Table

Page 17: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Target Table

Page 18: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

A look at the main job

Page 19: Talend Open Studio Introduction - OSSCamp 2014

tMap: Mapping Component
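The demo's actual mapping is not captured in this transcript. As a rough idea of what a mapping step between source data, a lookup table and a target table can look like, here is a plain-Java sketch. Everything in it is an assumption for illustration: the class `TMapSketch`, the helper `fullName`, the column layout, the sample values and the choice to drop unmatched rows (inner-join behaviour).

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of a mapping step; not the demo's real job.
class TMapSketch {

    // Hypothetical helper, standing in for a Java function used in a mapping expression.
    static String fullName(String first, String last) {
        return (first.trim() + " " + last.trim()).toUpperCase();
    }

    // Joins each source row {first, last, countryCode} with the lookup table
    // (countryCode -> countryName) and builds one target row per match.
    static List<String> map(List<String[]> source, Map<String, String> lookup) {
        List<String> target = new ArrayList<>();
        for (String[] row : source) {
            String country = lookup.get(row[2]); // join on the country code
            if (country == null) {
                continue; // assumption: unmatched rows are dropped (inner join)
            }
            target.add(fullName(row[0], row[1]) + "|" + country);
        }
        return target;
    }

    public static void main(String[] args) {
        List<String[]> source = new ArrayList<>();
        source.add(new String[] { "asha", "rao", "IN" });
        source.add(new String[] { "jane", "doe", "XX" });

        Map<String, String> lookup = new HashMap<>();
        lookup.put("IN", "India");

        map(source, lookup).forEach(System.out::println);
    }
}
```

Keeping the lookup in a hash map makes each join a constant-time key lookup, which is the usual reason for loading a small lookup table into memory before streaming the main flow through it.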

Page 20: Talend Open Studio Introduction - OSSCamp 2014

Java Function Used

Page 21: Talend Open Studio Introduction - OSSCamp 2014

Talend Open Studio

Thank You!

Have a question? Catch me at: [email protected]