1. what is the alias name of pentaho data integration (pdi...


Upload: trinhkhuong

Post on 29-Aug-2019


1. What is the alias name of Pentaho Data Integration (PDI) project?

a. ETL

b. Kettle

c. Pentaho Reporting

d. BI Server

2. Pentaho Data Integration is a … tool.

a. ETL

b. Kettle

c. Pentaho Reporting

d. BI Server

3. Which application is not a part of PDI?

a. Pan

b. Carte

c. Ant

d. Spoon

4. What is the prerequisite application for PDI?

a. .NET

b. Java

c. 7Zip

d. Firefox

5. Which is not a job’s evaluation condition?

a. Follow when result is error

b. Unconditional

c. Follow when result is true

d. Follow when result is false

6. Which one is the wrong statement?

a. A job can execute a transformation

b. A transformation can execute a transformation

c. Kitchen executes a job

d. Pan executes a transformation

7. What is the required step for a job?

a. Display Msgbox Info

b. Dummy

c. Hop

d. Start

8. What is the file extension for PDI’s transformation?

a. kjb

b. xml

c. ktr

d. csv

9. What is the file extension for PDI’s job?

a. kjb

b. xml

c. ktr

d. csv

10. Which of the following concepts is not available for a job?

a. Hop

b. Step

c. Row

d. Variable

11. How do you include a file as an attachment in a job’s Mail step?

a. By listing it directly in the step’s configuration

b. Attach General file option

c. Mail Validator step

d. Get File step

12. Which step is not part of a PDI transformation?

a. Microsoft Excel Input

b. Google Analytics

c. Start

d. LDAP Input

13. What databases can be accessed by PDI?

a. Oracle

b. SQL Server

c. MySQL

d. All of the above

14. Which option do we use in the Table Output step to create or alter a table?

a. Truncate Table

b. Partition data over tables

c. SQL

d. All of the above

15. If you want to transform column headers into row values, what step do you use?

a. Row Flattener

b. Split field to Rows

c. Row Normaliser

d. Row Denormaliser

16. In the Row Normaliser step, which output field do you specify for a group of header columns?

a. Type Field

b. Field

c. Type

d. New field
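
The reshaping that questions 15 and 16 refer to, turning a group of header columns into row values, can be sketched outside PDI. The following is a minimal Python illustration only, not how PDI implements it; the function name `normalise`, the output field names `type` and `value`, and the sample data are all assumptions made up for this sketch.

```python
def normalise(rows, key_field, header_columns, type_field="type", value_field="value"):
    """Emit one output row per (input row, header column) pair."""
    out = []
    for row in rows:
        for col in header_columns:
            out.append({
                key_field: row[key_field],   # carried through unchanged
                type_field: col,             # the former column header
                value_field: row[col],       # the value that sat under that header
            })
    return out


# Made-up sample: one wide row with two header columns, q1 and q2.
wide_rows = [{"product": "A", "q1": 10, "q2": 12}]
long_rows = normalise(wide_rows, "product", ["q1", "q2"])
```

Each wide input row becomes as many output rows as there are header columns, with the former header stored as a value.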

17. What does the “Add Sequence” step do?

a. Copies or duplicates the previous step

b. Adds a series of numbers

c. Sequences a flow of many steps

d. Cross joins rows

18. What is the “Select Values” step not able to do?

a. Passing through a number of defined columns

b. Removing a number of defined columns

c. Changing meta data of columns

d. Replacing values in defined columns

19. If you want to send the file as an email attachment, which option do you need to specify in the Microsoft Excel Input step?

a. Accept Filenames from Previous Step

b. Full Filename field

c. Sheets

d. Add filenames to result

20. What step do you use to read data from an Oracle table?

a. Text File Input

b. OLAP Input

c. Table Input

d. All of the above

21. If I need to change the data format of an existing column, which step should I use?

a. Select Values

b. Modified Java Script Value

c. Filter Rows

d. Row Denormaliser

22. What does the Stream Lookup step do?

a. Looks up related data and returns one row or no row

b. Looks up related data and returns multiple rows or no row

c. Inner Join of two data streams

d. Cross Join of two data streams

23. Which file location or protocol does the Microsoft Excel Input step not support?

a. File system

b. HTTP Protocol

c. FTP Protocol

d. Hadoop File System (HDFS)

24. What step do you use for calculating numbers?

a. Calculator Step

b. Modified JavaScript Value

c. Formula

d. All of the above

25. What does the Filter Rows step do?

a. Passes only matching rows

b. Passes matching and non-matching rows

c. a and b

d. none of the above

26. From the architecture shown above, what capabilities does PDI have?

a. Consume external data sources

b. Replace scripting languages

c. Construct a staging database

d. All of the above

27. What makes a staging database differ from a data warehouse?

a. Clean data

b. Reference and helper tables

c. High quality data

d. Aggregate table

28. What makes a data warehouse significantly different from its data sources?

a. Data modelling

b. Raw data

c. Data enrichment

d. Data aggregation

29. What is the benefit of a staging database?

a. Breaking down process

b. Load balance the whole I/O

c. Helping data model reconstruction

d. All of the above

30.

fact_penjualan: FK3 sk_produk, FK1 sk_karyawan, FK2 sk_waktu, FK4 sk_cabang, jumlah_unit, nilai_penjualan

dim_karyawan: PK sk_karyawan, kode_cabang, nama_cabang, kode_karyawan, nama_karyawan

dim_waktu: PK sk_waktu, hari, kuartal, bulan, tahun

dim_cabang: PK sk_cabang, kode_propinsi, nama_propinsi, kode_cabang, nama_cabang

dim_produk: PK sk_produk, kode_kategori, nama_kategori, kode_produk, nama_produk

From the star schema shown above, what table does not come from the source system?

a. dim_produk

b. dim_cabang

c. dim_waktu

d. fact_penjualan
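
To make the layout of the star schema in question 30 more concrete, here is a rough sketch that builds a reduced version of it (only `fact_penjualan` and `dim_produk`, with a subset of their columns) in an in-memory SQLite database and joins the fact table to the dimension through the surrogate key. The column types and every data value are made up for illustration and do not come from the original diagram.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_produk (
    sk_produk   INTEGER PRIMARY KEY,   -- surrogate key
    kode_produk TEXT,                  -- business key from the source system
    nama_produk TEXT
);
CREATE TABLE fact_penjualan (
    sk_produk       INTEGER REFERENCES dim_produk(sk_produk),
    jumlah_unit     INTEGER,           -- measure
    nilai_penjualan REAL               -- measure
);
-- Made-up sample rows.
INSERT INTO dim_produk VALUES (1, 'P01', 'Produk A'), (2, 'P02', 'Produk B');
INSERT INTO fact_penjualan VALUES (1, 2, 20.0), (1, 1, 10.0), (2, 5, 75.0);
""")

# Aggregate the measures per product by joining on the surrogate key.
totals = conn.execute("""
    SELECT d.nama_produk, SUM(f.jumlah_unit), SUM(f.nilai_penjualan)
    FROM fact_penjualan AS f
    JOIN dim_produk AS d ON d.sk_produk = f.sk_produk
    GROUP BY d.nama_produk
    ORDER BY d.nama_produk
""").fetchall()
```

The fact table holds only keys and measures, and the descriptive attributes are reached through a single join per dimension.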

31. In which type of multi-dimensional modelling table do you put measure columns?

a. Transaction Tables

b. Fact Tables

c. Dimension Tables

d. Helper Tables

32. What schemas do you use in Multidimensional Modelling?

a. 3rd normal form and Palindrome

b. Star and Snowflake

c. Palindrome

d. Star

33. What is another name for a surrogate key?

a. Primary key

b. Data Warehouse key

c. Additional key

d. Technical key

34. Do we treat the surrogate key and the business key as the same key in a data warehouse?

a. Yes, they should be in the same column

b. No, they should be in different columns

c. a and b

d. a or b

35. What is the disadvantage of multi-dimensional modelling?

a. Slow Query

b. Complex Structure

c. Hard to maintain

d. Lack of clarity

36. What is the mechanism to generate a surrogate key?

a. Random Number

b. Sequence Number

c. Auto Number

d. All of the above

37. What is the most representative concept of dimension table design?

a. Flat Table

b. Many Joins

c. Less Joins

d. Denormalization

38. Where is the best location to implement a Change Data Capture mechanism?

a. Data source

b. ETL script to compare both sources

c. Destination

d. None of the above

39. Which step do you use to handle a Slowly Changing Dimension (SCD) case?

a. Combination Lookup / Update

b. Dimension Lookup / Update

c. Insert / Update

d. Update

40. What does Punch Through mean in the Dimension Lookup / Update step?

a. Creating new row

b. Updating last known row

c. Updating first known row

d. Updating all related rows

41. What is the characteristic of type 1 SCD?

a. Updating the row with the same business key

b. Inserting new row based on change, duplicating the same business key

c. All of the above

d. None of the above

42. What is the benefit of using type 1 SCD?

a. Speed up query

b. Accurately keep all historical information

c. Easy to maintain

d. All of the above

43. What is the benefit of using type 2 SCD?

a. Speed up query

b. Accurately keep all historical information

c. Easy to maintain

d. All of the above
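
Questions 41 to 43 contrast type 1 and type 2 SCD handling. The sketch below illustrates the two behaviours in plain Python, assuming the common textbook definitions (type 1 overwrites the existing row, type 2 adds a new row for the same business key). It is not how PDI's Dimension Lookup / Update step is implemented; the function names, the `is_current` flag, and the sample rows are hypothetical.

```python
import copy


def apply_type1(dim_rows, business_key, new_name):
    """Type 1: overwrite the attribute on every row with this business key."""
    for row in dim_rows:
        if row["kode_produk"] == business_key:
            row["nama_produk"] = new_name


def apply_type2(dim_rows, business_key, new_name):
    """Type 2: retire the current row and append a new version."""
    for row in dim_rows:
        if row["kode_produk"] == business_key and row["is_current"]:
            row["is_current"] = False
    next_sk = max(row["sk_produk"] for row in dim_rows) + 1  # simple sequence
    dim_rows.append({
        "sk_produk": next_sk,          # new surrogate key
        "kode_produk": business_key,   # same business key as before
        "nama_produk": new_name,
        "is_current": True,
    })


# Made-up starting dimension with a single product row.
start = [{"sk_produk": 1, "kode_produk": "P01", "nama_produk": "Old name", "is_current": True}]

type1_rows = copy.deepcopy(start)
apply_type1(type1_rows, "P01", "New name")

type2_rows = copy.deepcopy(start)
apply_type2(type2_rows, "P01", "New name")
```

After the type 1 update the dimension still has one row and the old name is gone; after the type 2 update it has two rows sharing the business key, and only the newer one is flagged as current.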

44. Which is the correct variable expression in PDI?

a. $VAR

b. %VAR

c. $$VAR$$

d. ${VAR}

45. Given the transformation stream shown above, the ${NAMA} variable is given a new value in the “Set Variables” step. Is it possible to read the new value of ${NAMA} in the “Write to log” step?

a. Yes

b. No

c. Depending on the variable scope

d. All of the above

46. Given the job shown above, the ${MYNAME} variable is set to a new value using the “Set Variables” step. Is it possible to read the new value of ${MYNAME} in the “Display Msgbox Info” step?

a. Yes

b. No

c. Depending on the variable scope

d. All of the above

47. Given the transformation shown above, what does the dashed red line hop mean?

a. Error handling of a row

b. Error handling of a batch of rows

c. Error handling of similar rows

d. All of the above

48. What file do you create when you decide to use Windows Task Scheduler to execute a transformation?

a. .ktr

b. .sh

c. .exe

d. .bat

49. What command line option do you use to refer to a transformation or a job file?

a. --file

b. --log

c. -file

d. -log

50. If you want to schedule a process through a job, which step do you use?

a. Transformation

b. Job

c. Start

d. Shell
