1. What is the alias of the Pentaho Data Integration (PDI) project?
a. ETL
b. Kettle
c. Pentaho Reporting
d. BI Server
2. Pentaho Data Integration is a … tool.
a. ETL
b. Kettle
c. Pentaho Reporting
d. BI Server
3. Which application is not a part of PDI?
a. Pan
b. Carte
c. Ant
d. Spoon
4. What is the prerequisite application for PDI?
a. .NET
b. Java
c. 7Zip
d. Firefox
5. Which of the following is not a job’s evaluation condition?
a. Follow when result is error
b. Unconditional
c. Follow when result is true
d. Follow when result is false
6. Which one is the wrong statement?
a. A job can execute a transformation
b. A transformation can execute a transformation
c. Kitchen executes a job
d. Pan executes a transformation
7. What is the required step for a job?
a. Display Msgbox Info
b. Dummy
c. Hop
d. Start
8. What is the file extension for PDI’s transformation?
a. kjb
b. xml
c. ktr
d. csv
9. What is the file extension for PDI’s job?
a. kjb
b. xml
c. ktr
d. csv
10. Which of the following concepts is not available in a job?
a. Hop
b. Step
c. Row
d. Variable
11. How do you include a file as an attachment in a job’s Mail step?
a. By listing it directly in the step’s configuration
b. Attach General file option
c. Mail Validator step
d. Get File step
12. Which step is not part of a PDI transformation?
a. Microsoft Excel Input
b. Google Analytics
c. Start
d. LDAP Input
13. What databases can be accessed by PDI?
a. Oracle
b. SQL Server
c. MySQL
d. All of the above
14. Which option do we use in the Table Output step to create or alter a table?
a. Truncate Table
b. Partition data over tables
c. SQL
d. All of the above
15. If you want to transform column headers into row values, what step do you use?
a. Row Flattener
b. Split field to Rows
c. Row Normaliser
d. Row Denormaliser
16. In the Row Normaliser step, what output field do you specify for a group of header columns?
a. Type Field
b. Field
c. Type
d. New field
17. What does the “Add Sequence” step do?
a. Copy or duplicate previous step
b. Add a series of numbers
c. Sequence a flow of many steps
d. Cross Join rows
18. What is the “Select Values” step not able to do?
a. Passing through a number of defined columns
b. Removing a number of defined columns
c. Changing meta data of columns
d. Replacing values in defined columns
19. If you want the file to be available as an email attachment, which option do you need to specify in Microsoft Excel Input?
a. Accept Filenames from Previous Step
b. Full Filename field
c. Sheets
d. Add filenames to result
20. What step do you use to read data from an Oracle table?
a. Text File Input
b. OLAP Input
c. Table Input
d. All of the above
21. If I need to change the data format of an existing column, which step should I use?
a. Select Values
b. Modified Java Script Value
c. Filter Rows
d. Row Denormaliser
22. What does the Stream Lookup step do?
a. Looks up related data and returns one row or no row
b. Looks up related data and returns rows or no row
c. Inner Join of two data streams
d. Cross Join of two data streams
23. Which file location or protocol does the Microsoft Excel Input step not support?
a. File system
b. HTTP Protocol
c. FTP Protocol
d. Hadoop File System (HDFS)
24. What step do you use for calculating numbers?
a. Calculator Step
b. Modified Java Script Value
c. Formula
d. All of the above
25. What does Filter Rows do?
a. Passes only matching rows
b. Passes matching and non-matching rows
c. a and b
d. none of the above
26. From the architecture shown above, what capabilities does PDI have?
a. Consume external data sources
b. Replace scripting languages
c. Construct a staging database
d. All of the above
27. What makes a staging database differ from a data warehouse?
a. Clean data
b. Reference and helper tables
c. High quality data
d. Aggregate table
28. What makes a data warehouse significantly different from its data sources?
a. Data modelling
b. Raw data
c. Data enrichment
d. Data aggregation
29. What is the benefit of a staging database?
a. Breaking down the process
b. Load balancing the overall I/O
c. Helping data model reconstruction
d. All of the above
30.
fact_penjualan: FK3 sk_produk, FK1 sk_karyawan, FK2 sk_waktu, FK4 sk_cabang, jumlah_unit, nilai_penjualan
dim_karyawan: PK sk_karyawan, kode_cabang, nama_cabang, kode_karyawan, nama_karyawan
dim_waktu: PK sk_waktu, hari, kuartal, bulan, tahun
dim_cabang: PK sk_cabang, kode_propinsi, nama_propinsi, kode_cabang, nama_cabang
dim_produk: PK sk_produk, kode_kategori, nama_kategori, kode_produk, nama_produk
From the star schema shown above, what table does not come from the source system?
a. dim_produk
b. dim_cabang
c. dim_waktu
d. fact_penjualan
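For readers who want to experiment with the schema in question 30, the following is a minimal sketch that rebuilds it in an in-memory SQLite database and runs one typical star-schema query. The column types, the sample rows, and the function names (build_star_schema, load_sample_rows, sales_by_quarter_and_category) are assumptions made for illustration; only the table and column names come from the diagram.

```python
import sqlite3


def build_star_schema():
    """Create the five tables from the diagram (column types are assumed)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE dim_produk (
            sk_produk INTEGER PRIMARY KEY,
            kode_kategori TEXT, nama_kategori TEXT,
            kode_produk TEXT, nama_produk TEXT);
        CREATE TABLE dim_karyawan (
            sk_karyawan INTEGER PRIMARY KEY,
            kode_cabang TEXT, nama_cabang TEXT,
            kode_karyawan TEXT, nama_karyawan TEXT);
        CREATE TABLE dim_waktu (
            sk_waktu INTEGER PRIMARY KEY,
            hari INTEGER, kuartal INTEGER, bulan INTEGER, tahun INTEGER);
        CREATE TABLE dim_cabang (
            sk_cabang INTEGER PRIMARY KEY,
            kode_propinsi TEXT, nama_propinsi TEXT,
            kode_cabang TEXT, nama_cabang TEXT);
        CREATE TABLE fact_penjualan (
            sk_produk INTEGER REFERENCES dim_produk(sk_produk),
            sk_karyawan INTEGER REFERENCES dim_karyawan(sk_karyawan),
            sk_waktu INTEGER REFERENCES dim_waktu(sk_waktu),
            sk_cabang INTEGER REFERENCES dim_cabang(sk_cabang),
            jumlah_unit INTEGER, nilai_penjualan REAL);
    """)
    return conn


def load_sample_rows(conn):
    """Insert a few made-up rows so the query below has something to return."""
    conn.executemany("INSERT INTO dim_produk VALUES (?, ?, ?, ?, ?)", [
        (1, "K01", "Minuman", "P001", "Teh Botol"),
        (2, "K02", "Makanan", "P002", "Keripik"),
    ])
    conn.execute("INSERT INTO dim_karyawan VALUES (1, 'C01', 'Pusat', 'E01', 'Budi')")
    conn.execute("INSERT INTO dim_cabang VALUES (1, 'PR1', 'DKI Jakarta', 'C01', 'Pusat')")
    conn.executemany("INSERT INTO dim_waktu VALUES (?, ?, ?, ?, ?)", [
        (1, 15, 1, 1, 2023),   # 15 January 2023, quarter 1
        (2, 20, 2, 4, 2023),   # 20 April 2023, quarter 2
    ])
    conn.executemany("INSERT INTO fact_penjualan VALUES (?, ?, ?, ?, ?, ?)", [
        (1, 1, 1, 1, 10, 100.0),
        (1, 1, 2, 1, 5, 50.0),
        (2, 1, 2, 1, 3, 30.0),
    ])


def sales_by_quarter_and_category(conn):
    """Aggregate the fact table's measures, grouped by dimension attributes."""
    return conn.execute("""
        SELECT w.tahun, w.kuartal, p.nama_kategori,
               SUM(f.jumlah_unit), SUM(f.nilai_penjualan)
        FROM fact_penjualan AS f
        JOIN dim_waktu AS w ON w.sk_waktu = f.sk_waktu
        JOIN dim_produk AS p ON p.sk_produk = f.sk_produk
        GROUP BY w.tahun, w.kuartal, p.nama_kategori
        ORDER BY w.tahun, w.kuartal, p.nama_kategori
    """).fetchall()


if __name__ == "__main__":
    conn = build_star_schema()
    load_sample_rows(conn)
    for row in sales_by_quarter_and_category(conn):
        print(row)
```

The query touches the fact table once and joins each dimension through its surrogate key, which is the usual access pattern for a schema shaped like this one.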
31. In what type of multi-dimensional modelling table do you put measure columns?
a. Transaction Tables
b. Fact Tables
c. Dimension Tables
d. Helper Tables
32. What schemas do you use in Multidimensional Modelling?
a. 3rd normal form and Palindrome
b. Star and Snowflake
c. Palindrome
d. Star
33. What is another name for a surrogate key?
a. Primary key
b. Data Warehouse key
c. Additional key
d. Technical key
34. Do we treat the surrogate key and the business key as the same key in a data warehouse?
a. Yes, they should be in the same column
b. No, they should be in different columns
c. a and b
d. a or b
35. What is the disadvantage of multi-dimensional modelling?
a. Slow Query
b. Complex Structure
c. Hard to maintain
d. Lack of clarity
36. What mechanism is used to generate a surrogate key?
a. Random Number
b. Sequence Number
c. Auto Number
d. All of the above
37. What is the most representative concept of dimension table design?
a. Flat Table
b. Many Joins
c. Fewer Joins
d. Denormalization
38. Where is the best location to implement a Change Data Capture mechanism?
a. Data source
b. ETL script to compare both sources
c. Destination
d. None of the above
39. Which step do you use to handle a Slowly Changing Dimension (SCD) case?
a. Combination Lookup / Update
b. Dimension Lookup / Update
c. Insert / Update
d. Update
40. What does Punch Through mean in the Dimension Lookup / Update step?
a. Creating new row
b. Updating last known row
c. Updating first known row
d. Updating all related rows
41. What is the characteristic of type 1 SCD?
a. Updating the row with the same business key
b. Inserting new row based on change, duplicating the same business key
c. All of the above
d. None of the above
42. What is the benefit of using type 1 SCD?
a. Speed up query
b. Accurately keep all historical information
c. Easy to maintain
d. All of the above
43. What is the benefit of using type 2 SCD?
a. Speed up query
b. Accurately keep all historical information
c. Easy to maintain
d. All of the above
44. Which is the correct variable expression in PDI?
a. $VAR
b. %VAR
c. $$VAR$$
d. ${VAR}
45. Given the transformation stream shown above, the ${NAMA} variable is given a new value in the “Set Variables” step. Is it possible to read the new value of ${NAMA} in the “Write to log” step?
a. Yes
b. No
c. Depending on the variable scope
d. All of the above
46. Given the job shown above, the ${MYNAME} variable is set to a new value using the “Set Variables” step. Is it possible to read the new value of ${MYNAME} in the “Display Msgbox Info” step?
a. Yes
b. No
c. Depending on the variable scope
d. All of the above
47. Given the transformation shown above, what does the hop drawn as a dashed red line mean?
a. Error handling of a row
b. Error handling of a batch of rows
c. Error handling of similar rows
d. All of the above
48. What file do you create when you decide to use Windows Task Scheduler to execute a transformation?
a. .ktr
b. .sh
c. .exe
d. .bat
49. What command line option do you use to refer to a transformation or a job file?
a. --file
b. --log
c. -file
d. -log
50. You want to schedule a process through a job. Which step do you use?
a. Transformation
b. Job
c. Start
d. Shell