› sites › default › files › file › ... · fury business intelligence• pdi integration...
TRANSCRIPT
FURY BUSINESS INTELLIGENCE
Presented by: Don Slaunwhite
Copyright © 2017 Canadian Internet Registration Authority (“CIRA”). All rights reserved. This material is proprietary to CIRA, and may not be
reproduced in whole or in part, in either electronic or printed formats, without the prior written authorization of CIRA.
OVERVIEW
• Architecture / Technology
• ETL Design
• Data Warehouse Design
• Reporting and Analytics
• Testing and Change Management Processes
2
PENTAHO OPEN SOURCE
3
FURY BI ARCHITECTURE
4
ETL
5
PENTAHO DATA INTEGRATION
• Spoon – IDE for creating Transformations (row level data manipulation steps )
6
ICANN Report Extract
PENTAHO DATA INTEGRATION
• Spoon – IDE for creating Jobs ( ETL workflow)
7
PENTAHO DATA INTEGRATION
• Spoon – sample transformation
8
PENTAHO DATA INTEGRATION
• Detailed execution logging with Audit schema
9
DATA WAREHOUSE
10
STAR SCHEMA – DIMENSIONAL MODEL
11
• Foundation for Analytic Queries through Analytic BI Tools (Saiku / Report Designer)
Registry Dimension (Multi-Tenancy)
RELATIONAL MODEL
12
• Used for detail querying though BI Tools
REPORTING AND ANALYTICS
13
REPORTING AND ANALYTICS
• Pentaho BA Server - Open Source Community Edition
14
REPORTING AND ANALYTICS
• Report Designer- Desktop Report Creation Tool
15
REPORTING AND ANALYTICS
• Report Designer- Desktop Reporting Tool
16
Drill Through
REPORTING AND ANALYTICS
• Plugins – Saiku Analytics
17
REPORTING AND ANALYTICS
• Community Dashboard Framework / Editor
18
REPORTING AND ANALYTICS
• Embedded Dashboard components
19
TESTING AND CHANGE MANAGEMENT
20
• Git used for source control and deployment
• QA Testing architecture leveraged for ETL Regression
6. Regress
7. Release
TAKEAWAYS FROM OPEN SOURCE BI PROJECT
21
• PDI Community Edition is a robust ETL/Data integration tool which provides a great amount of flexibility when dealing with a variety of traditional data sources as well as “Big Data” data sources.
• PDI integration with Pentaho Reporting provides the ability to easily burst complex reports to a wide audience via file management and emailing capabilities.
• PDI IDE provides for rapid ETL development and testing environment
• Deep ETL logging framework provides detailed insights into ETL performance and runtime metrics.
• PDI CE lacks scheduling infrastructure (available in Enterprise Edition) therefore requiring separate scheduling software and processes.
TAKEAWAYS FROM OPEN SOURCE BI PROJECT
22
• Pentaho Business Analytics Platform provides tools and processes to deliver advanced reporting and analytics capabilities to end users.
• Development tools are extremely flexible and configurable, but are not as easy and intuitive to use as established enterprise BI vendor software.
• For advanced reporting capabilities and dashboard creation, javascript coding skills are required.
• Documentation related to advanced reporting functionality is not readily available.
• Active community of developers provides flexibility through plugins, but plugin quality can vary greatly.
• Attractive, dynamic dashboards can be created with open source framework. Complex interactive dashboards can be integrated into other applications, but this requires more advanced coding skills.
CIRA IM TEAM
23
Jon June Shanshan Namita
QUESTIONS?
If you would like further details, contact: [email protected]
APPENDIX
25
26