STATISTICS AUSTRIA, March 2007, www.statistik.at
SuperSTAR: A joint development with STR
D. Burget
October 2007 © STATISTICS AUSTRIA
We move information
Agenda
• Integrated statistical information system (ISIS)
  – History
  – Overview
• SuperSTAR
  – Why
  – Overview
  – Joint development to extend functionality
  – Current projects
  – Milestones
ISIS
• Online since 1973
  – Mainframe terminal solution
• Graphical user interface since 2000
  – Java applet
  – Web: http://www.statistik.at/isis/
• Statistical detail data for the public
  – Registered users and guest users
  – Payment solution for registered users
  – Free access to a limited amount of data and detail levels
ISIS
• Content
  – ~6000 databases with up to 7 dimensions
  – ~2500 different classifications
• Main functionality
  – Classification maintenance
  – Production process
  – Multilingual user interface and metadata
  – Search by catalog
  – Get comment
  – Search by keywords
  – Query the database
  – Store and run queries
  – Screen output
  – File output
Why SuperSTAR
• Current limitations in ISIS
  – Maintenance problem
  – Data volume
  – Limit in number of dimensions and measures per database
  – Numerical range
• What SuperSTAR will offer
  – Solutions to the above problems
  – Microdata analysis
  – Enhancement of functionalities (joint development)
  – Benefits of standard software (support, maintenance, ...)
What is SuperSTAR
• SuperSTAR
  – Company: Space-Time Research, Australia
  – Products
    • SuperCHANNEL
      – Extract, Transform, Load (ETL)
    • SuperSERVER
      – Cube server
    • SuperADMIN
      – Administration server & console
    • SuperCROSS
      – Windows client
    • SuperWEB
      – Browser client (uses JavaScript)
SuperSTAR Components
[Diagram, © Space Time Research. Labels: data sources (RDBMS, legacy, spreadsheet, text); SuperCHANNEL; SuperSTAR database; SuperSERVER + SuperADMIN; SuperADMIN console; SuperCROSS; SuperWEB; SuperTABLE; "Server Components" and "Client Components" groupings; "Direct connection – metadata / data".]
Position of SuperSTAR in the production process
[Diagram of the statistical production process. Labels: DB2; flat files; SuperSTAR; CMS STELLENT; DB2; SuperSTAR.]
SuperSTAR new developments - I
• Classification maintenance
  – Maintain classifications for SuperSTAR
• Production process
  – Automated production process for SuperSTAR
• Import of ISIS data
  – Import ISIS base files
• API for disclosure control
  – Implement disclosure methods
• Search over all cubes
  – Full text search over all databases
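The slides do not say which disclosure methods the disclosure control API is meant to implement. Purely as an illustration, a minimal Python sketch of one widely used method, threshold-based cell suppression, could look like the following. The function name, the threshold value, and the list-of-lists table layout are assumptions made for this example and are not part of SuperSTAR.

```python
from typing import List, Optional

Table = List[List[Optional[int]]]


def suppress_small_cells(table: List[List[int]], threshold: int = 3) -> Table:
    """Hypothetical primary suppression on a table of counts.

    Any cell with a count strictly between 0 and `threshold` is replaced
    by None (suppressed). Zero cells and cells at or above the threshold
    are returned unchanged. Secondary suppression is NOT handled here.
    """
    return [
        [None if 0 < count < threshold else count for count in row]
        for row in table
    ]


if __name__ == "__main__":
    counts = [[1, 5], [0, 2]]
    print(suppress_small_cells(counts, threshold=3))
```

A real disclosure method would also need secondary suppression, so that suppressed cells cannot be recovered from row and column totals; the sketch deliberately leaves that out.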
SuperSTAR new developments - II
• File output API
  – Output of tables to XML (STF)
• Payment solution API
  – Payment based on table cells
• Thematic tree with multilingual texts
  – Multilingual catalogue
• Handling index tables
  – Handle fields which should not be aggregated
• Handling time series
  – Time has to be a mandatory field
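"Payment based on table cells" suggests that the price of a query depends on the size of the resulting table. The slides give no tariff, so the following Python sketch is only one possible reading: the function name, the per-cell price, and the free allowance are hypothetical. Amounts are kept in integer cents to avoid floating-point rounding.

```python
def table_price_cents(n_rows: int, n_cols: int,
                      cents_per_cell: int = 1,
                      free_cells: int = 0) -> int:
    """Hypothetical cell-based tariff.

    The price is the number of cells in the table, minus an optional
    free allowance, multiplied by a flat per-cell price in cents.
    """
    if n_rows < 0 or n_cols < 0:
        raise ValueError("table dimensions must be non-negative")
    billable_cells = max(0, n_rows * n_cols - free_cells)
    return billable_cells * cents_per_cell


if __name__ == "__main__":
    # A 10 x 5 table with 20 free cells at 2 cents per cell.
    print(table_price_cents(10, 5, cents_per_cell=2, free_cells=20))
```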
SuperSTAR new developments - III
• German online help
  – German help & tutorials (online and PDF)
• Portal integration
  – Integration with the Austrian government portal (single sign-on)
• Configurable data access
  – Limit access to data for guests (non-registered users)
• Porting to zLinux
  – Porting of server and web access to the strategic system platform (IBM z-Series)
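The "configurable data access" item can be pictured as a per-role policy that is checked before a query runs. The Python sketch below is a hedged illustration only: the role names, the two limits (detail level and number of cells), and their values are assumptions, not SuperSTAR configuration options.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AccessPolicy:
    """Hypothetical per-role limits on what a query may return."""
    max_detail_level: int  # deepest classification level allowed
    max_cells: int         # largest result table allowed


# Assumed example values; a real deployment would load these from config.
POLICIES = {
    "guest": AccessPolicy(max_detail_level=2, max_cells=1_000),
    "registered": AccessPolicy(max_detail_level=5, max_cells=100_000),
}


def is_query_allowed(role: str, detail_level: int, n_cells: int) -> bool:
    """Return True only if the role is known and both limits are respected."""
    policy = POLICIES.get(role)
    if policy is None:
        return False  # unknown roles get no access
    return (detail_level <= policy.max_detail_level
            and n_cells <= policy.max_cells)


if __name__ == "__main__":
    print(is_query_allowed("guest", detail_level=3, n_cells=500))
    print(is_query_allowed("registered", detail_level=3, n_cells=500))
```

Keeping the limits in a table rather than in code paths is what makes the access "configurable": changing what guests may see becomes a data change.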
SuperSTAR Architecture
[Diagram. Labels: zLinux + zOS; external SuperSERVER, SuperADMIN; internal SuperSERVER, SuperADMIN; external SuperWEB; internal SuperWEB; External; Guest; Internal; SuperCROSS; Windows; SuperCHANNEL; DB2 metadata (twice); SAN; internal data cubes; external & guest data cubes; metadata driver; Samba; logical connection; DB2 Connect.]
Samba is an Open Source/Free Software suite that has, since 1992, provided file and print services to all manner of SMB/CIFS clients, including the numerous versions of Microsoft Windows operating systems. Samba is freely available under the GNU General Public License.
SuperSTAR Status (operational projects)
• Databases (microdata)
  – National accounts (database size ~300 MB)
    • Facts: 2
    • Dimensions: 35
  – Income tax (database size ~16 GB)
    • Facts: 120
    • Dimensions: 55
  – Census (database size ~1 GB)
    • Facts: 1
    • Dimensions: 210
  – Car check (§57a) and registration (database size ~770 MB)
    • Facts: 15
    • Dimensions: 38
  – "Economic Atlas" (13 small databases)
    • Facts: 200
    • Dimensions: 5
SuperSTAR Milestones/Resources
• Start of joint development to extend functionality: mid-2006
• Start of "test projects" (depending on functionality): Q1/2007
• Port to zLinux: Q2/2007
• Complete beta release available: Q2/2008
• Data transfer (ISIS -> SuperSTAR): beginning of 2008
• 2-4 analysts/programmers (ST.AT)
• 1 project leader (FTE) + up to 6 analysts/programmers (STR)
• Overall external costs: approx. 300 k€
How other NSIs can benefit
• Set of additional functionalities in SuperSTAR (statistical functions, APIs, language, payment solution, search, ...)
• Port to other platforms with the Chiphopper process (Win32, Win64, UNIX, Linux, zLinux)
• Experience exchange for joint developments with the software industry (STR)
• Experienced partner for SuperSTAR project development (Paradigma)
• Demo, development and test site at STAT and Paradigma
Documentation
• www.statistik.at/isis
  – ISIS database, current public access
• www.str.com.au
  – Space-Time Research homepage
• www.statistik.at/xml/stf
  – XML statistical table format (STF)
• www.developer.ibm.com/isv/eserver/advantage
  – Chiphopper porting process from IBM
• www.paradigma.net
  – External application development partner