2016 DLI ON · The DLI community welcomed new DLI members: • Welcomed Trinity Western University, Langley BC Outreach to new and interested institutions continues to be an important
28
Data Liberation Initiative Update 2016 DLI ON December 2016 Chantal Ripp Mike Braia
The Data Liberation Initiative (DLI) is proud of the accomplishments that, together with its members, it has realized over the last 20 years of operation. The program is truly unique, whereby through active leadership, participation and collaboration from its dynamic community, we have succeeded in our shared vision of promoting a culture of data use and literacy in Canadian post-secondary institutions.
Statistics Canada • Statistique Canada4
DLI UPDATE - Changes in staff
2016-12-07
The DLI team, from L to R:
Farrah Sanjari, DDI Officer;Chantal Ripp, Unit Heat;Jennifer Scharf, Former Stat Ass’tDavid Price, ChiefRenée Rocan, Project OfficerKen Turcotte, Former DDI OfficerNathalie Gendron, Former Unit HeadMichael Braia, Data Coordinator; and Curtis Rafter; DDI coder
Presenter
Presentation Notes
Organizational changes – new Chief Statistician Anil Arora The DLI staff remains a dedicated team that supports the members of the DLI community. Renée Rocan, has been the Project Officer of the DLI program since 2007. Her duties include various administrative tasks and financial responsibility, such as yearly invoicing of the DLI membership. Ken Turcotte, the Metadata Coordinator for the Unit, has accepted a position at another department. Ken has been an asset to the team this past year and we wish Ken all the best in his further endeavors. The unit welcomes Farrah Sanjari as a Metadata Officer, who is responsible for updating the Public Use Microdata File (PUMF) and masterfile collection in the Nesstar application. Hired under the FSWEP program as DDI coders we have Curtis Rafter and Frances Ava Goerzen. Jennifer Scharf has accepted a new role within the Microdata Access Division. We thank her for her dedicated hard work over the year updating the content of Beyond 20/20 Web Data Server (WDS) and answering the questions on the listserv. We welcome Carla Ross into our ranks who will be assuming the responsibilities of loading tables to the WDS. Michael Braia joined the team in December 2015, and his role includes managing the collection and providing services to the community. Chantal Ripp has returned from leave in May and continues as the Unit Head for the program.
Statistics Canada • Statistique Canada5
Membership at 81* The DLI community welcomed new DLI members:
• Welcomed Trinity Western University, Langley BC Outreach to new and interested institutions continues to
be an important DLI activity in improving access to Canadian data resources and fostering a national data culture.
DLI UPDATE
DLI MEMBERSHIP
2016-12-07
Presenter
Presentation Notes
The Data Liberation Initiative (DLI) is a partnership between Statistics Canada and academic institutions across the country, and its objective is to improve access to data resources at Canadian postsecondary institutions. The total membership compromises 81 institutional members, with the 29 Canadian Association of Research Libraries (CARL), university member institutions, and 52 other institutions. Algoma University, Southern Alberta Institute of Technology (SAIT) and Sault College of Applied Arts and Technology are planning to terminate their DLI registration in April 2017.
Statistics Canada • Statistique Canada6
Atlantic: Peter Webster – Saint Mary’s University – co-chair Siobhan Hanratty – University of New-Brunswick
Quebec Nathalie Vachon– Institut national de la recherche scientifique
Gaston Quirion – Université Laval – co-chair Ontario
Vince Gray– Western University West
Gail Curry - University of Northern British Columbia Gilbert Bede - Okanagan College
Library Director Carol Shepstone – Mount Royal University
Statistics Canada George Sciadas - Centre for Special Business Projects David Price – Microdata Access Division Jodie-Anne Brzozowski – Microdata Access Division
DLI UPDATE – EAC membership
2016-12-07
Presenter
Presentation Notes
The External Advisory Committee is made up of DLI contacts from across the country representing both large and small institutions. The EAC meeting bi-annually to ensure that the DLI objectives are met. The DLI’s objectives are to promote a data use culture in Canada’s post secondary institutions and to ensure equitable and affordable access to Canadian public data to help support teaching and academic research. The EAC meets with data producers from within and outside Statistics Canada to discuss what products will be available as well as to give advice and encourage a more global dissemination of data. Gaston Quirion is the new co-chair of the DLI External Advisory Committee (EAC) in replacement of Sylvie Lafortune from Carleton University. Siobhan Hanratty from the University of New-Brunswick is replacing Donna Bourne-Tyson from Dalhousie University. Gail Curry from the University of Northern British Columbia is retiring. The DLI External Advisory Committee (EAC) continues to look for a replacement for Lisa Dillon, who represented the researcher’s perspective on the committee.
Statistics Canada • Statistique Canada7
Oct 12 and 13, 2016 DLI Update and bi-annual reports Working Groups:
WG1) External Funding Sources for DDI codingP. Webster and V. Gray
WG2) Reviewing principles and Strategic Plan of DLI.C. Ripp and D. Price
WG3) Working Group on Training Goals S. Hanratty
WG4) Review subscription pricing C. Shepstone
DLI UPDATE – Last EAC
External Advisory Committee
2016-12-07
Presenter
Presentation Notes
WG1) External Funding Sources for DDI coding�-Considering funding proposals through future CFI grant WG2) Reviewing principles and Strategic plan of DLI. In process of reviewing the Governance Documents; changes in Modus Operandi. The EAC will convene in person once a year in the fall, as opposed to twice annually. The DLI program at Statistics Canada carries out a set of regular operational activities in order to satisfy the principles that have been set forth. In order to keep improving the service from Statistics Canada to partnering institutions the DLI program undertakes projects to develop new services, data delivery systems or meet a specific need from the community. Review DLI Strategic plan, and set out the key activities and projects for 2017-2018 to 2019-2020 : Investigate opportunities to measure the timeliness in client service delivery Continue to enhance the search platform Committee focused projects of the Professional Development committee: Update the Survival Guide Define the basic level of data service skills of a DLI Contact WG3) Working Group on Training Goals Mandate The Mandate of the DLI-EAC Working Group on Training Goals is to review the current model of professional development opportunities offered to DLI Contacts and Alternates, with particular attention paid to user needs and delivery models as well as financial sustainability going forward. The Working Group will recommend to the EAC whether the current training model responds to the needs of the DLI community satisfactorily, whether a change in the financing of regional workshops might be in order, and/or whether the frequency of regional workshops may need further review. WG4) Review subscription pricing C. Shepstone Reviewing current pricing model and objective is to make a recommendation to the EAC on a new tiered pricing model. Next steps – client consultations
Statistics Canada • Statistique Canada8
DLI Messages on the dlilist by year DLI Reference questions by Month Participants from Postsecondary
The DLI listserv (dliIist) is a dedicated, online help desk for members seeking responses to statistics and/or data-related questions as well as a forum to discuss data issues and concerns. The Table highlights the use of the listserv by the DLI community of the dlilist. So far in 2016, the list has received over 900 posts. On average, the program responds to 40 reference related questions a month. The high number and the growing complexity of questions reflect the importance of this service.
Statistics Canada • Statistique Canada10
Participants from Postsecondary InstitutionsDLI UPDATE - Activities
2016-12-07
Region / Institution Participants
2012/2013 2013/2014 2014/2015 2015/2016 2016/2017
Atlantic Training 28 24 0 26 28
Quebec Training 15 23 0 27 24
Ontario Training 39 53 0 44
ACCOLEDS training 35 27 47 36
National Training Day 70
Boot Camp 54
TOTAL 117 127 171 133
Presenter
Presentation Notes
In 2016, training events included the DLI Quebec Regional Training session this past April, hosted at McGill University, Montreal QC, and the DLI Atlantic Regional training session in May, hosted at the Memorial University of Newfoundland, St. John’s NFL. The DLI unit works with the Regional Training Coordinators of the Education Committee, who are tasked with planning and developing a training curriculum for regional workshops. Table 3 Participants from Postsecondary Institutions presents the number of DLI contacts and their colleagues that participated in the DLI regional training sessions.
Statistics Canada • Statistique Canada11
DLI Webinars 2016DLI UPDATE - Activities
2016-12-07
The DLI Bootcamp webinar series featured:• DLI Product Line and Beyond20/20 Web Data Server (WDS), June 14,
2016• Deciphering the DLI EFT, July 19, 2016• Nesstar and the new feature, August 24, 2016• The EFT new structure, September 20, 2016• Nesstar and the new feature, November 23, 2016
Available on the DLI Training Repository
Next:• DLI Webinar on SPSD/M (December)• DLI Webinar on GSS Victimization (January)• DLI Webinar on Survival Guide (TBD)
Presenter
Presentation Notes
DLI launched a new series of webinars geared at reintroducing the core products and platforms to a growing DLI user base. After recently celebrating the 20th anniversary of the DLI, these new webinars are an interactive way of revisiting the DLI collection, NESSTAR, and WDS platforms for new contacts, as well as a valuable tool to help gather feedback as the program ventures into the future.
Files added to the collection since April 2016• Labour Force Survey (LFS) Monthly PUMFs• International Travel Survey (ITS) 2014 PUMF• General Social Survey (GSS) Cycle 28 PUMF• Travel Survey of Residents of Canada (TSRC) 2015 PUMF• Canadian Business Patterns (CBP) June 2016 Tables• Tuition and Living Accommodation Costs (TLAC) 2014-2015
Tables• Samples files from the Discharge Abstract Database (DAD)
2014-2015 Reference period• Social Policy Simulation Database and Model (SPSD/M)
v.22.2
DLI UPDATE - ActivitiesCollection growth
2016-12-07
Presenter
Presentation Notes
Since April 2016, the files added to the collection include: The DLI will be created a new page on the site that will detail the newly released files to the collection. In 2016/2017, the surveys added to Nesstar include: PUMFS: National Household Survey (NHS) 2011 – Hierarchical File National Household Survey (NHS) 2011 – Individuals File Survey of Labour and Income Dynamics (SLID) 2011 – Person File Travel Survey of Residents of Canada (TSRC) 2013 Canadian Community Health Survey (CCHS) 2013-2014 Labour Force Survey (LFS) 2014 – Rebased 2011 Census Labour Force Survey (LFS) 2015 Labour Force Survey (LFS) 2016 – January to August Public-Masterfiles: General Social Survey (GSS) 2013 – C27: Social Identity Employment Insurance Coverage Survey (EICS) 2013 Canadian Internet Use Survey 2012 Aboriginal Peoples Survey (APS) 2012
The DLI program also maintains a list of tentatively release dates of PUMFs on its website. The program is anticipating 10 surveys in the next year, including the 2015 Household and the Environment Survey (HES) PUMF, the 2015 Canadian Tobacco, Alcohol and Drugs Survey (CTADS) PUMF, and three cycles of the Canadian Income Survey (CIS) PUMFs.
Statistics Canada • Statistique Canada14
• Listserv and ETI• NESSTAR and new search• Licences & Amendments• DLI’s EFT repository consolidation
project• Survival Guide • DLI Update Newsletter• Support materials
DLI UPDATE – Priorities and Activities for FY16-17
2016-12-07
Statistics Canada • Statistique Canada15
DLI UPDATE – Priorities, Listserv and ETI
2016-12-07
• Email Transformation Initiative (ETI) is a Government of Canada project to replace the email systems of 43 federal organizations with one common system.
• Priority of our IT project team this past summer was to transition the DLI Listserv to the new service from the STC legacy emails
•We successfully transitioned in July with no service outage
•New list address: [email protected](redirect in place on old address)
Presenter
Presentation Notes
The provision of reference services to support access to the DLI data is seen as a key service delivery to the DLI clients. The dlilist has been available to the DLI membership for over 15 years. The Email Transformation Initiative (ETI) is a Government of Canada project to replace the email systems of 43 federal organizations with one common system. With Statistics Canada’s migration to the new email system in March 2016, the DLI and IT project team have worked diligently to transition the DLI Listserv to the GCMSS service from the STC legacy email system. Because of their tireless efforts, the DLI community had a smooth transition and did not whiteness a lack in service through the list.
The list home page:https://dli-idd-listserv.statcan.gc.ca/scripts/wa.exe?A0=DLILIST
Presenter
Presentation Notes
SETTING A PASSWORD If you do not already have a password for the DLILIST, we recommend that you set one now. If you have an existing password, it has been copied to the new server. The password can be used for email commands as well as for access to the web interface, which hosts the list archive. A LISTSERV password is linked to your subscribed address (Email which is subscribe to the DLILIST). If you do not know, please contact us at [email protected]. To set your password for this server, visit: https://dli-idd-listserv.statcan.gc.ca/scripts/wa.exe?GETPW1= WEB INTERFACE Subscription settings and preferences can be set using LISTSERV's web interface. Once you have set a password as explained above, you may log in and set your preferences at: https://dli-idd-listserv.gc.ca/scripts/wa.exe?SUBED1=DLILIST. Please note that the URL has to be https:// Attempts to access the listserv site via http:// will time out. LIST ARCHIVE Contributions sent to this list are automatically archived. You can access the list archives at: https://dli-idd-listserv.statcan.gc.ca/scripts/wa.exe?A0=DLILIST
•Migrated to internal Win2012 server and Nesstar server v.4.0.8.2
•Enhance the robustness of the tool and reduce wait times experienced by clients
•Priorities moving forward: addressing coding gap and more detailed metrics
Presenter
Presentation Notes
Nesstar is a web-based exploration, extraction and analysis tool for social science data. It provides access to the Data Liberation Initiative’s (DLI) collection of public use microdata files (PUMFs). Nesstar allows authorized users to access both PUMFs and metadata for master files. Through consultation with the DLI community, Internet Protocol (IP) recognition of Nesstar was identified as a priority for ensuring equitable access to PUMFs for all partner institutions. Previously, only DLI contacts were able to access the data files on the site with a user name and password. The DLI program has installed the most recently released version of the Nesstar server, v.4.0.8.2, on a newly acquired Windows 2012 server. The DLI launched the new version of Nesstar in February 2016. This new version includes IP authentication for all authorized users and is now available to all clients. Users can authenticate access via their institution’s proxy site if they are off site. The program is also investigating third-party applications for usage metrics within Nesstar. Due to the nature of the DDI coding process for files uploaded to Nesstar, there is a delay in the currency of files, in comparison to files uploaded to the EFT.
The DLI microdata search, which allows users to search surveys and statistical products or perform a search at the variable level, is accessible via the DLI Collection page or links are located in the top banner in Nesstar. Links are available in both official languages. The purpose of creating an enhanced search was to remedy the limited search capacity of the NESSTAR WebView platform through enhancing data discoverability. The DLI Microdata Search Engine was released as a beta platform in February 2016. Since the beta launch, the DLI team has performed client consultations though Regional Training and webinar sessions in order obtain client satisfaction feedback regarding performance and user experience. Over the course of the summer, the DLI Team has worked with IT to address search functionality. Modifications to the search application include addressing issues regarding French characters search results; modifying indexing for variable search results; and corrected search operator functionality (truncation and Boolean operators).
Statistics Canada • Statistique Canada19
DLI UPDATE – PrioritiesSearch application
2016-12-07
Next steps…• The implementation of check boxes next to menu
fields to improve data research needs• Text highlighting• Improving navigational features for the menu filters• Improving search consistency across both languages• Adding the survey name as a field for the variable
search• Tags to filter catalogues in order to differentiate
between microdata for PUMFs and metadata for masterfiles.
The estimated timeline for the completion of these enhancements is Spring 2017
Proposed enhancements for the DLI Microdata Search: The implementation of check boxes next to menu fields to improve data research needs Providing hits by highlighting for keywords used in the search criteria Improving navigational features for the menu filters. Improving search consistency across both languages. Adding the survey name as a field for the variable search. Tags to filter catalogues in order to differentiate between microdata for PUMFs and metadata for masterfiles. If there are any more comments or feedback regarding the content of this presentation, please contact the Data Liberation Initiative at our e-mail address listed on the screen
• Social Policy Simulation Database and Model (SPSD/M) is now available through the DLI.
•The SPSD/M will be governed under a separate licence agreement as an appendix to the DLI Licence Agreement
•Access to the product will be granted under a new safe in the DLI’s electronic file transfer (EFT) safe.
Presenter
Presentation Notes
The DLI is pleased to announce that the Social Policy Simulation Database and Model (SPSD/M) is now available through the DLI. Membership in the DLI is subject to the DLI Licence Agreement. The SPSD/M will be governed under a separate licence agreement as an appendix to the DLI Licence Agreement. If interested in obtaining access to this product, we ask that the Library Director (or the authorized signatory) sign the associated agreement and return a copy to the DLI. The DLI will accept a scanned signed (electronic) version of the licence agreement for the SPSD/M. The signature on the licence agreement has to be a real client signature not a digital signature.
Statistics Canada • Statistique Canada21
DLI UPDATE –EFT repository consolidation project
2016-12-07
• August 2016 saw the DLI team consolidate and restructure the EFT collection
• consolidate both the metadata documentation and the staged datasets, reducing the space required
• New shared common path / File hierarchy structure between both languages
Outcomes:• Reduced space required to manage collection• Reduced risks of administrative errors• Best practices naming convention
Presenter
Presentation Notes
August 2016 saw the DLI team consolidate and restructure the EFT collection. At the beginning of 2015 it was identified that the EFT Safe – MAD_DLI had been designated a maximum size of 175 GBs of which over 90% of the total allocated space was used. With the DLI collection anticipated to continue to grow, the space capacity was recognized as a high risk and required the attention of the DLI Team. The objective of this strategy was to successfully reduce the necessary space required to accommodate the DLI holdings on the various safes on the EFT. This also allowed for an opportunity to perform high-level quality assurance of the holdings in reviewing the data and documentation. Through consultations with the EFT team, and the DLI community, a plan was devised to consolidate PUMF’s from both official languages to a single bilingual safe and organizing surveys by their record number. In addition, other non-standard data products and materials (eg: DLI reports, CD-ROM products, data tables) were consolidated as well to reduce duplication. The DLI team is pleased to share the new consolidated EFT safes for the DLI collection. In addition to saving more than 30 GB of server space, the DLI team has streamlined a previously time consuming upload process into an efficient single location upload. This will help eliminate any inconstancies that existed between the old safes, and ensure users can access data in either language of their choice. The changes and the contents of the safes are noted below: August 2016 saw the DLI team consolidate and restructure the EFT collection consolidate both the metadata documentation and the staged datasets, reducing the space required New shared common path / File hierarchy structure between both languages
Statistics Canada • Statistique Canada22
DLI UPDATE –EFT repository consolidation project
2016-12-07
• New safes are in accordance with the DLI Licensing agreement
Pre review Post review exercise Contents
MAD_DLI MAD_DLI_IDD_DAM DLI Reports, CD-Rom products, data tables, geo files
MAD_PUMF MAD_PUMF_FMGD_DAM PUMF’s and metadata
MAD_DLI_CIHI MAD_CIHI_ICIS_DAM CIHI Files
MAD_DLI_PCCF MAD_PCCF_FCCP_DAM PCCF Files
MAD_SPSDM_BDMSPS_DAM
The Social Policy Simulation Database and Model (SPSD/M) product
Statistics Canada • Statistique Canada23
EFT Directory: MAD_PUMF_FMGD_DAM
2016-12-07
Presenter
Presentation Notes
Created a Read me file at the Root folder to identify survey title/Product and its acronym. Once inside a survey folder, the breakdown would be the same as before. If a survey has been collected more than once, each year is usually contained in a separate subdirectory. The secondary level in the survey breaks down the information based on data (data) and documentation (doc). The readme file for the survey is also found at this level. The data folder provides a zipped file with the data. The data can take the form of microdata in ascii format, excel spreadsheets or databases. The documentation folder includes the metadata that is the information necessary to interpret and understand the microdata. With respect to the GSS folders, please consult the EFT key entitled Readme-Key_Lisezmoi-clé.xls. We made special note of the GSS cycles (far right column indicates associated cycles) Example: GSS Cycle 27 SI (SDDS 5024) GSS Cycle 27 GVP (SDDS 4430) Nothing else has changed (top hierarchy) Organized by SDDS and languages united.
Statistics Canada • Statistique Canada24
DLI UPDATE –Survival Guide
2016-12-07
• In collaboration with the Professional Development Committee, the DLI is currently working on updating and enhancing the content of the Survival Guide
Section outline*:• Section 1: About the DLI• Section 2: Administration• Section 3: Role of DLI Contact• Section 4: Resources• Section 5: Data Concepts• Section 6: Working with Data• Section 7: Accessing and Citing DLI Data• Section 8: FAQ• Section 9: Glossary*May change before publication
• The release of the guide is anticipated for early winter 2016.
Presenter
Presentation Notes
In collaboration with the Professional Development Committee, the DLI is currently working on updating and enhancing the content of the Survival Guide, the comprehensive reference documentation for DLI contacts. The release of the guide is anticipated for early winter 2016.
Statistics Canada • Statistique Canada25
DLI UPDATE – DLI Update Newsletter
2016-12-07
• Published since 1997• For the community by the community à• Also serves to educate participants by providing useful tips to
help with DLI activities
The success of this newsletter depends on participation from DLI contacts and those involved with the project.
Seeking interested members to form a subcommittee responsible for development content
About the DLI Update Published since 1997, the purpose of this newsletter is to build community among DLI contacts by informing them about one another and the project. It also serves to educate participants by providing useful tips to help with DLI activities. The success of this newsletter depends on participation from DLI contacts and those involved with the project.
Support materials:•Posters•Pocket folders•Other items – such as pens, pads,
Presenter
Presentation Notes
The DLI is running low on support materials, opportunity to review Also, the 20th anniversary of the DLI is approaching, how best to communicate the successes and changes over the years?
Statistics Canada • Statistique Canada27
Improving access to major Canadian datasets
• by updating our technical infrastructure.
Continue to enhance our data service delivery.
Continue to support our clients.
DLI UPDATE - Priorities
2016-12-07
Presenter
Presentation Notes
With an increasing emphasis on effective supports for both teaching and research, we are pleased to be able to add the collaborative efforts of the DLI to those of the Canadian academic community.