a first look at san francisco’s new etl job platform

27
A First Look at San Francisco’s New ETL Job Platform Samuel Valdez Janine Heiser

Upload: safe-software

Post on 10-Feb-2017

281 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: A First Look at San Francisco’s New ETL Job Platform

A First Look at San Francisco’s New ETL Job Platform

Samuel ValdezJanine Heiser

Page 2: A First Look at San Francisco’s New ETL Job Platform

Agenda

• About Us• Some Context• A Very Brief History• The ETL Job Platform• Workflow• Looking Ahead• Questions

Page 3: A First Look at San Francisco’s New ETL Job Platform

About Us

City and County of San Francisco

Department of Technologyhttp://sfgov.org/dt/

Samuel ValdezEnterprise GIS

EngineerSan Francisco Enterprise

GIS Program (SFGIS)

Janine Heiser

Open Data Engineer

DataSF

Page 4: A First Look at San Francisco’s New ETL Job Platform

Some Context

Page 5: A First Look at San Francisco’s New ETL Job Platform

DataSF (open data program)Our mission is to empower use of the City’s data. Our vision is that the City’s data is understood, documented, and of high quality.

(With some help from the Department of Technology.)

Learn more at http://datasf.org/

Page 6: A First Look at San Francisco’s New ETL Job Platform

Automated Data Publishing“... goal of increasing the number and timeliness of datasets on [SF OpenData]...”

Page 7: A First Look at San Francisco’s New ETL Job Platform

SF OpenData (Socrata)

Page 8: A First Look at San Francisco’s New ETL Job Platform

A Very Brief History

Page 9: A First Look at San Francisco’s New ETL Job Platform

Why is SFGIS involved?• Well-established legacy• Traditionally shared (spatial) data• Most open-data were spatial• Enterprise perspective and relationships• Technical skills, tools, and experience

Page 10: A First Look at San Francisco’s New ETL Job Platform

Generation 0• Predecessor platform• FME Server-based• Deployed around 2011 to support the EAS• Several key ETL jobs• Project champion• Organic evolution• Learning experience

Page 11: A First Look at San Francisco’s New ETL Job Platform

The ETL Job Platform

Page 12: A First Look at San Francisco’s New ETL Job Platform

Platform Architecture I

Page 13: A First Look at San Francisco’s New ETL Job Platform

“ETL Job” Services

Page 14: A First Look at San Francisco’s New ETL Job Platform

Platform Architecture II

Page 15: A First Look at San Francisco’s New ETL Job Platform

Platform Architecture III

Page 16: A First Look at San Francisco’s New ETL Job Platform

Workflow

Page 17: A First Look at San Francisco’s New ETL Job Platform

“Requirements”• Few workspace authors• Need a safe place to try out workspaces• A few hundred ETL jobs (100-500?)• “Simple” ETL jobs• Use best-practices• And more

Page 18: A First Look at San Francisco’s New ETL Job Platform

Workflow I

Page 19: A First Look at San Francisco’s New ETL Job Platform

Workflow II

Page 20: A First Look at San Francisco’s New ETL Job Platform

Workflow III

Page 21: A First Look at San Francisco’s New ETL Job Platform

Looking Ahead

Page 22: A First Look at San Francisco’s New ETL Job Platform

Looking Ahead x 2

Page 23: A First Look at San Francisco’s New ETL Job Platform

Possible future talks• Version control• Supporting services, including geo• Job scheduling• Business continuity• Operations and maintenance• And more

Page 24: A First Look at San Francisco’s New ETL Job Platform

Possible platform evolution• Scale out• More value-added services• Support streaming data• Third-party scheduler (not FME-centric)• Asynchronous ETL job execution• And more?

Page 25: A First Look at San Francisco’s New ETL Job Platform

Questions?

Page 26: A First Look at San Francisco’s New ETL Job Platform

Thank you!

Jeff Johnson – SFGIS & OD ManagerJoy Bonaguro – Chief Data OfficerMiguel Gamiño – City CIOAaron Koning – FME Server Product Mgr

Page 27: A First Look at San Francisco’s New ETL Job Platform

Thank you!

[email protected]@sfgov.org

http://datasf.org/http://sfgov.org/dt/