china scientific data sharing project international workshop on strategies for preservation of and...

22
China Scientific Data Sharing Project China Scientific Data Sharing Project national Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2 national Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2 Xian-En ZHANG Xian-En ZHANG Working Group, China-Scientific Data Sharing Projec Working Group, China-Scientific Data Sharing Projec t t Basic Research Department, Ministry of Science & Te Basic Research Department, Ministry of Science & Te chnology chnology

Upload: neal-smith

Post on 30-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

China Scientific Data Sharing ProjectChina Scientific Data Sharing Project

International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, BeijingInternational Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Xian-En ZHANGXian-En ZHANG

Working Group, China-Scientific Data Sharing ProjectWorking Group, China-Scientific Data Sharing ProjectBasic Research Department, Ministry of Science & TechnologyBasic Research Department, Ministry of Science & Technology

Page 2: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

•General Considerations and Objectives General Considerations and Objectives

•Framework and ArchitectureFramework and Architecture

•Major TasksMajor Tasks

•Program Work PlanProgram Work Plan

•Current Status and ProgressCurrent Status and Progress

•China-SDSPChina-SDSP

Page 3: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

• China-SDSPChina-SDSP should be developed under comprehensive should be developed under comprehensive planning on the national level. planning on the national level.

• It should collect and re-organize all possible data from It should collect and re-organize all possible data from government agencies, institutes, programs, and government agencies, institutes, programs, and individual investigators while making full use of individual investigators while making full use of international scientific data resources through international scientific data resources through cooperation.cooperation.

• China-SDSP should make all these data accessible to all China-SDSP should make all these data accessible to all interested users at an affordable cost, or free if possible.interested users at an affordable cost, or free if possible.

• • China-SDSP is to form a multi-tiled, distributed scientific China-SDSP is to form a multi-tiled, distributed scientific

data sharing system that bridges the gaps between data sharing system that bridges the gaps between different agencies, institutes, and geographical regions.different agencies, institutes, and geographical regions.

Page 4: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

2020 Goals:2020 Goals:

• To form a scientific data management and sharing To form a scientific data management and sharing system that is more user-friendly; system that is more user-friendly;

• To develop a set of supportive laws, policies, and To develop a set of supportive laws, policies, and standards; standards;

• To form a professional service group by establishing To form a professional service group by establishing a career reward mechanism. a career reward mechanism.

• Eighty percent of scientific data funded by the Eighty percent of scientific data funded by the government will be made available to general public.government will be made available to general public.

Page 5: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Framework and ArchitectureFramework and Architecture

1. Logical Framework of CSDSP1. Logical Framework of CSDSPCSDSP is a three-tiled system: master databases, scientific data CSDSP is a three-tiled system: master databases, scientific data centers or networks, and Gateway Web sitecenters or networks, and Gateway Web site

2. Scope of Data Sharing Supported by China-SDSP2. Scope of Data Sharing Supported by China-SDSPChina-SDSP also functions as a catalyst. Its original purpose is to China-SDSP also functions as a catalyst. Its original purpose is to integrate publicly funded data resources, but its long-term goal is to integrate publicly funded data resources, but its long-term goal is to leverage all possible data resources from government to the private leverage all possible data resources from government to the private sectors, and make them available to the general public.sectors, and make them available to the general public.

3. Service Architecture of China-SDSP3. Service Architecture of China-SDSPChina-SDSP may provide services in various ways: facilitating the China-SDSP may provide services in various ways: facilitating the consistent management of distributed databases; providing a content consistent management of distributed databases; providing a content service and data service, as well as other services mentioned.service and data service, as well as other services mentioned.

Page 6: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Gateway to China Scientific Data Sharing Program

Natural Scienceand Environment

Agriculture

Populationand Health

Basic and Frontier Sciences

Engineering andTechnology

Regional Development

Meteorological Scientific Data Center

Rural Development Sci Data Center

Agricultural Scientific Data Center

Basic Medicine Scientific Data Center

Rural Development Sci Data Center

Population Control Sci Data Center

Earth System Scientific Data Center

Space Environment Sci Data Center

………………………………………………

………………………………………………

About 300 Master Databases

In 40 Data Cent

ers

Disciplines Disciplines Data Center / NetworksData Center / Networks Master Database Master Database

Data Users

Architecture and Framework of China SDSP

Page 7: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Scientific Data SharingScientific Data Sharing

Submission from Submission from Agencies and Agencies and

InstitutesInstitutes

Exchange Data Exchange Data with other with other CountriesCountries

Submission from Submission from Major National Major National

ProgramProgram

Data Data DisseminationDissemination

Data Data Integration / Integration / SubmissionSubmission

Data Data GeneratorGenerator

Scientific Research & Scientific Research & Technology Development Technology Development SectorSector

Observation, MonitoringObservation, MonitoringSurvey and EvaluationSurvey and EvaluationStatistics SectorStatistics Sector

Scope of Scientific Data Sharing ProjectScope of Scientific Data Sharing Project

Page 8: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Extented Service

SubmittingArchivingUpdating

Classes of Scientific Data Service

DATA Management

CONTENTService

DATA Service

SearchingBrowering

SearchingBroweringDownloading

Data MiningSubject ServingForum…. …….

Fig 3. Service Functionality of China Scientific Data Sharing ProgramFig 3. Service Functionality of China Scientific Data Sharing Program

Page 9: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

1. Architectural Development of Data Management and Sharing System

2. Resource Development for Scientific Data

3. Standardization

4. Law and Policy

Major Tasks of China SDSPMajor Tasks of China SDSP

Page 10: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

1 Gateway Site1 Gateway Site40 Data Centers /Networks40 Data Centers /Networks300 Master Databases300 Master Databases

Architectural Development of Data Management and Sharing System

Page 11: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Resource Development for Scientific Data

The major tasks are to re-edify existing data resourcesThe major tasks are to re-edify existing data resources ;;safeguard endangered scientific data and records; devsafeguard endangered scientific data and records; develop the master database for large research programs elop the master database for large research programs funded by the government; introduce international data funded by the government; introduce international data resources based on their scientific values, quality, and resources based on their scientific values, quality, and usabilityusability ;; integrate multi-source data; and conduct vaintegrate multi-source data; and conduct value-added research.lue-added research.

Page 12: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Standardization is the prerequisite for scientific data sharing in the Standardization is the prerequisite for scientific data sharing in the digital era. digital era.

There are two kinds of standards: platform technical standards and There are two kinds of standards: platform technical standards and data sharing standards. The former is based on data platforms, and data sharing standards. The former is based on data platforms, and the latter is based on the scientific data sharing framework. the latter is based on the scientific data sharing framework.

The basic and common data sharing standard will be considered first. The basic and common data sharing standard will be considered first. The data standard in major application areas will also be on the list of The data standard in major application areas will also be on the list of priorities. priorities.

Standardization

Page 13: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Specifically, the following should be conducted first:Specifically, the following should be conducted first:

Policy: Establishment and implementation ofPolicy: Establishment and implementation of • Implementation Guidelines of Scientific Data Sharing Program,Implementation Guidelines of Scientific Data Sharing Program,• Data Submission Guidelines of Major Science and Technology Program Data Submission Guidelines of Major Science and Technology Program Funded by GovernmentFunded by Government• Guideline of Scientific and Technological Data Classification for Data Guideline of Scientific and Technological Data Classification for Data Sharing Sharing

• Management Guidelines of China Scientific Data Sharing Program Management Guidelines of China Scientific Data Sharing Program • Performance Evaluation (Merit Appraisal) of Scientific Data SharingPerformance Evaluation (Merit Appraisal) of Scientific Data Sharing

Law: Legislation and Amendment ofLaw: Legislation and Amendment of• Science and Technology Advancement ActScience and Technology Advancement Act• Copy Right Act Copy Right Act • National Security ActNational Security Act

OthersOthers• Be proactively involved in the on-going legislation of “Policy on Access Be proactively involved in the on-going legislation of “Policy on Access toto

Government Information”.Government Information”.• Promote the issuing of “Policy on National Scientific and Technological Promote the issuing of “Policy on National Scientific and Technological Resources Sharing”.Resources Sharing”.

Law and Policy

Page 14: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Experimental period: 2001-2005• Overall planning and design;• Legislation planning: start research on law and policy framework;• Making and issuing relevant policy and regulation;• Technology and standards;• Establishing data centers (networks) and kicking off the data

sharing pilot project;• Identifying the optical mechanism for existing data consolidation

and sharing;• Launching of program gateway: select 25 data centers for data

sharing pilot project, select other candidate centers for further development;

• Sum up experiences from various aspects of the experimental period, and prepare a feasibility report to facilitate the overall implementation of public good data sharing in next period.

Working PlanWorking Plan

Page 15: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Overall Implementation Period: 2006-2010

• Continue the establishment of data sharing technology, policy and law;

• Extend the program coverage of scientific data centers or networks and make them operational;

• Gradually improve technology and standards • Enforce the cooperation among data centers in different research

area;• Enhance the capacity to develop high-level data product and

quality;

Working PlanWorking Plan

Page 16: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

After each yearly performance evaluation of the 25 pilot After each yearly performance evaluation of the 25 pilot data centers or networks, the qualified ones will be data centers or networks, the qualified ones will be included in the “National Scientific Data Master Network” included in the “National Scientific Data Master Network” and will start regular operation; the amount invested in and will start regular operation; the amount invested in each center depends on their merits and performance. each center depends on their merits and performance. Another 15-20 data centers will be built, including 200 new Another 15-20 data centers will be built, including 200 new master databases.master databases.

By 2010, a mechanism is going to be established, By 2010, a mechanism is going to be established, through which data are submitted from various through which data are submitted from various governmental agencies and programs and delivered to governmental agencies and programs and delivered to potential users efficiently.potential users efficiently.

Page 17: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Current Status and Progress of China SDSPCurrent Status and Progress of China SDSP

• General Planning and Design (Draft) General Planning and Design (Draft) FinishedFinished

• Pilot Projects for Data SharingPilot Projects for Data Sharing• Law, Policy, and StandardLaw, Policy, and Standard

Page 18: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

In June 2003In June 2003 ,, a Coordinating Group and a Scientific a Coordinating Group and a Scientific Group were established for scientific data sharing. The mGroup were established for scientific data sharing. The main task of these groups was to develop the “Planning of ain task of these groups was to develop the “Planning of China Scientific Data Sharing Program” (China-SDSP) by China Scientific Data Sharing Program” (China-SDSP) by May 2004.May 2004.

There are six major components to China-SDSP: curreThere are six major components to China-SDSP: current status and major national requirement; overall considernt status and major national requirement; overall considerations; principle and objectives; strategic arrangement anations; principle and objectives; strategic arrangement and tasks; implementation and measurements; supporting cd tasks; implementation and measurements; supporting conditions and facilitiesonditions and facilities 。。

General Planning and Design (Draft) Finished

Page 19: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

In 2001, the meterological data sharing project was launched, which hIn 2001, the meterological data sharing project was launched, which heralded the start of the scientific data sharing program in China.eralded the start of the scientific data sharing program in China.

By the end of 2002By the end of 2002 , , another 5 data centers and 3 networks had joinanother 5 data centers and 3 networks had joined the pilot project:ed the pilot project:

Pilot Projects for Data Sharing

1.1.Survey data centerSurvey data center2.2.Hydrolgy and Water Resources data centerHydrolgy and Water Resources data center3.3.Seismetic data centerSeismetic data center4.4.Forestry data centerForestry data center5.5.Agriculture data centerAgriculture data center6.6.Earth System Science data center networkEarth System Science data center network7.7.Modern Agricultural Technology and Rural Development networkModern Agricultural Technology and Rural Development network8.8.Sustainable Development networkSustainable Development network

Page 20: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

Law, Policy, and StandardLaw, Policy, and Standard

In terms of policy-making, a working group for data sharing has been established and investigated the current status and trend of data policy both home and abroad, compiled relevant materials and information ;

Established the ”Guidelines of Data Submission from Major National Programs” and its interpretation ; began researching the framework of relevant law and policy; and finished the conceptual design for data classification for sharing.

Page 21: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

In general, China Scientific Data Sharing Program is still in In general, China Scientific Data Sharing Program is still in the phase of overall planning, accumulating the experiences the phase of overall planning, accumulating the experiences of technology and policy making, as well as overseeing pilot of technology and policy making, as well as overseeing pilot data sharing projects. data sharing projects.

Page 22: China Scientific Data Sharing Project International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing

AcknowledgementAcknowledgement

Many who involved the projectMany who involved the project

SUN ShuSUN ShuHUANG DingchengHUANG DingchengSUN JiuLinSUN JiuLinLUI ChuangLUI ChuangXIAO YunXIAO YunYIN LingYIN LingCHEN JunCHEN Jun

TENG MianzhenTENG MianzhenZHOU Wenneng ZHOU Wenneng

Paul Uhlir JDPaul Uhlir JDPeter Weiss JDPeter Weiss JD