data management seminar 05 th october 2011 gwenlian stifin & aude espinasse south east wales...

Post on 28-Mar-2015

217 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Data Management seminar

05th October 2011

Gwenlian Stifin & Aude EspinasseSouth East Wales Trials Unit, Cardiff

University

DATA MANAGEMENT

OVERVIEWAim of the session:• General understanding of the principles underpinning data management for clinical studies.• Overview of the data cycle in a clinical study.• Overview of data management procedures.

BACKGROUNDREGULATORY FRAMEWORK

Good clinical practice is an international ethical and scientific quality standard for the design, conduct and record of research involving humans.

GCP is composed of 13 core principles, of which the following 2 applies specifically to data.

BACKGROUNDGCP – CORE PRINCIPLES FOR DATA

• The confidentiality of records that could identify subjects should be protected, respecting the privacy and confidentiality rules in accordance with the applicable regulatory requirement(s).

• All clinical trial information should be recorded, handled, and stored in a way that allows its accurate reporting, interpretation and verification.

DATA SEQUENCE

• A case report form (CRF) is a printed or electronic form used in a trial to record information about the participant as identified by the study protocol.

• CRFs allow us to:– record data in a manner that is both efficient

and accurate.– Record data in a manner that is suitable for

processing, analysis and reporting.

WHAT IS A CRF?

Designing CRFs, key questions:• What data is required to be collected?

– Only data we specified in the proposal/protocol.– Only data required to answer the study question.

• When will this data be collected?– Baseline / follow-up .

• What Forms will need to be designed.

• Who is going to collect/complete this form.

• Are there validated instruments available?

• How is the data going to be analysed.

KEY QUESTIONS

DATA SEQUENCE

• Metadata is structured data to organise and describe the data being collected.• It is centralized data management. • It is a tool to control and maintain data entities:

– Content and variable definitions– Validation rules

• Metadata consistently and effectively describes data and reduces the probability of the introduction of errors in the data framework by defining the content and structure of the target data.

WHAT IS METADATA?

Metadata File

Name of Trial/Study: PAAD (Probiotics for Antibiotic Associated Diarrhoea) - stage 1

Metadata Author: H S Number of Data Collection Forms for Trial/Study: 10

Name of File (Corresponding Data Collection Form): Recruitment CRF 02

Form

Variable

Variable Label Data Type

Format

Length LinkedSkip Validation

ValidationTitle

Name Value MissingCondition Type

Labels Codes

Recruitment CRF 02

datecons date of consent date dd/mm/yyyy 10 range

warn if <01.11.2010 > 01.06.2012

sugender service user gender category 1 = Male, 2 =

Female 1

consss1 consent for SS1 category 0 = no; 1 = yes 1

CRF AND DATABASE DESIGN

• Study outcomes in protocol define what questions are asked in the CRF.

• Use of validated scales and questionnaires.

•User-friendliness and ease of completion important.

• Database is built to receive data extracted from the CRFs.

• Database needs to include querying and reporting tools.

•Data needs to be coded into numbers to facilitate statistical analysis.

DATABASE DESIGN

Database allows for adequate storage of study data and for accurate reporting, interpretation and verification of the data.

2 database systems tend to co-exist alongside one another: • Study management database: personal information, recruitment, data completeness (CRF receipts) follow-up triggers… • Clinical database: clinical information (study outcomes).

DATABASE DESIGN

Functionalities to consider in both types of database:

• Validation rules (Ranges, skips, inconsistencies…).• Queries / report.• Audit trail.

Check Ranges, Skips, inconsistencies, missing data i.e.

what is on your metadata is exactly what is applied when entering the data on the form

Check output file for data export (for clinical database) Variable names match up/are all there Coding of categories correct Numbers when alpha required What is on the form is transferred exactly into

CSV / SPSS

TEST/VALIDATE THE DATABASE

DATA SEQUENCE

DATA COLLECTION

• Validity of data collection must be ensured.

• Source data is identified and data transcribed correctly onto data collection system.

• Process of data collection/transcription is audited throughout the process (monitoring – Source data verification).

DATA COLLECTION

TESTING• After set-up, test or pilot the system before you

use it.• Maintain an adequate record of this procedure.

• Before starting data collection– Testing– SOP and PRA– Training

• During data collection– Audit

DATA COLLECTIONSOP and PRA

• Good idea to write a Standard Operating Procedure or a working practice document detailing how you set up your electronic data capture systems.

• The appropriate persons need to be trained in these.• Need to write a Privacy Risk Assessment, this

document includes:– Personal data items held in study e.g. name, DOB– Individuals who are granted access to this data– Procedures for colleting, storing, and sharing personal

data– How personal data will be anonomised– Identifying possible breaches of confidentiality and

how these can be reduced

DATA COLLECTIONTRAINING

• After piloting, when it is working as it should, next step is to train all users of the system

• A record should be kept of the training

• A detailed diagram and description of how data will be collected should be provided at training.

Woman identified and agrees to beapproached

Assessed for eligibility and consented

Baseline data (CAPI)

Randomisation

Intervention Control

34 - 36 weeks gestation (CATI)

Birth (CRF)

6 month post partum (CATI)

1 year post partum (CATI)

2 years post partum (CAPI)

Birth

Routine

antenatal care

FNP visits &

usual services

Usual

services 18 month post partum (CATI)

FNP visits

& routine

antenatal care

Key CAPI: Computer Assisted Personal Interview

CATI: Computer Assisted Telephone Interview

Data collection

Participant flowchart

Participant progress

DATA COLLECTIONAUDIT

• Maintain an audit trail of data changes made in the system.

• Procedure in place for when a study participant or other operator capturing data, realises that he / she has made a mistake and wants to correct data.

• Important that original entries are visible or accessible to ensure the changes are traceable.

ELECTRONIC DATA COLLECTIONWHAT IS THIS?

• PC• Laptops• mobile devices• audio• visual• email transmission• web-based systems

Variety of software and hardware nowbeing used to collect data:

ELECTRONIC DATA COLLECTIONWHAT IS THIS?

• Some of the fundamental issues we have discussed are common to all modes of electronic data collection as well as data collection on paper.

• IMPORTANT: There should be no loss of quality when an electronic system is in place of a paper system.

ELECTRONIC DATA COLLECTIONSPECIFIC TRAINING ISSUES

• Training on the importance of security; including the need to protect passwords, as well as enforcement of security systems and processes.

• System user should confirm that he / she accepts responsibility for data entered using their password.

• Maintain a list of individuals who are authorised to access data capture system and add to PRA.

• Ensure that the system can record which user is logged in and when. Timely removal of access no longer required, or no longer permitted.

DATA ENTRY

• Different types of data entry exist, (manual /optical mark recognition system, online/offline, etc…).

• Type of data can also influence the method of data entry (numerical, free text, images etc…).

• It is important to have documented procedures (SOPs) defining who is performing data entry and how it is performed.

• Data entry procedures should be tested at the earlier design stage, and testing adequately documented before sign-off.

•Adequate training on these procedures should be provided.

•Appropriate quality control procedures have to be set up.

DATA ENTRY

ELECTRONIC DATA ENTRY

• Electronic entry does not usually have to be a separate ‘data entry phase’, normally entered during collection straight onto an electronic CRF.

• Data can be entered straight onto a website, or can be entered onto a laptop and uploaded using the internet onto a server.

• When designing forms to collect data electronically you can include ‘validation rules’. An electronic system can stop the Researcher from proceeding with data collection if they break a validation rule.

AFTER DATA COLLECTION• Regular backups should be made of your data, if

outsourcing data collection or storage ensure that the company have backup systems in place.

• After trial has finished using data capture systems, you may need to dispose of these or send them to another company e.g. if they are loaned. Before doing this, you may need to professionally erase the hard drive as it may still contain participant information.

• May need to archive whatever data you collect, includes both hard copy and electronic data, documents not archived need to be disposed of securely.

COLLECTING DATA SAFELY

• The safe collection of data in clinical trials is essential for compliance with Good Clinical Practice (CPMP/ICH/GCP/135/95) and the Data Protection Act 1998.

• Because of increased use of information technology in the collection of trial data there is a need to have clear guidance on how to safely collect data in this manner.

• Need to protect your data capture systems from loss or unauthorised access, at the same time ensuring that it is accessible to those who need it.

COLLECTING DATA SAFELY………CONTINUED

• Need to protect participants’ identity by using Participant Identifiers (PID). PID’s should be used when communicating with other trial team members.

• Electronic info particularly vulnerable to security threats: – can be physically accessed.– could be loss or damage to computer.– can be remotely accessed through internet or

virus. • For each tool that you use to collect data, must

ensure that system is password protected and encrypted.

DATA SEQUENCE

DATA CLEANING

• Errors / inconsistencies / missing data spotted at different time points depending on the study and methods used.• Errors should be corrected where possible, but no changes should be made without proper justification.• Appropriate audit trails should be kept to document changes in the data (queries form, SPSS syntax…).

DATA CLEANING

Yes No

Data validated

REPORTING DATA

• Throughout the course of the study it is usually the responsibility of the Data Manager to report on study progress, these kinds of reports include:

• Recruitment progress• Follow-up rates• SAEs• Data completeness• Withdrawals

Thank you!

Any questions?

top related