high-throughput crystallography at monash noel faux dept of biochemistry and molecular biology...
TRANSCRIPT
![Page 1: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/1.jpg)
High-Throughput Crystallography at Monash
Noel Faux
Dept of Biochemistry
and Molecular Biology
Monash University
![Page 2: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/2.jpg)
Structural Biology Pipe Line
QuickTime™ and aTIFF (Uncompressed) decompressor
are needed to see this picture.
CloningExpression
PurificationCrystallisation
X-ray diffraction Determine the structure
High throughput robots and technologies:Tecan Freedom EvolutionÄKTAxpress™Trialing crystal storage and imaging facilities
Australian synchrotrononline in 2007
Data processingand structural determination: major bottle neck
Target tracking / LIMSData
ManagementPhasing (CCP4/CNS
GRID computing)
![Page 3: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/3.jpg)
The problems• Target-tracking/Data management• The process of protein structure determination creates a large volume of data.• Storage, security, traceability, management and backup of files is ad-hoc.• Remote access of the files is limited and requires different media formats.
• Structure determination• CPU intensive
![Page 4: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/4.jpg)
• Part of a National Project for the
development of eResearch platforms for the
management and analysis of data for
research groups in Australia.
• Aim: establish common standardised software / middleware applications that are adaptable to many research capabilities
![Page 5: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/5.jpg)
Solution
• Central repository of files• Attach metadata to the files• World wide secure access to the files• Automated collection and annotation of
the files from in-house and synchrotron detectors
![Page 6: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/6.jpg)
The infrastructure
Instrume
nt R
epK
epler
Crystal Temp
Lab Temp
X-ray image
Mounted crystal Streaming Video (SV)
Lab SV
Lab Still Pics
Sensor Data
Storage ResourceBroker
Monash University ITs Sun GRID: 54 dual 2.3 GHz CPUs208.7 GB (3.8 GB per node)>10 TB storage capacityRunning Gridsphere
Lab PC
CollectionPC
![Page 7: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/7.jpg)
Central web portal
![Page 8: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/8.jpg)
Central web portal
![Page 9: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/9.jpg)
Automated X-ray data reduction
• Automated processing of the diffraction data
• Investigating the incorporation of Xia2 : Automated Data Reduction:• New automated data reduction system designed
to work from raw diffraction data and a little metadata, and produce usefully reduced data in a form suitable for immediately starting phasing and structure determination (CCP4)
1. (Graeme Winter) The CCP4 suite: programs for protein crystallography. (1994). Acta Crystallogr. D50, 760-763.
1
![Page 10: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/10.jpg)
Divide and Conquer
• A large number of CPUs available across different computer clusters at different locations:• Monash ITs Sun grid• VPAC:
• (Brecca – 97 dual Xeon 2.8 GHz CPUs, 160 GB (2 GB per node) total memory; Edda – 185 Power5 CPUs, 552 GB (8-16 GB per node) total memory)
• APAC:• 1680 processors, 3.56 terabytes of memory, 100 terabytes of
disk
• Personal computers
![Page 11: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/11.jpg)
DART and CCP4
• Aims: Use the CCP4 interface locally but run the jobs remotely across a distributed system
• Nimrod to distribute the CCP4 jobs across the different Grid systems
• Investigating the possibility of incorporating the CCP4 interface into the DART web portal
![Page 12: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/12.jpg)
• No phasing data• No sequence identity (<20%)• No search model• Is there a possible fold homolog• Exhaustive Phaser scan of the PDB
• Exhaustive searches with different parameters and search models
Exhaustive Molecular Replacement
2. Acta Cryst. (2005). D61, 458-464. Likelihood-enhanced fast translation functions A. J. McCoy, R. W. Grosse-Kunstleve, L. C. Storoni and R. J. Read.
2
![Page 13: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/13.jpg)
Exhaustive Molecular Replacement
• Proteins building blocks are domains• Use subset of SCOP as search models in a
PHASER calculation.• The use of Grid computing will make this possible
~1000 CPUs = days for typical run
SCOP
ClassFold
SuperfamilyFamilies
Domains
7971
15893004
75930
• Search at the family level• Take the highest resolution structure
• Mutate to poly-alanine, and delete loops and turns• Phaser• Families with z-score 6 search with each of their domain members
![Page 14: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/14.jpg)
Exhaustive Molecular Replacement
Database containing:• ToDo list• Parameters• Results ITs Sun GRID
Each node runs a perl script:• Requests a job• Launch phaser• Returns the results • Repeats until the list is exhausted
56 dual dual AMD OpteronCPUs208.7 GB (3.8 GB per node)>10 TB storage capacity, 160 GB (2 GB per node) total memory
Will be extended to use Nimrod to gain access to APAC and the Pacific Rim Grid (Pragma)
![Page 15: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/15.jpg)
Final Pipeline
QuickTime™ and aTIFF (Uncompressed) decompressor
are needed to see this picture.
CloningExpression
PurificationCrystallisation
X-ray diffraction Determine the structure
Data collection, management, storage, and remote accessDART
Xia2 Data processing, exhaustive experimental (e.g., SAD, SIRAS, MIRAS) and MR phasing for final refinementGrid ComputingNIMRODPHASERAutoSHARPCCP4DART
High through put robotics and
technologies
![Page 16: High-Throughput Crystallography at Monash Noel Faux Dept of Biochemistry and Molecular Biology Monash University](https://reader035.vdocuments.us/reader035/viewer/2022070410/56649ebc5503460f94bc5adc/html5/thumbnails/16.jpg)
AcknowledgmentsMonash UniversityAnthony BeitzNicholas McPheeJames WhisstockAshley Buckle
James Cook UniversityFrank EilertTristan King
DART Team