large scale metabolic network alignments by compression

32
Large Scale Metabolic Network Alignments by Compression Michael Dang, Ferhat Ay , Tamer Kahveci ACM-BCB 2011 Bioinformatics Lab. University of Florida

Upload: cyrus-ramos

Post on 31-Dec-2015

25 views

Category:

Documents


1 download

DESCRIPTION

Large Scale Metabolic Network Alignments by Compression Michael Dang, Ferhat Ay , Tamer Kahveci ACM-BCB 2011. Bioinformatics Lab. University of Florida. Network Alignment. Bayati et al. ICDM 2009. Metabolic Network Alignment. Alignment with Heterogeneous Entities. - PowerPoint PPT Presentation

TRANSCRIPT

Large Scale Metabolic NetworkAlignments by Compression

Michael Dang, Ferhat Ay, Tamer Kahveci

ACM-BCB 2011

Bioinformatics Lab.

University of Florida

04/19/2023Ferhat Ay 2

Network Alignment

Bayati et al. ICDM 2009

04/19/2023Ferhat Ay 3

Metabolic Network Alignment

Network Alignment

Alignment with

Heterogeneous Entities

Subnetwork

Mappings

Functional Similarity

of Reactions

Querying Network

databases

04/19/2023Ferhat Ay 4

Existing Work Heymans et al. (2003) – Undirected, Hierarchical Enzyme

Similarity

Pinter et al. (2005) – Directed, Only Multi-Source Trees

Singh et al. (2007) – PPI Networks, Sequence Similarity

Dost et al. (2007) – QNET, Color Coding, Tree queries of size

at most 9

Kuchaiev et al. (2010) – GRAAL, Solely Based on

Network Topology

Ay et al. (2011) – SubMAP, Considers Subnetwork

Mappings

Shih et al. – Next Talk!

Clustering of input networks is necessary for aligning

large networks

04/19/2023 5Ferhat Ay

Alignment Phase - SubMAP

Ay et. al., RECOMB 2010, JCB

2011

04/19/2023Ferhat Ay 6

Performance Bottleneck

30 minutes2 Gigabytes

04/19/2023 7

Alignment with Compression

Compress Align

Refine

Ferhat Ay

04/19/2023 8Ferhat Ay

Alignment with/without compression

Without Compression

With Compression

04/19/2023Ferhat Ay 9

Outline of the method

• Compression Phase

• Minimum Degree Selection (MDS) method

• Optimality analysis

• Alignment Phase

• Refinement Phase

• Overall Complexity

• How Much Should We Compress?

• Experimental Results

04/19/2023 10Ferhat Ay

Compression PhaseOriginal Network

Encapsulated View

Compressed Network

What Alignment Algorithm Sees

04/19/2023 11Ferhat Ay

MDS – Minimum Degree Selection

Before AfterAfter

04/19/2023Ferhat Ay 12

Overall Compression

…………Level 1

Level 2

Step 1 Step 2 Step i

Level c

…………

……

……

…….…

04/19/2023 13Ferhat Ay

Optimality Condition for MDS

Optimal? Minimum Degree Node

04/19/2023Ferhat Ay 14

Optimality Condition for MDS

Minimum Degree Node

Optimal?

- Can be optimal- At most one node away

04/19/2023 15Ferhat Ay

How Far Away We Are from Optimal Compression?

“How far is our compressionmethod from the optimal compression?”

Number of compression steps for the optimal compression and MDS

Sizes of the compressed networks for the optimal compression and MDS

By the inequality

04/19/2023 16Ferhat Ay

Alignment PhaseNetwork 1

Network 2

Alignment

Compressed Network 1

Compressed Network 2

Network Alignment Algorithm

04/19/2023 17Ferhat Ay

Refinement Phase

Alignment Algorithm

Refine

Refined Alignment

04/19/2023 18Ferhat Ay

Complexity Analysis Compression Phase:

Alignment Phase:

Refinement Phase:

Overall Complexity (with compression):

Complexity of SubMAP (without compression):

k = largest subnetwork sizec = compression leveln, m = sizes of networks

04/19/2023Ferhat Ay 19

How much should we compress?

Examplesn=20, m=20, k=2 c ~ 1.37

n=20, m=80, k=1 c ~ 2.11

n=80, m=80, k=2 c ~ 2.15

n=200, m=400, k=1 c ~ 3.11

Experimental Results

04/19/2023Ferhat Ay 20

04/19/2023Ferhat Ay 21

To compress or not to compress?

04/19/2023 22Ferhat Ay

Compression Rates in Practice

KEGG Metabolic Networks with sizes ranging from 10 to 279

04/19/2023 23Ferhat Ay

What Do We Gain by Compression?

Subnetwork size k=1Subnetwork size k=2

04/19/2023 24Ferhat Ay

What Do We Lose by Compression?

k/c 1 2 3

1 0.89 0.56 0.53

2 0.85 0.58 0.50

3 0.84 0.57 0.49

Correlation of mappings scores found by

compressed

alignment with the ones found by SubMAP

Conclusions

04/19/2023 25

We developed a scalable compression technique with optimality bounds.

Our method respects network topology while aligning the networks unlike clustering-based methods.

It provides significant improvement on resource utilization of existing network alignment algorithms.

Ferhat Ay

Future Directions

04/19/2023 26

Improving the scale of alignment to genome-wide metabolic networks (without initial clustering).

Evaluating the performance of our compression technique on PPI networks.

Improving the accuracy of compressed alignment w.r.t original alignment for larger levels of compression.

Integrating our compression framework with other existing network alignment methods.

Ferhat Ay

04/19/2023 27Ferhat Ay

Tamer Kahveci

Michael DangNSF IIS-0845439

NSF CCF-0829867

Acknowledements

Computing Innovation Fellow

University of Washington

Department of Genome Sciences

http://cifellows.org/http://www.gs.washington.edu/

http://noble.gs.washington.edu/~wnoble/

A bit of advertisement

THANK YOU.

QUESTIONS?

04/19/2023 29Ferhat Ay

APPENDIX

04/19/2023 30Ferhat Ay

04/19/2023Ferhat Ay 31

04/19/2023Ferhat Ay 32