leveraging sequence classification by taxonomy-based ...€¦ · taxonomy-based multitask learning...

34
Leveraging Sequence Classification by Taxonomy-based Multitask Learning Christian Widmer , 1 Jose Leiva, 1,2 Yasemin Altun, 2 Gunnar R¨ atsch 1 1 Friedrich Miescher Laboratory of the Max Planck Society, T¨ ubingen, Germany 2 Max Planck Institute for Biological Cybernetics, T¨ ubingen, Germany RECOMB 2010 August 15th, 2010 Christian Widmer (FML, T¨ ubingen) Taxonomy-based Multitask Learning 1 / 22

Upload: others

Post on 11-Oct-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Leveraging Sequence Classification by

Taxonomy-based Multitask Learning

Christian Widmer,1 Jose Leiva,1,2 Yasemin Altun,2 Gunnar Ratsch1

1 Friedrich Miescher Laboratory of the Max Planck Society, Tubingen, Germany2 Max Planck Institute for Biological Cybernetics, Tubingen, Germany

RECOMB 2010

August 15th, 2010

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 1 / 22

Page 2: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Talk Outline fml

I Introduction of the setting

I Methods: 3 approaches to Hierarchical Mulitask Learning

I Experiments (Toy, Splicing)

I Outlook

I Summary

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 2 / 22

Page 3: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Introduction

Why Multitask learning? fml

I Model quality limited by insufficient training data

⇒ Idea: Use information from related domains/tasks

I Related Distributions: Different but similar

I In Computational Biology domains correspond to organismsI Organisms share evolutionary historyI Many biological processes are conserved

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 3 / 22

Page 4: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Introduction

Why Hierarchies? fmlHierarchical structure arises naturally from the Tree of Life

I Taxonomy used to define relationship between tasks

I Closer tasks benefit more from each other

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 4 / 22

Page 5: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Introduction

What problems could benefit from this? fmlI Prediction of Sequence Signals in Gene Finding (mGene [4])

I Numerous potential applications for Multitask LearningI Focus on splice-sites (Donor)

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 5 / 22

Page 6: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Introduction

What problems could benefit from this? fmlI Prediction of Sequence Signals in Gene Finding (mGene [4])

I Numerous potential applications for Multitask LearningI Focus on splice-sites (Donor)

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 5 / 22

Page 7: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Formal Problem Definition fml

Multitask Learning

I Consider M tasks Ti , where i ∈ {1, ...,M}I We are given data Di = {(x1, y1), ..., (xNi

, yNi)} for each task

I We want to train M predictors f1, ..., fM , each taking intoaccount all available information

I For that we would like to utilize a given taxonomy T , thatrelates the tasks at hand

⇒ We need algorithms to efficiently exploit T for transfer learning

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 6 / 22

Page 8: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Formal Problem Definition fml

Multitask Learning

I Consider M tasks Ti , where i ∈ {1, ...,M}I We are given data Di = {(x1, y1), ..., (xNi

, yNi)} for each task

I We want to train M predictors f1, ..., fM , each taking intoaccount all available information

I For that we would like to utilize a given taxonomy T , thatrelates the tasks at hand

⇒ We need algorithms to efficiently exploit T for transfer learning

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 6 / 22

Page 9: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Formal Problem Definition fml

Multitask Learning

I Consider M tasks Ti , where i ∈ {1, ...,M}I We are given data Di = {(x1, y1), ..., (xNi

, yNi)} for each task

I We want to train M predictors f1, ..., fM , each taking intoaccount all available information

I For that we would like to utilize a given taxonomy T , thatrelates the tasks at hand

⇒ We need algorithms to efficiently exploit T for transfer learning

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 6 / 22

Page 10: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Formal Problem Definition fml

Multitask Learning

I Consider M tasks Ti , where i ∈ {1, ...,M}I We are given data Di = {(x1, y1), ..., (xNi

, yNi)} for each task

I We want to train M predictors f1, ..., fM , each taking intoaccount all available information

I For that we would like to utilize a given taxonomy T , thatrelates the tasks at hand

⇒ We need algorithms to efficiently exploit T for transfer learning

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 6 / 22

Page 11: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Formal Problem Definition fml

Multitask Learning

I Consider M tasks Ti , where i ∈ {1, ...,M}I We are given data Di = {(x1, y1), ..., (xNi

, yNi)} for each task

I We want to train M predictors f1, ..., fM , each taking intoaccount all available information

I For that we would like to utilize a given taxonomy T , thatrelates the tasks at hand

⇒ We need algorithms to efficiently exploit T for transfer learning

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 6 / 22

Page 12: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Two ways of leveraging a given taxonomy T fml

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 7 / 22

Page 13: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Domain Adaptation by Regularization fmlIdea: Enforce model similarity via Regularization Term

I Regular SVM

minw,b

1

2‖w‖2 + C

∑(x,y)∈D

` (〈Φ(x),w〉+ b, y)

I DA-SVM

minw,b

1

2‖w−wpar‖2 + C

∑(x,y)∈D

` (〈Φ(x),w〉+ b, y) ,

where ` is the hinge loss, `(z , y) = max{1− yz , 0}.Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 8 / 22

Page 14: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Domain Adaptation by Regularization fmlIdea: Enforce model similarity via Regularization Term

I Regular SVM

minw,b

1

2‖w‖2 + C

∑(x,y)∈D

` (〈Φ(x),w〉+ b, y)

I DA-SVM

minw,b

1

2‖w−wpar‖2 + C

∑(x,y)∈D

` (〈Φ(x),w〉+ b, y) ,

where ` is the hinge loss, `(z , y) = max{1− yz , 0}.Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 8 / 22

Page 15: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Hierarchical Top-Down Approach fmlIdea: Exploit taxonomy G algorithmically

I Initialization: w0 trained on union of task dataI Top-Down for each node t:

I Train on Di =⋃

j4i Dj

I Regularize wi against parent predictor wpar : ‖wi−wpar‖2

I Use leaf predictors for classification

(a) Top-level training (b) Inner training (c) Taxon training

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 9 / 22

Page 16: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Hierarchical Top-Down Approach fmlIdea: Exploit taxonomy G algorithmically

I Initialization: w0 trained on union of task dataI Top-Down for each node t:

I Train on Di =⋃

j4i Dj

I Regularize wi against parent predictor wpar : ‖wi−wpar‖2

I Use leaf predictors for classification

(a) Top-level training (b) Inner training (c) Taxon training

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 9 / 22

Page 17: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Hierarchical Top-Down Approach fmlIdea: Exploit taxonomy G algorithmically

I Initialization: w0 trained on union of task dataI Top-Down for each node t:

I Train on Di =⋃

j4i Dj

I Regularize wi against parent predictor wpar : ‖wi−wpar‖2

I Use leaf predictors for classification

(a) Top-level training (b) Inner training (c) Taxon training

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 9 / 22

Page 18: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Pairwise Approach fml

minw1,...,wM

1

2

M∑t=1

M∑s=1

γt,s‖wt −ws‖2 +M∑t=1

Ct

∑(x,y)∈Dt

` (〈x,wt〉 , y) .

where ` is the hinge loss, `(z , y) = max{1− yz , 0}.I Train all classifiers wi at the same time

I Loss is evaluated independently on datasets Di

I Similarity is enforced via pairwise regularization term

I Task closeness controlled by γt,s

⇒ efficient solution possible via decompositionEvgeniou et al. [2]

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 10 / 22

Page 19: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Multitask Kernel Approach fml

maxα−1

2

n∑i=1

n∑j=1

αiαjyiyj k(xi , xj) +n∑

i=1

αi

s.t. 0 ≤ αi ≤ C ∀i ∈ [1, n]

αTy = 0,

wherek((xi , s), (xj , t)) = ktask(s, t)︸ ︷︷ ︸

γt,s

·k(xi , xj)

I Easily implemented by altering existing kernel functions (WDK)I Reuse existing kernel algorithms (SVM)

Daume III [1], Vert et al. [3]

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 11 / 22

Page 20: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Methods

Method Summary fmlI Multitask Learning Methods

I Top-DownI Pairwise RegularizationI Multitask Kernel

I Additional Baselines

Plain Union

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 12 / 22

Page 21: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Experiments

Toy Data Generation fmlI Sample sequences for M tasks

I Evolve generative model (PSSM) according to given structure

I We want to controlI How hard the problems areI How different the problems areI → use relative entropy DKL(P||Q) =

∑i P(i)log P(i)

Q(i)

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 13 / 22

Page 22: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Experiments

Results Toy fml

8 tasks 16 tasks 32 tasks 64 tasks0.62

0.64

0.66

0.68

0.70

0.72

0.74

0.76

auR

OC

plainunionpairwisemultikerneltop-down

I MTL methods outperform baselinesI High mutation rate leads to decreased performance in deep

hierarchiesChristian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 14 / 22

Page 23: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Experiments

Application to splice-site recognition fmlI Formulate as binary classification problem

I Utilize 15 organisms related by taxonomy

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 15 / 22

Page 24: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Experiments

Results Splicing Data fml

I MTL methods outperform baselines

I Best performer is Top-Down

I Potential increase depends on the particular organism

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 16 / 22

Page 25: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Experiments

Outlook fml

I Techniques for estimation of edge lengths

I Apply to Promoter prediction: ARTS [5] ⇒ ARTS.MTL

I Extension to structured output learning

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 17 / 22

Page 26: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

Summary fml

I Presented three MTL methodsI Two extensions of existing MTL methodsI One novel method

I Demonstrated performance on two datasetsI Toy Data: RobustnessI Splicing Data: Outperform baselines

I Implementations available in Shogun [6]

I Relevant to many application in Comp-Bio

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 18 / 22

Page 27: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

Summary fml

I Presented three MTL methodsI Two extensions of existing MTL methodsI One novel method

I Demonstrated performance on two datasetsI Toy Data: RobustnessI Splicing Data: Outperform baselines

I Implementations available in Shogun [6]

I Relevant to many application in Comp-Bio

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 18 / 22

Page 28: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

Summary fml

I Presented three MTL methodsI Two extensions of existing MTL methodsI One novel method

I Demonstrated performance on two datasetsI Toy Data: RobustnessI Splicing Data: Outperform baselines

I Implementations available in Shogun [6]

I Relevant to many application in Comp-Bio

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 18 / 22

Page 29: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

Summary fml

I Presented three MTL methodsI Two extensions of existing MTL methodsI One novel method

I Demonstrated performance on two datasetsI Toy Data: RobustnessI Splicing Data: Outperform baselines

I Implementations available in Shogun [6]

I Relevant to many application in Comp-Bio

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 18 / 22

Page 30: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

Acknowledgments fmlI Jose Leiva 1,2

I Yasemin Altun 2

I Soren Sonnenburg 2

I Gabrielle Schweikert 1,2

I Bernhard Scholkopf 2

I Klaus Robert Muller 3

I Gunnar Ratsch 1

1 Friedrich Miescher Laboratory of the Max Planck Society

2 Max Planck Institute for Biological Cybernetics

3 Berlin Institute of Technology

Thank you for your attention.

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 19 / 22

Page 31: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

Acknowledgments fmlI Jose Leiva 1,2

I Yasemin Altun 2

I Soren Sonnenburg 2

I Gabrielle Schweikert 1,2

I Bernhard Scholkopf 2

I Klaus Robert Muller 3

I Gunnar Ratsch 1

1 Friedrich Miescher Laboratory of the Max Planck Society

2 Max Planck Institute for Biological Cybernetics

3 Berlin Institute of Technology

Thank you for your attention.

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 19 / 22

Page 32: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

References I

H. Daume.

Frustratingly easy domain adaptation.

In ACL. The Association for Computer Linguistics, 2007.

T. Evgeniou, C.A. Micchelli, and M. Pontil.

Learning multiple tasks with kernel methods.

Journal of Machine Learning Research, 6:615–637, 2005.

L. Jacob and J.P. Vert.

Efficient peptide-MHC-I binding prediction for alleles with few knownbinders.

Bioinformatics, 24(3):358, 2008.

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 20 / 22

Page 33: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

References II

Gabriele Schweikert, Alexander Zien, Georg Zeller, Jonas Behr, ChristophDieterich, Cheng Soon Ong, Petra Philips, Fabio De Bona, Lisa Hartmann,Anja Bohlen, Nina Kruger, Soren Sonnenburg, and Gunnar Ratsch.

mGene: accurate SVM-based gene finding with an application to nematodegenomes.

Genome research, 19(11):2133–43, November 2009.

S. Sonnenburg, A. Zien, and G. Ratsch.

ARTS: accurate recognition of transcription starts in human.

Bioinformatics, 22(14):e472, 2006.

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 21 / 22

Page 34: Leveraging Sequence Classification by Taxonomy-based ...€¦ · Taxonomy-based Multitask Learning Christian Widmer, 1Jose Leiva,; 2Yasemin Altun, Gunnar R atsch1 1 Friedrich Miescher

Summary

References III

Soren Sonnenburg, Gunnar Ratsch, Sebastian Henschel, Christian Widmer,Jonas Behr, Alexander Zien, Fabio de Bona, Alexander Binder, ChristianGehl, and Vojtech Franc.

The shogun machine learning toolbox.

Journal of Machine Learning Research, 2010.

accepted.

Christian Widmer (FML, Tubingen) Taxonomy-based Multitask Learning 22 / 22