computational sorting with hyperfinder · poster# computational sorting with hyperfinder™ b70 bd...

1
Allison Irvine 1 , Joe Trotter 1 , Nikolay Samusik 1 , Josef Spidlen 2 . 1 BD Biosciences, 2350 Qume Drive, San Jose, CA, 95131, 2 FlowJo LLC, 385 Williamson Way, Ashland, OR 97520 Computational Sorting with HyperFinder™ Poster# B70 The authors have developed tools for a new workflow which enables users to incorporate machine learning algorithms practically into a cell sort on a BD FACS™ flow cytometer. In recent years, there has been an increase in the usage of dimensionality reduction and clustering techniques to discover and define cell populations. These techniques can result in nonlinear combinations of multiple markers, classification labels for each cell, or parameterized probability distributions. Many of these sophisticated computational tools are available in post-acquisition analysis software. Due to the hardware limitations of a cytometer, the gates used for sorting can only be one- or two-dimensional thresholds or shapes. Furthermore, the gates can only be defined on the parameters present in the data acquisition. To translate what defines a population from a machine-learning result to a cell sorter, we have developed a plug-in for FlowJo™ software called HyperFinder™. To bring those translated results back into the acquisition software, we have added the capability to export gates from FlowJo™ software to a BD FACSDiva™ experiment. Gates drawn on new parameters calculated from dimensionality reduction techniques cannot be interpreted by a cell sorter because those parameters are not present in the new data being acquired. Abstract Software Tools The prototype of the workflow we have developed for this project involves BD FACSDiva™ software and FlowJo™ software. The group has developed 3 plug-ins for FlowJo™ software and added integration with BD FACSDiva™ software. HyperFinder™ HyperFinder™ is an algorithm which searches all combinations of available parameters to find a 2D polygon gating strategy which most accurately distinguishes the population of interest. X-Shift™ X-Shift™ is a clustering algorithm which uses k-nearest neighbors local density estimation to construct clusters according to the density gradient in the k-nearest neighbors graph. ClusterExplorer™ ClusterExplorer™ is an interface for viewing and comparing clusters of cells in various ways, such as line graphs and heatmaps of average marker expression, and tSNE plots. Exporting from BD FACSDiva™ Software The ability to export and import gates, compensation, and parameter scaling between FlowJo software and a BD FACSDiva™ experiment was implemented as a feature in FlowJo™ software and will also be available as a FlowJo™ plug-in for users of earlier versions of FlowJo™. Tools Type equation here.equation here.+ - This is an initial attempt at a solution that allows users to perform a cell-sort based on cell population identification from machine learning methods such as clustering and dimensionality reduction. - This strategy was developed with the restriction that users will require no new hardware or acquisition software. Computational sorting can be performed right now using sorters running BD FACSDiva™ software. - In the future, the hardware on sorters which applies gates will need to be updated to allow sorting decisions based on more sophisticated derived parameters. The types of calculations which can be performed on sorting hardware are currently limited to logical operations. Type equation here.2 equation here.+ Identify Populations Classify Parameter Scaling Conversion Gate Conversion Create 2D Gating Strategy Train Acquire Data on the Sorter Collect Data Import the Data into FlowJo™ Import Export the Gates to a BD FACSDiva™ Experiment Export BD FACSDiva™ contains only Linear, Log, and Biexponential scaling. Biexponential Scaling = −10 2 BD FACSDiva™ stores the scale parameter r FlowJo™ software stores a function of the width parameter W Converting the FlowJo™ width parameter to the BD FACSDiva™ scale parameter: = − ∗ ∗ 10 1. Classical monocytes Identified using t-SNE after two cleanup gates (10 dimensions). 2. HyperFinder™ creates three gates to sort the classical monocytes in BD FACSDiva™ software. FlowJo™ Gates FACSDiva™ Gates FlowJo ™ Plug-ins: The plug-ins described her are available at the FlowJo™ Exchange website: https://www.flowjo.com/exchange/#/ Identify target and background populations Split the dataset into training and validation sets Using training set: Build a biaxial gate sequence Convex polygons Hyper-rectangle Run stochastic global optimization of gate boundaries to maximize the F1-score Report precision, recall, F1-score using validation set Classify Event and Sort Sort Workflow Finding a Gating Strategy with HyperFinder FlowJo ™ to FACSDiva ™ Conversion Conclusions 23-21641-00 BD, the BD Logo, ClusterExplorer, FACS, FACSDiva, FACSymphony, FlowJo, HyperFinder and X-Shift are trademarks of Becton, Dickinson and Company. © 2019 BD. All rights reserved. 1. Gate on Scatter (clustering results can be skewed by differences in range) 2. Cluster the data using FL parameters (X- Shift™ plug-in) 3. Use the ClusterExplore™ plug-in to choose a cluster Products are for Research Use Only. Not for use in diagnostic or therapeutic procedures. BDBiosciences.com

Upload: others

Post on 20-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computational Sorting with HyperFinder · Poster# Computational Sorting with HyperFinder™ B70 BD Life Sciences The authors have developed tools for a new workflow which enables

Allison Irvine1, Joe Trotter1, Nikolay Samusik1, Josef Spidlen2.1BD Biosciences, 2350 Qume Drive, San Jose, CA, 95131, 2FlowJo LLC, 385 Williamson Way, Ashland, OR 97520

Computational Sorting with HyperFinder™Poster#B70

BD Life Sciences

The authors have developed tools for a new workflow which enables users to incorporate machine learning algorithms practically into a cell sort on a BD FACS™ flow cytometer.

In recent years, there has been an increase in the usage of dimensionality reduction and clustering techniques to discover and define cell populations. These techniques can result in nonlinear combinations of multiple markers, classification labels for each cell, or parameterized probability distributions. Many of these sophisticated computational tools are available in post-acquisition analysis software.

Due to the hardware limitations of a cytometer, the gates used for sorting can only be one- or two-dimensional thresholds or shapes. Furthermore, the gates can only be defined on the parameters present in the data acquisition.

To translate what defines a population from a machine-learning result to a cell sorter, we have developed a plug-in for FlowJo™ software called HyperFinder™. To bring those translated results back into the acquisition software, we have added the capability to export gates from FlowJo™ software to a BD FACSDiva™ experiment.

Gates drawn on new parameters calculated from dimensionality reduction techniques cannot be interpreted by a cell sorter because those parameters are not present in the new data being acquired.

Abstract

Software ToolsThe prototype of the workflow we have developed for this project involves BD FACSDiva™ software and FlowJo™ software. The group has developed 3 plug-ins for FlowJo™ software and added integration with BD FACSDiva™ software.

HyperFinder™HyperFinder™ is an algorithm which searches all combinations of available parameters to find a 2D polygon gating strategy which most accurately distinguishes the population of interest.

X-Shift™X-Shift™ is a clustering algorithm which uses k-nearest neighbors local density estimation to construct clusters according to the density gradient in the k-nearest neighbors graph.

ClusterExplorer™ClusterExplorer™ is an interface for viewing and comparing clusters of cells in various ways, such as line graphs and heatmaps of average marker expression, and tSNE plots.

Exporting from BD FACSDiva™ SoftwareThe ability to export and import gates, compensation, and parameter scaling between FlowJo software and a BD FACSDiva™ experiment was implemented as a feature in FlowJo™ software and will also be available as a FlowJo™ plug-in for users of earlier versions of FlowJo™.

Tools

Type equation here.2 equation here.+

- This is an initial attempt at a solution that allows users to perform a cell-sort based on cell population identification from machine learning methods such as clustering and dimensionality reduction.

- This strategy was developed with the restriction that users will require no new hardware or acquisition software. Computational sorting can be performed right now using sorters running BD FACSDiva™ software.

- In the future, the hardware on sorters which applies gates will need to be updated to allow sorting decisions based on more sophisticated derived parameters. The types of calculations which can be performed on sorting hardware are currently limited to logical operations.

Type equation here.2 equation here.+

Identify PopulationsClassify

Parameter Scaling Conversion

Gate Conversion

Create 2D Gating

StrategyTrain

Acquire Data on the

Sorter

Collect Data

Import the Data into FlowJo™

Import

Export the Gates to a BD

FACSDiva™ Experiment

Export

• BD FACSDiva™ contains only Linear, Log, and Biexponential scaling.

Biexponential Scaling

𝑊𝐹𝑙𝑜𝑤𝐽𝑜 = −102𝑊

• BD FACSDiva™ stores the scale parameter r

• FlowJo™ software stores a function of the width parameter W

Converting the FlowJo™ width parameter to the BD FACSDiva™ scale parameter:

𝑟 = −𝑊𝐹𝑙𝑜𝑤𝐽𝑜 ∗ 𝑇 ∗ 10−𝑚

1. Classical monocytes Identified using t-SNE after two cleanup gates (10 dimensions).

2. HyperFinder™ creates three gates to sort the classical monocytes in BD FACSDiva™ software.

FlowJo™ Gates FACSDiva™ Gates

FlowJo™ Plug-ins:

The plug-ins described her are available at the FlowJo™ Exchange website:

https://www.flowjo.com/exchange/#/

• Identify target and background populations• Split the dataset into training and validation sets• Using training set:

• Build a biaxial gate sequence• Convex polygons• Hyper-rectangle

• Run stochastic global optimization of gate boundaries to maximize the F1-score• Report precision, recall, F1-score using validation set

Classify Event and Sort

Sort

Workflow

Finding a Gating Strategy with HyperFinder™ FlowJo™ to FACSDiva™ Conversion Conclusions

23-21641-00

BD, the BD Logo, ClusterExplorer, FACS, FACSDiva, FACSymphony, FlowJo, HyperFinder and X-Shift are trademarks of Becton, Dickinson and Company. © 2019 BD. All rights reserved.

1. Gate on Scatter (clustering results can be skewed by differences in range)

2. Cluster the data using FL parameters (X-Shift™ plug-in)

3. Use the ClusterExplore™ plug-in to choose a cluster

Products are for Research Use Only. Not for use in diagnostic or therapeutic procedures.BDBiosciences.com