Michael Alcorn, Sr. Software Engineer, Red Hat Inc. at MLconf SF 2017

REPRESENTATION LEARNING @ RED HAT
Michael A. Alcorn ([email protected])

Machine Learning Engineer - Information Retrieval

https://sites.google.com/view/michaelaalcorn/


Outline
Background
word2vec/url2vec
doc2vec/account2vec
Duplicate Detection
(batter|pitcher)2vec

MLconf Blog: https://mlconf.com/guest-blog-michael-alcorn-senior-software-engineer-red-hat/

Background

Why?
Small amount (zero?) of labeled data for task
Lots of unlabeled data (labeled data for a different task?)

Can we use large amounts of unlabeled data to make better predictions?

Not the same as traditional unsupervised learning!

Representation learning: https://en.wikipedia.org/wiki/Feature_learning
Transfer learning: https://journalofbigdata.springeropen.com/articles/10.1186/s40537-016-0043-6
Excellent chapter in Goodfellow et al.'s Deep Learning textbook: http://www.deeplearningbook.org/contents/representation.html
Article by Bengio et al.: https://arxiv.org/abs/1206.5538

word2vec

NVIDIA - "Introduction to Neural Machine Translation with GPUs (Part 2)": https://devblogs.nvidia.com/parallelforall/introduction-neural-machine-translation-gpus-part-2/

word2vec

Deeplearning4j - "Word2vec": https://deeplearning4j.org/word2vec
Mikolov et al. (2013): http://arxiv.org/pdf/1301.3781.pdf

word2vec
Analogies

"x is to y as ? is to z"
x - y + z = ?
bash - shellshock + heartbleed = openssl
firefox - linux + windows = internet_explorer
openshift - cloud + storage = gluster
rhn_register - rhn + rhsm = subscription-manager
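The analogy arithmetic above can be sketched with plain vector math. This is a minimal illustration, not the pipeline used in the talk: the 3-D vectors are hand-written so that the arithmetic works out, and the `analogy` helper is a hypothetical name. Real word2vec vectors are learned from text.

```python
import math

# Toy 3-D vectors invented for illustration (axes: bash-ness, openssl-ness,
# vulnerability-ness). Real embeddings are learned, not hand-written.
TOY_VECTORS = {
    "bash": [1.0, 0.0, 0.0],
    "shellshock": [1.0, 0.0, 1.0],
    "openssl": [0.0, 1.0, 0.0],
    "heartbleed": [0.0, 1.0, 1.0],
    "firefox": [0.6, 0.5, 0.1],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def analogy(x, y, z, vectors):
    """Return the word whose vector is closest to x - y + z."""
    target = [a - b + c for a, b, c in zip(vectors[x], vectors[y], vectors[z])]
    # Exclude the three input words so the answer is a new word.
    candidates = (w for w in vectors if w not in (x, y, z))
    return max(candidates, key=lambda w: cosine(vectors[w], target))


print(analogy("bash", "shellshock", "heartbleed", TOY_VECTORS))
```

Excluding the input words from the candidates is a common convention for analogy queries; without it the nearest neighbor is often one of the inputs.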

Naming Colors
Blog post by Janelle Shane mapping RGB values to color names: http://aiweirdness.com/post/160776374467/new-paint-colors-invented-by-neural-network
Results are pretty underwhelming for those in the know
Can word embeddings improve (GitHub: https://github.com/airalcorn2/Color-Names)?

url2vec
Tasks concerning URLs

Search - returning relevant content
Troubleshooting - recommending related articles

Obvious method - look at text
Alternative/enhanced method - use customer browsing behavior as additional contextual clues

url2vec
How?

Treat each day of browsing activity as a "sentence"
Treat each URL as a "word"
Run word2vec!
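The data preparation step might look like the sketch below. It assumes a "sentence" is one visitor's URLs on one day, in visit order. The log layout, the visitor IDs, and the example.com URLs are all made up for illustration.

```python
from collections import defaultdict

# Hypothetical browsing log: (visitor_id, date, url) rows in visit order.
LOG = [
    ("visitor-1", "2017-11-01", "https://example.com/solutions/a"),
    ("visitor-1", "2017-11-01", "https://example.com/solutions/b"),
    ("visitor-2", "2017-11-01", "https://example.com/solutions/b"),
    ("visitor-1", "2017-11-02", "https://example.com/solutions/c"),
    ("visitor-2", "2017-11-01", "https://example.com/solutions/c"),
]


def browsing_sentences(log):
    """Group URLs into one "sentence" per visitor per day, keeping visit order."""
    sentences = defaultdict(list)
    for visitor, day, url in log:
        sentences[(visitor, day)].append(url)
    return list(sentences.values())


sentences = browsing_sentences(LOG)
for sentence in sentences:
    print(sentence)
```

The result is a list of token lists, which is the input shape word2vec implementations generally accept, with URLs standing in for words.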

url2vec

https://access.redhat.com/solutions/25190


Application: ScatterPlot3D (https://sites.google.com/view/michaelaalcorn/projects/scatterplot3d)

doc2vec

Le and Mikolov (2014): https://arxiv.org/abs/1405.4053
"NLP 05: From Word2vec to Doc2vec: a simple example with Gensim": https://ireneli.eu/2016/07/27/nlp-05-from-word2vec-to-doc2vec-a-simple-example-with-gensim/

customer2vec
Why?

Data-driven segmentation

Same idea as url2vec except now we treat each account as a "document" of many "sentences" (different browsing days)
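One way to picture the account-as-document framing is sketched below. The row layout, account IDs, and URLs are invented, and both helper names are hypothetical. Each daily sentence is paired with its account ID, which plays the role of the document tag in a doc2vec-style model.

```python
from collections import defaultdict

# Hypothetical rows: (account_id, date, url). Every value here is invented.
LOG = [
    ("acct-1", "2017-11-01", "https://example.com/solutions/a"),
    ("acct-1", "2017-11-01", "https://example.com/solutions/b"),
    ("acct-2", "2017-11-01", "https://example.com/solutions/b"),
    ("acct-1", "2017-11-02", "https://example.com/solutions/c"),
]


def account_documents(log):
    """Map each account to its "document": a list of daily URL sentences."""
    by_day = defaultdict(lambda: defaultdict(list))
    for account, day, url in log:
        by_day[account][day].append(url)
    return {
        account: [days[day] for day in sorted(days)]
        for account, days in by_day.items()
    }


def tagged_sentences(documents):
    """Pair every sentence with its account ID, which acts as the document tag."""
    return [
        (account, sentence)
        for account, sentences in documents.items()
        for sentence in sentences
    ]


documents = account_documents(LOG)
pairs = tagged_sentences(documents)
print(pairs)
```

Because every sentence from an account carries the same tag, a doc2vec-style model would learn one vector per account alongside the URL vectors.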


Duplicate Detection
There are a number of "duplicate" KCS solutions on the Customer Portal

Muddy search results

How can we identify candidate duplicate documents?

Obvious approach - compare text (e.g., tf-idf)

Bag-of-words loses any structural meaning behind text
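A small sketch of the tf-idf comparison shows the bag-of-words limitation. The three documents are invented, and the weighting is a bare-bones raw count times log(N / document frequency), not the exact scheme of any particular library. The first two documents describe opposite situations but use the same words, so they score as near-identical.

```python
import math
from collections import Counter

# Invented example documents; the first two differ only in word order.
DOCS = [
    "server cannot reach client",
    "client cannot reach server",
    "how to register a system",
]


def tfidf_vectors(docs):
    """Return one sparse {term: weight} dict per document."""
    tokenized = [doc.split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append(
            {t: c * math.log(n_docs / doc_freq[t]) for t, c in counts.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors (0.0 if either is all zeros)."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(
        sum(w * w for w in v.values())
    )
    return dot / norm if norm else 0.0


vecs = tfidf_vectors(DOCS)
print(cosine(vecs[0], vecs[1]))  # same words, different meaning
print(cosine(vecs[0], vecs[2]))  # no shared words
```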

Can we learn better representations?

Title is essentially a summary of the solution content
Learn representations of body that are similar to title representations (like the DSSM; my code: https://github.com/airalcorn2/Deep-Semantic-Similarity-Model)
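The title/body matching idea can be sketched as a DSSM-style objective: a softmax over scaled cosine similarities between one body vector and several candidate title vectors, with the true title's negative log probability as the loss. This only evaluates the objective on fixed, hand-written vectors. The vectors, the `gamma` value, and the function names are assumptions for illustration, and no encoder or training loop is shown.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def title_posterior(body_vec, title_vecs, gamma=10.0):
    """Softmax over gamma-scaled cosine similarities between a body and titles."""
    scores = [gamma * cosine(body_vec, t) for t in title_vecs]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up vectors: index 0 is the body's own title, the rest are other titles.
body = [0.9, 0.1, 0.2]
titles = [[1.0, 0.0, 0.1], [0.0, 1.0, 0.0], [0.1, 0.2, 1.0]]

probs = title_posterior(body, titles)
loss = -math.log(probs[0])  # small when the true title is the closest one
print(probs, loss)
```

For duplicate detection, the learned body vectors could then be compared with each other by cosine similarity to surface candidate duplicates.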

Deep Semantic Similarity Model

Jianfeng Gao - "Deep Learning for Web Search and Natural Language Processing": https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/wsdm2015.v3.pdf

(batter|pitcher)2vec (GitHub: https://github.com/airalcorn2/batter-pitcher-2vec)
Can we learn meaningful representations of MLB players?

Accurate representations could be used to simulate games and inform trades
Find undervalued/overvalued players



SI.com: https://www.si.com/mlb/2017/05/25/mike-trout-los-angeles-angels
NBCSports.com: http://www.nbcsports.com/chicago/chicago-cubs/back2backoneday-what-bryce-harper-trying-say-about-his-future-kris-bryant-and-cubs


(batter|pitcher)2vec

"Learning to Coach Football": https://sites.google.com/view/michaelaalcorn/blog/learning-to-coach-football
Wang and Zemel (2016): http://www.sloansportsconference.com/wp-content/uploads/2016/02/1536-Classifying-NBA-Offensive-Plays-Using-Neural-Networks.pdf

THANK YOU!
