graph representation learninggraph representation learning william l. hamilton and jian tang mcgill...

18
Graph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila 1 Tutorial on Graph Representation Learning, AAAI 2019

Upload: others

Post on 16-Jul-2020

22 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Graph Representation Learning

William L. Hamilton and Jian TangMcGill University, HEC, and Mila

1Tutorial on Graph Representation Learning, AAAI 2019

Page 2: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Quebec Institute forLearning Algorithms (Mila)

§ Lead by the Deep Learning pioneer:Yoshua Bengio

§ The largest academic lab on deeplearning and reinforcement learning

§ >30 professors (14 core member), ~300 students

§ Multiple Postdoc, Ph.D., Master, andInterns positions are available.

Page 3: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Why graphs?Graphs are a general

language for describing and modeling complex

systems

3Tutorial on Graph Representation Learning, AAAI 2019

Page 4: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

4Tutorial on Graph Representation Learning, AAAI 2019

Page 5: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Graph!5Tutorial on Graph Representation Learning, AAAI 2019

Page 6: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

6

Economic networksSocial networks

Networks of neuronsInformation networks:

Web & citations

Biomedical networks

Internet

Many Data are Graphs

A B

C

Figure 3: Higher-order cluster in the C. elegans neuronal network (28). A: The 4-node“bi-fan” motif, which is over-expressed in the neuronal networks (1). Intuitively, this motifdescribes a cooperative propagation of information from the nodes on the left to the nodes onthe right. B: The best higher-order cluster in the C. elegans frontal neuronal network based onthe motif in (A). The cluster contains three ring motor neurons (RMEL/V/R; cyan) with manyoutgoing connections, serving as the source of information; six inner labial sensory neurons(IL2DL/VR/R/DR/VL; orange) with many incoming connections, serving as the destination ofinformation; and four URA neurons (purple) acting as intermediaries. These RME neurons havebeen proposed as pioneers for the nerve ring (20), while the IL2 neurons are known regulators ofnictation (21), and the higher-order cluster exposes their organization. The cluster also revealsthat RIH serves as a critical intermediary of information processing. This neuron has incominglinks from all three RME neurons, outgoing connections to five of the six IL2 neurons, and thelargest total number of connections of any neuron in the cluster. C: Illustration of the higher-order cluster in the context of the entire network. Node locations are the true two-dimensionalspatial embedding of the neurons. Most information flows from left to right, and we see thatRME/V/R/L and RIH serve as sources of information to the neurons on the right.

9

Tutorial on Graph Representation Learning, AAAI 2019

Page 7: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Why Graphs? Why Now?

§ Universal language for describing complex data§ Networks/graphs from science, nature, and

technology are more similar than one would expect

§ Shared vocabulary between fields§ Computer Science, Social science, Physics,

Economics, Statistics, Biology

§ Data availability (+computational challenges)§ Web/mobile, bio, health, and medical

§ Impact!§ Social networking, Social media, Drug design

Tutorial on Graph Representation Learning, AAAI 2019 7

Page 8: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Machine Learning with Graphs

Classical ML tasks in graphs:§ Node classification

§ Predict a type of a given node§ Link prediction

§ Predict whether two nodes are linked§ Community detection

§ Identify densely linked clusters of nodes

§ Network similarity§ How similar are two (sub)networks

Tutorial on Graph Representation Learning, AAAI 2019 8

Page 9: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Example: Node Classification

? ?

??

?Machine Learning

Tutorial on Graph Representation Learning, AAAI 2019 9

Page 10: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Example: Node Classification

Tutorial on Graph Representation Learning, AAAI 2019 10

Classifying the function of proteins in the interactome!

Image from: Ganapathiraju et al. 2016. Schizophrenia interactome with 504 novel protein–protein interactions. Nature.

Page 11: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Example: Link Prediction

Machine Learning

Tutorial on Graph Representation Learning, AAAI 2019 11

?

?

?

x

Page 12: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Example: Link Prediction

Tutorial on Graph Representation Learning, AAAI 2019 12

Content recommendation is

link prediction! ?

Page 13: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Machine Learning Lifecycle

13

Raw Data

Structured Data

Learning Algorithm

Model

Downstream prediction task

Feature Engineering

Automatically learn the features

§ (Supervised) Machine Learning Lifecycle: This feature, that feature. Every single time!

Tutorial on Graph Representation Learning, AAAI 2019

Page 14: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Feature Learning in Graphs

Goal: Efficient task-independent feature learning for machine learning

in graphs!

Tutorial on Graph Representation Learning, AAAI 2019 14

vecnode 2

!: # → ℝ&ℝ&

Feature representation, embedding

u

Page 15: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Example

Tutorial on Graph Representation Learning, AAAI 2019 15

OutputInput

A B

§ Zachary’s Karate Club Network:

Image from: Perozzi et al. 2014. DeepWalk: Online Learning of Social Representations. KDD.

Page 16: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Why Is It Hard?§ Modern deep learning toolbox is

designed for simple sequences or grids.§ CNNs for fixed-size images/grids….

§ RNNs or word2vec for text/sequences…

Tutorial on Graph Representation Learning, AAAI 2019 16

Page 17: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

Why Is It Hard?§ But graphs are far more complex!

§ Complex topographical structure (i.e., no spatial locality like grids)

§ No fixed node ordering or reference point (i.e., the isomorphism problem)

§ Often dynamic and have multimodal features.Tutorial on Graph Representation Learning, AAAI 2019 17

Page 18: Graph Representation LearningGraph Representation Learning William L. Hamilton and Jian Tang McGill University, HEC, and Mila Tutorial on Graph Representation Learning, AAAI 2019 1

This talk

§ 1) Node embeddings§ Map nodes to low-dimensional

embeddings.

~30min break

§ 2) Graph neural networks§ Deep learning architectures for graph-

structured data

§ 3) Generative graph models§ Learning to generate realistic graph data.

Tutorial on Graph Representation Learning, AAAI 2019 18