introductionpdhoff/courses/567/notes/l1_intro_paused.pdfintroduction 567 statistical analysis of...

82
Introduction 567 Statistical analysis of social networks Peter Hoff Statistics, University of Washington 1/15

Upload: others

Post on 03-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Introduction567 Statistical analysis of social networks

Peter Hoff

Statistics, University of Washington

1/15

Page 2: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Networks are relational data

Relationship: An irreducible property of two or more entities.

• Contrast to properties of entities alone (attributes).

• Relations are possibly affected, but not determined, by entity attributes.

Focus of RDA/SNA: The study of relational data arising from social entities.

• Entities: people, animals, groups, locations, organizations, regions, etc..

• Relationships: communication, acquaintanceship, sexual contact, trade,migration rate, alliance/conflict, etc..

2/15

Page 3: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Networks are relational data

Relationship: An irreducible property of two or more entities.

• Contrast to properties of entities alone (attributes).

• Relations are possibly affected, but not determined, by entity attributes.

Focus of RDA/SNA: The study of relational data arising from social entities.

• Entities: people, animals, groups, locations, organizations, regions, etc..

• Relationships: communication, acquaintanceship, sexual contact, trade,migration rate, alliance/conflict, etc..

2/15

Page 4: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Networks are relational data

Relationship: An irreducible property of two or more entities.

• Contrast to properties of entities alone (attributes).

• Relations are possibly affected, but not determined, by entity attributes.

Focus of RDA/SNA: The study of relational data arising from social entities.

• Entities: people, animals, groups, locations, organizations, regions, etc..

• Relationships: communication, acquaintanceship, sexual contact, trade,migration rate, alliance/conflict, etc..

2/15

Page 5: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Networks are relational data

Relationship: An irreducible property of two or more entities.

• Contrast to properties of entities alone (attributes).

• Relations are possibly affected, but not determined, by entity attributes.

Focus of RDA/SNA: The study of relational data arising from social entities.

• Entities: people, animals, groups, locations, organizations, regions, etc..

• Relationships: communication, acquaintanceship, sexual contact, trade,migration rate, alliance/conflict, etc..

2/15

Page 6: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Networks are relational data

Relationship: An irreducible property of two or more entities.

• Contrast to properties of entities alone (attributes).

• Relations are possibly affected, but not determined, by entity attributes.

Focus of RDA/SNA: The study of relational data arising from social entities.

• Entities: people, animals, groups, locations, organizations, regions, etc..

• Relationships: communication, acquaintanceship, sexual contact, trade,migration rate, alliance/conflict, etc..

2/15

Page 7: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Networks are relational data

Relationship: An irreducible property of two or more entities.

• Contrast to properties of entities alone (attributes).

• Relations are possibly affected, but not determined, by entity attributes.

Focus of RDA/SNA: The study of relational data arising from social entities.

• Entities: people, animals, groups, locations, organizations, regions, etc..

• Relationships: communication, acquaintanceship, sexual contact, trade,migration rate, alliance/conflict, etc..

2/15

Page 8: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Some vocabulary

Relational data:A collection of entities and a set ofmeasured relations between them.

• Entities: nodes, actors, egos, units.

• Relations: ties, links, edges.

Relations can be

• directed or undirected;

• valued or dichotomous (binary).

● ●

● ●

1 2

3

4

5

●●

●1

2

3

4

5

6

7

8

●●

●●

1

2

3

45

6

3/15

Page 9: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Some vocabulary

Relational data:A collection of entities and a set ofmeasured relations between them.

• Entities: nodes, actors, egos, units.

• Relations: ties, links, edges.

Relations can be

• directed or undirected;

• valued or dichotomous (binary).

● ●

● ●

1 2

3

4

5

●●

●1

2

3

4

5

6

7

8

●●

●●

1

2

3

45

6

3/15

Page 10: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Some vocabulary

Relational data:A collection of entities and a set ofmeasured relations between them.

• Entities: nodes, actors, egos, units.

• Relations: ties, links, edges.

Relations can be

• directed or undirected;

• valued or dichotomous (binary).

● ●

● ●

1 2

3

4

5

●●

●1

2

3

4

5

6

7

8

●●

●●

1

2

3

45

6

3/15

Page 11: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Some vocabulary

Relational data:A collection of entities and a set ofmeasured relations between them.

• Entities: nodes, actors, egos, units.

• Relations: ties, links, edges.

Relations can be

• directed or undirected;

• valued or dichotomous (binary).

● ●

● ●

1 2

3

4

5

●●

●1

2

3

4

5

6

7

8

●●

●●

1

2

3

45

6

3/15

Page 12: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Some vocabulary

Relational data:A collection of entities and a set ofmeasured relations between them.

• Entities: nodes, actors, egos, units.

• Relations: ties, links, edges.

Relations can be

• directed or undirected;

• valued or dichotomous (binary).

● ●

● ●

1 2

3

4

5

●●

●1

2

3

4

5

6

7

8

●●

●●

1

2

3

45

6

3/15

Page 13: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Some vocabulary

Relational data:A collection of entities and a set ofmeasured relations between them.

• Entities: nodes, actors, egos, units.

• Relations: ties, links, edges.

Relations can be

• directed or undirected;

• valued or dichotomous (binary).

● ●

● ●

1 2

3

4

5

●●

●1

2

3

4

5

6

7

8

●●

●●

1

2

3

45

6

3/15

Page 14: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Different perspectives on network analysis

• Social sciences (social theory, description, survey design)

• Machine learning (clustering, prediction, computation)

• Physics and applied math (agent-based models, emergent features)

• Statistics (statistical modeling, estimation and testing, design based inference)

4/15

Page 15: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Different perspectives on network analysis

• Social sciences (social theory, description, survey design)

• Machine learning (clustering, prediction, computation)

• Physics and applied math (agent-based models, emergent features)

• Statistics (statistical modeling, estimation and testing, design based inference)

4/15

Page 16: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Different perspectives on network analysis

• Social sciences (social theory, description, survey design)

• Machine learning (clustering, prediction, computation)

• Physics and applied math (agent-based models, emergent features)

• Statistics (statistical modeling, estimation and testing, design based inference)

4/15

Page 17: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Different perspectives on network analysis

• Social sciences (social theory, description, survey design)

• Machine learning (clustering, prediction, computation)

• Physics and applied math (agent-based models, emergent features)

• Statistics (statistical modeling, estimation and testing, design based inference)

4/15

Page 18: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 19: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 20: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 21: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 22: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 23: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 24: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 25: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 26: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 27: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Core areas of statistical network analysis

1. Statistical modeling: evaluation and fitting of network models.• Testing: evaluation of competing theories of network formation.• Estimation: evaluation of parameters in a presumed network model.• Description: summaries of main network patterns.• Prediction: prediction of missing or future network relations.

2. Design-based inference: Network inference from sampled data.• Design: survey and data-gathering procedures.• Inference: generalization of sample data to the full network.

5/15

Page 28: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 29: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 30: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 31: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 32: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 33: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 34: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 35: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

• “Add Health” - The National Longitudinal Study of Adolescent Health:

• A school-based study of adolescent health and social behaviors;• www.cpc.unc.edu/projects/addhealth.

• Data from 160 schools across the US:• The smallest had 69 adolescents in grades 7–12;• The largest had thousands of participants.

• Relational data:• Participants nominated and ranked up to 5 boys and 5 girls as friends.

6/15

Page 36: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

●●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

●●

●●

●●

●●

grade 7grade 8grade 9grade 10grade 11grade 12

Notice: Homophily by nodal attributes.

7/15

Page 37: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

●●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

●●

●●

●●

●●

grade 7grade 8grade 9grade 10grade 11grade 12

Notice: Homophily by nodal attributes.

7/15

Page 38: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

●●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

●●

●●

●●

●●

grade 7grade 8grade 9grade 10grade 11grade 12

●●

●●

● ●

●●

●●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

whiteblackhispanicother

Notice: Homophily by nodal attributes.

8/15

Page 39: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: AddHealth friendships

●●

●●

●●

● ●

●●

●●

● ●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

● ●

●●

●●

●●

● ●

●●

●●

malefemale

Question: Why might this plot be misleading?

9/15

Page 40: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Example: Protein interaction data

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

Notice: Network structure as compared to the friendship data.

10/15

Page 41: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 42: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 43: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 44: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 45: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 46: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 47: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 48: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 49: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Features of many social networks

In addition to associations to nodal and dyadic attributes, many networksexhibit the following features:

• Reciprocity of ties

• Degree heterogeneity in the propensity to form or receive ties• sociability• popularity

• Homophily by actor attributes

• higher propensity to form ties between actors with similar attributes• attributes may be observed or unobserved

• Transitivity of relationships• friends of friends have a higher propensity to be friends

• Balance of relationships• liking those who dislike whom you dislike

• Equivalence of nodes• some nodes may have identical or similar patterns of relationships

11/15

Page 50: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Dependent relational data

On notion of statistical dependence is as follows:

Dependence:Two outcomes are dependent if knowing one gives you information aboutthe other.

Exercise:How might network features give rise to statistical dependence?

Ubiquitous feature of network data:Statistical dependence among relational measurements.

12/15

Page 51: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Dependent relational data

On notion of statistical dependence is as follows:

Dependence:Two outcomes are dependent if knowing one gives you information aboutthe other.

Exercise:How might network features give rise to statistical dependence?

Ubiquitous feature of network data:Statistical dependence among relational measurements.

12/15

Page 52: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Dependent relational data

On notion of statistical dependence is as follows:

Dependence:Two outcomes are dependent if knowing one gives you information aboutthe other.

Exercise:How might network features give rise to statistical dependence?

Ubiquitous feature of network data:Statistical dependence among relational measurements.

12/15

Page 53: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Dependent relational data

On notion of statistical dependence is as follows:

Dependence:Two outcomes are dependent if knowing one gives you information aboutthe other.

Exercise:How might network features give rise to statistical dependence?

Ubiquitous feature of network data:Statistical dependence among relational measurements.

12/15

Page 54: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Data analysis goals

●●

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

How do network features drive our data analysis?

1. How can we describe features of social relations?(reciprocity/sociability/popularity/transitivity : descriptive statistics)

2. How can we identify nodes with similar network roles?(stochastic equivalence : node partitioning)

3. How do we relate the network to covariate information?(homophily : regression modeling)

13/15

Page 55: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Data analysis goals

●●

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

How do network features drive our data analysis?

1. How can we describe features of social relations?(reciprocity/sociability/popularity/transitivity : descriptive statistics)

2. How can we identify nodes with similar network roles?(stochastic equivalence : node partitioning)

3. How do we relate the network to covariate information?(homophily : regression modeling)

13/15

Page 56: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Data analysis goals

●●

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

How do network features drive our data analysis?

1. How can we describe features of social relations?(reciprocity/sociability/popularity/transitivity : descriptive statistics)

2. How can we identify nodes with similar network roles?(stochastic equivalence : node partitioning)

3. How do we relate the network to covariate information?(homophily : regression modeling)

13/15

Page 57: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Data analysis goals

●●

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

How do network features drive our data analysis?

1. How can we describe features of social relations?(reciprocity/sociability/popularity/transitivity : descriptive statistics)

2. How can we identify nodes with similar network roles?(stochastic equivalence : node partitioning)

3. How do we relate the network to covariate information?(homophily : regression modeling)

13/15

Page 58: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Data analysis goals

●●

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

How do network features drive our data analysis?

1. How can we describe features of social relations?(reciprocity/sociability/popularity/transitivity : descriptive statistics)

2. How can we identify nodes with similar network roles?(stochastic equivalence : node partitioning)

3. How do we relate the network to covariate information?(homophily : regression modeling)

13/15

Page 59: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Data analysis goals

●●

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

How do network features drive our data analysis?

1. How can we describe features of social relations?(reciprocity/sociability/popularity/transitivity : descriptive statistics)

2. How can we identify nodes with similar network roles?(stochastic equivalence : node partitioning)

3. How do we relate the network to covariate information?(homophily : regression modeling)

13/15

Page 60: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Inferential goals in the regression framework

yi,j measures i → j , xi,j is a vector of explanatory variables.

Y =

y1,1 y1,2 y1,3 NA y1,5 · · ·y2,1 y2,2 y2,3 y2,4 y2,5 · · ·y3,1 NA y3,3 y3,4 NA · · ·y4,1 y4,2 y4,3 y4,4 y4,5 · · ·

.

.

....

.

.

....

.

.

.

X =

x1,1 x1,2 x1,3 x1,4 x1,5 · · ·x2,1 x2,2 x2,3 x2,4 x2,5 · · ·x3,1 x3,2 x3,3 x3,4 x3,5 · · ·x4,1 x4,2 x4,3 x4,4 x4,5 · · ·

.

.

....

.

.

....

.

.

.

Consider a basic (generalized) linear model

yi,j ∼ βTxi,j + ei,j

A model can provide

• a measure of the association between X and Y: β, se(β)

• imputations of missing observations: p(y1,4|Y,X)

• a probabilistic description of network features: g(Y), Y ∼ p(Y|Y,X)

A recurring challenge will be to sufficiently account for dependence in the data.

14/15

Page 61: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Inferential goals in the regression framework

yi,j measures i → j , xi,j is a vector of explanatory variables.

Y =

y1,1 y1,2 y1,3 NA y1,5 · · ·y2,1 y2,2 y2,3 y2,4 y2,5 · · ·y3,1 NA y3,3 y3,4 NA · · ·y4,1 y4,2 y4,3 y4,4 y4,5 · · ·

.

.

....

.

.

....

.

.

.

X =

x1,1 x1,2 x1,3 x1,4 x1,5 · · ·x2,1 x2,2 x2,3 x2,4 x2,5 · · ·x3,1 x3,2 x3,3 x3,4 x3,5 · · ·x4,1 x4,2 x4,3 x4,4 x4,5 · · ·

.

.

....

.

.

....

.

.

.

Consider a basic (generalized) linear model

yi,j ∼ βTxi,j + ei,j

A model can provide

• a measure of the association between X and Y: β, se(β)

• imputations of missing observations: p(y1,4|Y,X)

• a probabilistic description of network features: g(Y), Y ∼ p(Y|Y,X)

A recurring challenge will be to sufficiently account for dependence in the data.

14/15

Page 62: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Inferential goals in the regression framework

yi,j measures i → j , xi,j is a vector of explanatory variables.

Y =

y1,1 y1,2 y1,3 NA y1,5 · · ·y2,1 y2,2 y2,3 y2,4 y2,5 · · ·y3,1 NA y3,3 y3,4 NA · · ·y4,1 y4,2 y4,3 y4,4 y4,5 · · ·

.

.

....

.

.

....

.

.

.

X =

x1,1 x1,2 x1,3 x1,4 x1,5 · · ·x2,1 x2,2 x2,3 x2,4 x2,5 · · ·x3,1 x3,2 x3,3 x3,4 x3,5 · · ·x4,1 x4,2 x4,3 x4,4 x4,5 · · ·

.

.

....

.

.

....

.

.

.

Consider a basic (generalized) linear model

yi,j ∼ βTxi,j + ei,j

A model can provide

• a measure of the association between X and Y: β, se(β)

• imputations of missing observations: p(y1,4|Y,X)

• a probabilistic description of network features: g(Y), Y ∼ p(Y|Y,X)

A recurring challenge will be to sufficiently account for dependence in the data.

14/15

Page 63: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Inferential goals in the regression framework

yi,j measures i → j , xi,j is a vector of explanatory variables.

Y =

y1,1 y1,2 y1,3 NA y1,5 · · ·y2,1 y2,2 y2,3 y2,4 y2,5 · · ·y3,1 NA y3,3 y3,4 NA · · ·y4,1 y4,2 y4,3 y4,4 y4,5 · · ·

.

.

....

.

.

....

.

.

.

X =

x1,1 x1,2 x1,3 x1,4 x1,5 · · ·x2,1 x2,2 x2,3 x2,4 x2,5 · · ·x3,1 x3,2 x3,3 x3,4 x3,5 · · ·x4,1 x4,2 x4,3 x4,4 x4,5 · · ·

.

.

....

.

.

....

.

.

.

Consider a basic (generalized) linear model

yi,j ∼ βTxi,j + ei,j

A model can provide

• a measure of the association between X and Y: β, se(β)

• imputations of missing observations: p(y1,4|Y,X)

• a probabilistic description of network features: g(Y), Y ∼ p(Y|Y,X)

A recurring challenge will be to sufficiently account for dependence in the data.

14/15

Page 64: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Inferential goals in the regression framework

yi,j measures i → j , xi,j is a vector of explanatory variables.

Y =

y1,1 y1,2 y1,3 NA y1,5 · · ·y2,1 y2,2 y2,3 y2,4 y2,5 · · ·y3,1 NA y3,3 y3,4 NA · · ·y4,1 y4,2 y4,3 y4,4 y4,5 · · ·

.

.

....

.

.

....

.

.

.

X =

x1,1 x1,2 x1,3 x1,4 x1,5 · · ·x2,1 x2,2 x2,3 x2,4 x2,5 · · ·x3,1 x3,2 x3,3 x3,4 x3,5 · · ·x4,1 x4,2 x4,3 x4,4 x4,5 · · ·

.

.

....

.

.

....

.

.

.

Consider a basic (generalized) linear model

yi,j ∼ βTxi,j + ei,j

A model can provide

• a measure of the association between X and Y: β, se(β)

• imputations of missing observations: p(y1,4|Y,X)

• a probabilistic description of network features: g(Y), Y ∼ p(Y|Y,X)

A recurring challenge will be to sufficiently account for dependence in the data.

14/15

Page 65: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 66: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 67: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 68: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 69: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 70: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 71: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 72: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 73: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 74: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 75: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 76: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 77: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 78: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 79: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 80: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 81: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15

Page 82: Introductionpdhoff/courses/567/Notes/l1_intro_paused.pdfIntroduction 567 Statistical analysis of social networks Peter Ho Statistics, University of Washington 1/15. Networks are relational

Course outline

1. Representations of relational data• matrix representations• graph representations

2. Descriptive statistics and summaries• matrix-based (row/column summaries, matrix decompositions)• graph-based (degrees, dyads, triads, paths)• covariates

3. Inference for complete relational data• model comparison via hypothesis testing• regression models• p1 and ERGM models• social relations model• latent variable models (random effects, latent factors and blockmodels)

4. Inference for incomplete or sampled relational data• sampling designs (link tracing, egocentric)• sample-based inference• model-based inference

5. Longitudinal and multivariate relational data

15/15