CS224N Research Highlight - Stanford University
TRANSCRIPT
CS224N Research Highlight
A Simple but Tough-to-beat Baseline for Sentence Embeddings
Sanjeev Arora, Yingyu Liang, Tengyu Ma (Princeton University)
Presenter: Danqi Chen
In submission to ICLR 2017
Word → Sentence?
linguistics = (0.286, 0.792, -0.177, -0.107, 0.109, -0.542, 0.349, 0.271)
Natural language processing is fun. = (-0.132, 1.129, 0.827, 0.110, -0.527, 0.156, 0.349, -0.286)
Sentence embedding
• Compute sentence similarity using the inner product:
S1: Mexico wishes to guarantee citizens' safety.
S2: Mexico wishes to avoid more violence. Score: 4 (/5)
S1: Iranians Vote in Presidential Election.
S2: Keita Wins Mali Presidential Election. Score: 0.4 (/5)
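A minimal sketch of this similarity computation, using made-up 4-dimensional sentence vectors (real embeddings would come from a sentence-embedding method such as the one this talk describes):

```python
import numpy as np

# Hypothetical sentence vectors for two sentences (toy values, not real embeddings).
s1 = np.array([0.2, -0.1, 0.5, 0.3])
s2 = np.array([0.1, -0.2, 0.4, 0.5])

# Inner-product similarity between the two sentence vectors.
inner = float(np.dot(s1, s2))

# Normalizing by the vector norms gives cosine similarity, the usual
# scale-invariant variant of the inner product.
cosine = inner / (np.linalg.norm(s1) * np.linalg.norm(s2))
```

Scores like the 0-to-5 similarity ratings above are typically obtained by mapping such a similarity value onto the dataset's scale.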
• Use as features for sentence classification (e.g., sentiment analysis):
• Bag-of-words (BoW) v(“natural language processing”) =
1/3 (v(“natural”) + v(“language”) + v(“processing”))
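The BoW averaging above can be sketched as follows; the 3-dimensional word vectors are toy values (a real system would load pretrained embeddings such as GloVe):

```python
import numpy as np

# Toy word vectors standing in for pretrained embeddings.
word_vecs = {
    "natural":    np.array([0.1, 0.4, -0.2]),
    "language":   np.array([0.3, -0.1, 0.5]),
    "processing": np.array([-0.1, 0.3, 0.0]),
}

def bow_embedding(words, vecs):
    """Bag-of-words sentence embedding: the unweighted average of word vectors."""
    return np.mean([vecs[w] for w in words], axis=0)

# v("natural language processing")
#   = 1/3 * (v("natural") + v("language") + v("processing"))
v = bow_embedding(["natural", "language", "processing"], word_vecs)
```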
From Bag-of-words to Complex Models…
• Recurrent neural networks, recursive neural networks, convolutional neural networks..
This paper
• A VERY SIMPLE unsupervised method
• weighted Bag-of-words + remove some special direction
• Step 1:
• Step 2:
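Based on the bullets above, a sketch of the paper's method (known as SIF): Step 1 averages word vectors with weight a/(a + p(w)) for each word w with estimated frequency p(w), and Step 2 removes each sentence's projection onto the first singular vector of the stacked embeddings (the "special direction"). The toy vectors and word frequencies below are made up for illustration:

```python
import numpy as np

def sif_embeddings(sentences, word_vecs, word_freq, a=1e-3):
    """Smooth Inverse Frequency (SIF) sentence embeddings.

    Step 1: per sentence, a weighted average of word vectors with
            weight a / (a + p(w)) for each word w.
    Step 2: remove each sentence's projection onto the first singular
            vector of the stacked embeddings (the common direction).
    """
    embs = []
    for sent in sentences:
        weights = [a / (a + word_freq[w]) for w in sent]
        vecs = np.array([word_vecs[w] for w in sent])
        embs.append(np.average(vecs, axis=0, weights=weights))
    X = np.array(embs)

    # First right singular vector of the (n_sentences x dim) matrix.
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    u = vt[0]

    # Subtract the component along u from every sentence embedding.
    return X - np.outer(X @ u, u)
```

Frequent words (high p(w)) get weights near zero, so they contribute little to the average, while rare, content-bearing words dominate.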