Distributed Representations of Words and Phrases and their Compositionality
Abdullah Khan Zehady
Neural Word Embedding
● Continuous vector space representation
  o Words represented as dense real-valued vectors in R^d
● Distributed word representation ↔ Word Embedding
  o Embed an entire vocabulary into a relatively low-dimensional linear space where dimensions are latent continuous features.
● Classical n-gram model works in terms of discrete units
  o No inherent relationship between units in an n-gram model.
● In contrast, word embeddings capture regularities and relationships between words.
Syntactic & Semantic Relationship
Regularities are observed as a constant offset vector between pairs of words sharing some relationship (a small vector-arithmetic sketch follows the examples below).
Gender Relation: KING - QUEEN ~ MAN - WOMAN
Singular/Plural Relation: KING - KINGS ~ QUEEN - QUEENS
Other Relations:
Language: France - French ~ Spain - Spanish
Past Tense: Go - Went ~ Capture - Captured
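As an illustration of the offset regularity, the snippet below answers an analogy question ("man is to king as woman is to ?") with plain vector arithmetic and cosine similarity. It is a minimal sketch: the `vectors` dictionary here holds random placeholder embeddings, not actual trained vectors.

```python
import numpy as np

def cosine(a, b):
    # Cosine similarity between two vectors.
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

def analogy(a, b, c, vectors):
    # Return the word whose vector is closest to vec(b) - vec(a) + vec(c),
    # e.g. analogy("man", "king", "woman", ...) should rank "queen" first
    # when the embeddings capture the gender offset.
    target = vectors[b] - vectors[a] + vectors[c]
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(vectors[w], target))

# Hypothetical embeddings (random, purely for illustration).
rng = np.random.default_rng(0)
vectors = {w: rng.normal(size=50)
           for w in ["king", "queen", "man", "woman", "spain", "spanish"]}
print(analogy("man", "king", "woman", vectors))
```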
Neural Net Language Model (LM)
Different models for estimating continuous representations of words:
● Latent Semantic Analysis (LSA)
● Latent Dirichlet Allocation (LDA)
● Neural Network Language Model (NNLM)
Feed Forward NNLM
● Consists of input, projection, hidden and output layers.
● N previous words are encoded using 1-of-V coding, where V is the size of the vocabulary. Ex: A = (1,0,...,0), B = (0,1,...,0), ..., Z = (0,0,...,1) in R^26 (see the sketch below).
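A quick sketch of the 1-of-V coding mentioned above; the toy vocabulary and word order are made up for illustration.

```python
import numpy as np

def one_hot(word, vocab):
    # 1-of-V coding: a V-dimensional vector with a single 1 at the word's index.
    vec = np.zeros(len(vocab))
    vec[vocab.index(word)] = 1.0
    return vec

vocab = ["A", "B", "C", "Z"]      # toy vocabulary, V = 4
print(one_hot("B", vocab))        # [0. 1. 0. 0.]
```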
● NNLM becomes computationally complex between the projection (P) and hidden (H) layers.
● For N = 10, size of P = 500-2000, size of H = 500-1000. The hidden layer is used to compute a probability distribution over all the words in the vocabulary V.
● Hierarchical softmax comes to the rescue (a rough complexity comparison follows below).
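To see why the output layer dominates, the per-example training cost of the feed-forward NNLM is roughly Q = N×D + N×D×H + H×V (projection, hidden, and output terms); hierarchical softmax replaces the H×V term with about H×log2(V). A back-of-the-envelope calculation using the sizes above (the vocabulary size is an assumed example value):

```python
import math

# context size, embedding dim, hidden size, vocabulary size (assumed example)
N, D, H, V = 10, 500, 500, 1_000_000

full_softmax = N * D + N * D * H + H * V
hier_softmax = N * D + N * D * H + H * math.ceil(math.log2(V))

print(full_softmax)   # 502,505,000 multiplications per example
print(hier_softmax)   # 2,515,000 -- the output term shrinks from H*V to ~H*log2(V)
```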
Recurrent NNLM
● No projection layer; consists of input, hidden and output layers only.
● No need to specify the context length as in the feed-forward NNLM.
● What is special in the RNN model? The recurrent matrix that connects the hidden layer to itself.
Recurrent NNLM
● w(t): input word at time t
● y(t): output layer, produces a probability distribution over words
● s(t): hidden layer
● U: each column represents a word
● The RNN is trained with backpropagation to maximize the log likelihood (a one-step forward sketch follows below).
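A minimal sketch of one forward step of the recurrent NNLM described above: s(t) = sigmoid(U·w(t) + W·s(t-1)) and y(t) = softmax(V·s(t)). Dimensions and initialization are arbitrary toy values, not the authors' implementation.

```python
import numpy as np

V_size, H = 10, 8                                 # toy vocabulary and hidden sizes
rng = np.random.default_rng(0)
U = rng.normal(scale=0.1, size=(H, V_size))       # input -> hidden (each column acts as a word vector)
W = rng.normal(scale=0.1, size=(H, H))            # hidden -> hidden (the recurrent matrix)
Vout = rng.normal(scale=0.1, size=(V_size, H))    # hidden -> output

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def step(word_index, s_prev):
    # w(t) is 1-of-V, so multiplying U by it just selects a column of U.
    s_t = 1.0 / (1.0 + np.exp(-(U[:, word_index] + W @ s_prev)))   # hidden state s(t)
    y_t = softmax(Vout @ s_t)                                      # prob. dist. over words y(t)
    return s_t, y_t

s = np.zeros(H)
for w in [0, 3, 7]:        # a toy word-index sequence
    s, y = step(w, s)
print(y.round(3))
```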
Continuous Bag of Words Model (CBOW)
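CBOW predicts the current word from the average of the surrounding context word vectors. Below is a minimal sketch using a full softmax output (in practice the paper uses hierarchical softmax or negative sampling); all sizes are toy values.

```python
import numpy as np

V, D = 12, 6                                   # toy vocabulary size and embedding dimension
rng = np.random.default_rng(1)
W_in = rng.normal(scale=0.1, size=(V, D))      # input (context) embeddings
W_out = rng.normal(scale=0.1, size=(V, D))     # output (target) embeddings

def cbow_probs(context_ids):
    h = W_in[context_ids].mean(axis=0)         # average the context word vectors
    scores = W_out @ h                         # score every vocabulary word
    e = np.exp(scores - scores.max())
    return e / e.sum()                         # softmax over the vocabulary

p = cbow_probs([2, 5, 7, 9])                   # context words around the center word
print(p.argmax(), p.max())
```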
Hierarchical Softmax
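For reference, the paper defines the hierarchical softmax probability of a word w given the input word w_I as a product of sigmoid decisions along the binary-tree path from the root to w:

```latex
p(w \mid w_I) = \prod_{j=1}^{L(w)-1}
  \sigma\!\Big( [\![\, n(w, j{+}1) = \mathrm{ch}(n(w,j)) \,]\!] \cdot
  {v'_{n(w,j)}}^{\top} v_{w_I} \Big)
```

where n(w, j) is the j-th node on the path from the root to w, L(w) is the path length, ch(n) is an arbitrary fixed child of n, and [[x]] is 1 if x is true and -1 otherwise. The cost of evaluating one output word drops from V to about log2(V) inner-product computations.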
Negative Sampling
Negative Sampling
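Negative sampling replaces the full softmax objective: for each observed (input, output) pair (w_I, w_O), the model learns to distinguish the true context word from k words drawn from a noise distribution. The paper's per-pair objective is:

```latex
\log \sigma\!\big({v'_{w_O}}^{\top} v_{w_I}\big)
+ \sum_{i=1}^{k} \mathbb{E}_{w_i \sim P_n(w)}
  \Big[ \log \sigma\!\big(-{v'_{w_i}}^{\top} v_{w_I}\big) \Big]
```

with k in the range 5-20 for small training sets and 2-5 for large ones, and the noise distribution P_n(w) taken as the unigram distribution raised to the 3/4 power.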
Subsampling of Frequent Words
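Each training word w_i is discarded with probability P(w_i) = 1 - sqrt(t / f(w_i)), where f(w_i) is the word's frequency and t is a threshold (around 10^-5 in the paper). A minimal sketch of that filter, with made-up toy frequencies:

```python
import random

def keep(word, freq, t=1e-5):
    # freq[word]: fraction of the corpus made up of this word.
    f = freq[word]
    p_discard = max(0.0, 1.0 - (t / f) ** 0.5)   # P(w_i) = 1 - sqrt(t / f(w_i))
    return random.random() >= p_discard

freq = {"the": 0.05, "aardvark": 1e-6}           # toy unigram frequencies
corpus = ["the", "aardvark", "the", "the"]
print([w for w in corpus if keep(w, freq)])      # frequent words are mostly dropped
```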
Skip-gram Model
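The skip-gram model uses each word to predict the words in a window of size c around it; training maximizes the average log probability:

```latex
\frac{1}{T} \sum_{t=1}^{T} \; \sum_{\substack{-c \le j \le c \\ j \neq 0}}
  \log p(w_{t+j} \mid w_t)
```

where p(w_O | w_I) is defined by a softmax over the output word vectors, approximated in practice with hierarchical softmax or negative sampling as above.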
Empirical Result
Skip-gram Model
Learning Phrases
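Phrases are found with a simple data-driven pass over the corpus: bigrams whose words occur together much more often than chance are merged into single tokens. The paper scores a bigram as score(w_i, w_j) = (count(w_i w_j) - δ) / (count(w_i) · count(w_j)), with a discounting coefficient δ that suppresses very infrequent pairs. A minimal sketch (the δ and threshold values are illustrative, not the paper's settings):

```python
from collections import Counter

def find_phrases(tokens, delta=5, threshold=1e-4):
    # Count unigrams and adjacent bigrams in one pass over the corpus.
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    phrases = set()
    for (a, b), n_ab in bigrams.items():
        score = (n_ab - delta) / (unigrams[a] * unigrams[b])
        if score > threshold:        # merge "a b" into a single token "a_b"
            phrases.add((a, b))
    return phrases

tokens = ["new", "york", "times", "new", "york", "is", "big"] * 10
print(find_phrases(tokens))          # bigrams scoring above the threshold
```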
Phrase Skip-gram Results
Additive compositionality
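Because skip-gram vectors are trained to predict their contexts more or less linearly, element-wise addition of two word vectors tends to land near words that co-occur with both; the paper's example is vec("Russia") + vec("river") being close to vec("Volga River"). A minimal sketch of that check, assuming a dictionary of pre-trained vectors (random placeholders here):

```python
import numpy as np

def closest(query, vectors, exclude=()):
    # Return the word whose vector has the highest cosine similarity to `query`.
    def cos(a, b):
        return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return max((w for w in vectors if w not in exclude),
               key=lambda w: cos(vectors[w], query))

rng = np.random.default_rng(2)
vectors = {w: rng.normal(size=50)              # hypothetical pre-trained embeddings
           for w in ["russia", "river", "volga_river", "moscow", "nile"]}

query = vectors["russia"] + vectors["river"]   # element-wise addition
print(closest(query, vectors, exclude=("russia", "river")))
```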
Compare with published word representations