Selective Encoding for Abstractive Sentence Summarization
TRANSCRIPT
Selective Encoding for Abstractive Sentence Summarization
Qingyu Zhou, Nan Yang, Furu Wei and Ming Zhou
ACL 2017
Presenter: Kodaira Tomonori
Task: Abstractive sentence summarization
Input: a sentence. Output: a (shorter) sentence.
Figure 1.
Introduction
• This task differs from MT in two ways: 1. there is no explicit alignment relationship between input and output; 2. the task needs to keep the highlights and remove the unnecessary information.
Improvement
• Problem: In previous frameworks there is no explicit alignment relationship between the input sentence and the summary, except for extracted common words.
• Solution: Their method does not try to infer the alignment; instead, it selects the highlights while filtering out secondary information in the input.
Problem Formulation
• Input sentence: x = (x1, x2, …, xn), where xi ∈ Vs (the source vocabulary)
• Output summary: y = (y1, y2, …, yl), where l ≤ n
Model
Sentence Encoder
• bidirectional GRU
• The initial states are set to zero vectors.
• After reading the sentence, hidden states are concatenated.
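The encoder above can be sketched in NumPy. This is a minimal illustration under my own assumptions (parameter names like `Wz`/`Uz` and the `init_params` helper are not from the paper), not the authors' implementation:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x, h, p):
    # Standard GRU cell: update gate z, reset gate r, candidate state h~.
    z = sigmoid(p["Wz"] @ x + p["Uz"] @ h + p["bz"])
    r = sigmoid(p["Wr"] @ x + p["Ur"] @ h + p["br"])
    h_tilde = np.tanh(p["Wh"] @ x + p["Uh"] @ (r * h) + p["bh"])
    return (1.0 - z) * h + z * h_tilde

def init_params(d_in, d_hid, rng):
    # Hypothetical random initialization, just to make the sketch runnable.
    p = {}
    for g in ("z", "r", "h"):
        p["W" + g] = rng.standard_normal((d_hid, d_in)) * 0.1
        p["U" + g] = rng.standard_normal((d_hid, d_hid)) * 0.1
        p["b" + g] = np.zeros(d_hid)
    return p

def bigru_encode(xs, fwd, bwd, d_hid):
    # Initial states are zero vectors; read the sentence in both
    # directions and concatenate forward/backward states per position.
    n = len(xs)
    hf, hb = np.zeros(d_hid), np.zeros(d_hid)
    fwd_states, bwd_states = [], [None] * n
    for x in xs:
        hf = gru_step(x, hf, fwd)
        fwd_states.append(hf)
    for i in range(n - 1, -1, -1):
        hb = gru_step(xs[i], hb, bwd)
        bwd_states[i] = hb
    return [np.concatenate([f, b]) for f, b in zip(fwd_states, bwd_states)]
```

Each returned vector h_i has twice the hidden size, since it stacks the forward and backward reading of the same position.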
Selective Mechanism
• Sentence representation: s = [h_backward_1 ; h_forward_n]
• sGate_i = σ(W_s h_i + U_s s + b)
• h'_i = h_i ⊙ sGate_i
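The selective gate can be sketched as follows (a NumPy illustration, assuming each encoder state row stacks the forward half before the backward half; shapes and names are my assumptions):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def selective_encode(H, Ws, Us, b):
    # H: (n, 2d) concatenated BiGRU states, one row per source word,
    # laid out as [forward_i ; backward_i].
    n, two_d = H.shape
    d = two_d // 2
    # Sentence representation s = [h_backward_1 ; h_forward_n]:
    # the backward state at position 1 and the forward state at position n
    # have each read the whole sentence.
    s = np.concatenate([H[0, d:], H[-1, :d]])
    # sGate_i = sigma(Ws h_i + Us s + b), applied row-wise.
    gates = sigmoid(H @ Ws.T + (Us @ s) + b)
    # h'_i = h_i (elementwise) sGate_i
    return H * gates, gates
```

The gate lies in (0, 1) per dimension, so it softly scales down the parts of each word's representation that the sentence-level vector deems secondary.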
Summary Decoder
• GRU: s_t = GRU(w_{t-1}, c_{t-1}, s_{t-1}), with s_0 = tanh(W_d h_backward_1 + b)
• Attention: e_{t,i} = v_a^T tanh(W_a s_{t-1} + U_a h'_i); a_{t,i} = exp(e_{t,i}) / Σ_{i=1}^n exp(e_{t,i}); c_t = Σ_{i=1}^n a_{t,i} h'_i
• Predict: r_t = W_r w_{t-1} + U_r c_t + V_r s_t; maxout m_t = [max{r_{t,2j-1}, r_{t,2j}}]^T for j = 1, …, d; p(y_t | y_1, …, y_{t-1}) = softmax(W_o m_t)
• Notation: w_t: word embedding; c_t: context vector; s_t: decoder hidden state; h'_i: selected encoder state; r_t: readout state
Objective Function
• J(θ) = −(1/|D|) Σ_{(x,y)∈D} log p(y|x), where D is a set of parallel sentence-summary pairs
• Optimizer: stochastic gradient descent
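Concretely, the objective averages the negative sequence log-likelihoods over the training pairs; each sequence log-likelihood is the sum of per-token log-probabilities. A small sketch (my own helper names, for illustration only):

```python
import numpy as np

def sequence_log_prob(token_dists, target_ids):
    # log p(y|x) = sum_t log p(y_t | y_<t, x), where token_dists[t] is the
    # decoder's output distribution at step t and target_ids[t] the gold token.
    return float(sum(np.log(dist[t]) for dist, t in zip(token_dists, target_ids)))

def objective(pairs):
    # J(theta) = -(1/|D|) * sum_{(x,y) in D} log p(y|x)
    return -np.mean([sequence_log_prob(dists, y) for dists, y in pairs])
```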
Dataset
• Training set: English Gigaword dataset (Napoles et al., 2012); training: 3.8M sentence-summary pairs; development: 189K
• Test sets: 1. English Gigaword; 2. DUC 2004; 3. MSR Abstractive Text Compression
Data Statistics
Table 2
Evaluation Metric
• ROUGE (Lin, 2004): ROUGE-1, ROUGE-2, ROUGE-L
Implementation Details
• Parameters: embedding size 300; GRU hidden state size 512; dropout (Srivastava et al., 2014) with p = 0.5
• Training: Adam (α = 0.001, β1 = 0.9, β2 = 0.999); gradient clipping to [-5, 5]
• Beam search: beam size 12
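Decoding with beam size 12 keeps the 12 highest-scoring partial summaries at each step. A generic beam-search sketch (the `step_fn` interface and toy setup are my assumptions, not the authors' code):

```python
import numpy as np

def beam_search(step_fn, start_id, eos_id, beam_size=12, max_len=20):
    # step_fn(prefix) -> log-prob vector over the vocabulary for the
    # next token. Returns the best (sequence, total log-prob) found.
    beams = [([start_id], 0.0)]
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            logp = step_fn(seq)
            for t in np.argsort(logp)[-beam_size:]:
                candidates.append((seq + [int(t)], score + float(logp[t])))
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for seq, score in candidates[:beam_size]:
            # Sequences that emitted EOS are set aside as complete.
            (finished if seq[-1] == eos_id else beams).append((seq, score))
        if not beams:
            break
    return max(finished + beams, key=lambda c: c[1])
```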
Baselines
• ABS (Rush et al., 2015)
• ABS+ (Rush et al., 2015)
• CAs2s (Chopra et al., 2016)
• Feats2s (Nallapati et al., 2016)
• Luong-NMT (Luong et al., 2015)
• s2s+att: they also implement a sequence-to-sequence model with attention
English Gigaword
Table 3
DUC 2004
Table 4
MSR-ATC
Figure 5
Saliency Heat Map of Selective Gate
• They use the method of Li et al. (2016) to visualize the contribution of the selective gate to the final output.
• They approximate S_y(g) by its first-order Taylor expansion.
• They draw the Euclidean norm of the first derivative of the output y with respect to the selective gate g associated with each input word.
Figure 3
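The steps above can be sketched numerically: approximate the Jacobian dy/dg by finite differences and report the per-gate Euclidean norm as the saliency score (the real method uses backpropagated gradients; this finite-difference stand-in is my own simplification):

```python
import numpy as np

def saliency(output_fn, g, eps=1e-5):
    # First-order Taylor: S_y(g) is approximated by |dy/dg|.
    # Perturb each gate coordinate, measure the change in the output y,
    # and take the Euclidean norm of that change per coordinate.
    y0 = output_fn(g)
    scores = np.zeros(len(g))
    for i in range(len(g)):
        gp = g.copy()
        gp[i] += eps
        scores[i] = np.linalg.norm((output_fn(gp) - y0) / eps)
    return scores
```

Words whose gates barely move the output get low scores, which is what the heat map in Figure 3 visualizes.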
Conclusion
• The paper proposes a selective encoding model for abstractive sentence summarization.
• It greatly improves summarization quality on all three test sets.