Order Matters: Sequence to Sequence for Sets

I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper shows that the order in which input/output data sequences are organized matters significantly when learning an underlying model.

Reference Link to PDF of Paper

Date Created:
Last Edited:
1 9

Sequences have become first class citizens in supervised learning tasks thanks to the resurgence of recurrent neural networks. Many complex tasks that require mapping from or to a sequence of observations can now be formulated with the sequence-to-sequence framework which employs the chain rule to efficiently represent the joint probability of sequences. In many cases, however, variable sized inputs and/or outputs might not be naturally expressed as sequences. This paper shows that the order in which we organize input and/or output data matters significantly when learning an underlying model. The paper then discusses an extension of the seq2seq framework that goes beyond sequences and handles input sets in a principled way. The paper also proposes a loss which, by searching over possible orders during trraining, deals with the lack of structure of output sets.

Approaches to sequence-to-sequence (seq2seq) modeling involve reading the input completely using an encoder, which is either an LSTM when the input is a sequence, or a convolutional neural network for images. The final state of the encoder is then fed to a decoder LSTM whose purpose is to produce the target sequence, one token at a time. How should we represent data - inputs or outputs - for problems where an obvious order cannot be determined. This paper shows that order matters, and that there might be a better ordering than the natural ordering of a sequence, and it proposes two approaches to consider sets either as inputs and/or outputs in models and evaluate how they perform on various artificial and real datasets.

Comments

You have to be logged in to add a comment

User Comments

Insert Math Markup

ESC
About Inserting Math Content
Display Style:

Embed News Content

ESC
About Embedding News Content

Embed Youtube Video

ESC
Embedding Youtube Videos

Embed TikTok Video

ESC
Embedding TikTok Videos

Embed X Post

ESC
Embedding X Posts

Embed Instagram Post

ESC
Embedding Instagram Posts

Insert Details Element

ESC

Example Output:

Summary Title
You will be able to insert content here after confirming the title of the <details> element.

Insert Table

ESC
Customization
Align:
Preview:

Insert Horizontal Rule

#000000

Preview:


View Content At Different Sizes

ESC

Edit Style of Block Nodes

ESC

Edit the background color, default text color, margin, padding, and border of block nodes. Editable block nodes include paragraphs, headers, and lists.

#ffffff
#000000

Edit Selected Cells

Change the background color, vertical align, and borders of the cells in the current selection.

#ffffff
Vertical Align:
Border
#000000
Border Style:

Edit Table

ESC
Customization:
Align:

Upload Lexical State

ESC

Upload a .lexical file. If the file type matches the type of the current editor, then a preview will be shown below the file input.

Upload 3D Object

ESC

Upload Jupyter Notebook

ESC

Upload a Jupyter notebook and embed the resulting HTML in the text editor.

Insert Custom HTML

ESC

Edit Image Background Color

ESC
#ffffff

Insert Columns Layout

ESC
Column Type:

Select Code Language

ESC
Select Coding Language

Insert Chart

ESC

Use the search box below

Upload Previous Version of Article State

ESC