Statistical Optimal Transport via Factored Couplings

  • Aden Forrow
  • , Jan Christian Hütter
  • , Mor Nitzan
  • , Philippe Rigollet
  • , Geoffey Schiebinger
  • , Jonathan Weed

Research output: Contribution to journalConference articlepeer-review

6 Scopus citations

Abstract

We propose a new method to estimate Wasserstein distances and optimal transport plans between two probability distributions from samples in high dimension. Unlike plug-in rules that simply replace the true distri-butions by their empirical counterparts, our method promotes couplings with low trans-port rank, a new structural assumption that is similar to the nonnegative rank of a ma-trix. Regularizing based on this assump-tion leads to drastic improvements on high-dimensional data for various tasks, includ-ing domain adaptation in single-cell RNA sequencing data. These findings are sup-ported by a theoretical analysis that indicates that the transport rank is key in overcoming the curse of dimensionality inherent to data-driven optimal transport.

Original languageEnglish
Pages (from-to)2454-2465
Number of pages12
JournalProceedings of Machine Learning Research
Volume89
StatePublished - 2019
Externally publishedYes
Event22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 - Naha, Japan
Duration: 16 Apr 201918 Apr 2019

Bibliographical note

Publisher Copyright:
© 2019 by the author(s).

Fingerprint

Dive into the research topics of 'Statistical Optimal Transport via Factored Couplings'. Together they form a unique fingerprint.

Cite this