University of Sussex
Browse
1/1
2 files

Low-variance black-box gradient estimates for the Plackett-Luce distribution

conference contribution
posted on 2023-06-07, 06:50 authored by Artyom Gadetsky, Kirill Struminsky, Christopher Robinson, Novi QuadriantoNovi Quadrianto, Dmitry Vetrov
Learning models with discrete latent variables using stochastic gradient descent remains a challenge due to the high variance of gradient estimates. Modern variance reduction techniques mostly consider categorical distributions and have limited applicability when the number of possible outcomes becomes large. In this work, we consider models with latent permutations and propose control variates for the Plackett-Luce distribution. In particular, the control variates allow us to optimize black-box functions over permutations using stochastic gradient descent. To illustrate the approach, we consider a variety of causal structure learning tasks for continuous and discrete data. We show that our method outperforms competitive relaxation-based optimization methods and is also applicable to non-differentiable score functions.

Funding

EthicalML: Injecting Ethical and Legal Constraints into Machine Learning Models; G2034; EPSRC-ENGINEERING & PHYSICAL SCIENCES RESEARCH COUNCIL

History

Publication status

  • Published

File Version

  • Accepted version

Journal

Thirty-Fourth AAAI Conference on Artificial Intelligence

ISSN

2159-5399

Publisher

AAAI

Issue

6

Volume

34

Page range

10126-10135

Event name

Thirty-Fourth AAAI Conference on Artificial Intelligence

Event location

Hilton New York Midtown, New York, New York, USA

Event type

conference

Event date

February 7-12 2020

ISBN

9781577358350

Department affiliated with

  • Informatics Publications

Research groups affiliated with

  • Data Science Research Group Publications

Full text available

  • Yes

Peer reviewed?

  • Yes

Legacy Posted Date

2019-11-25

First Open Access (FOA) Date

2021-01-19

First Compliant Deposit (FCD) Date

2019-11-25

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC