Distributed optimization and statistical learning via the alternating direction method of multipliers

S Boyd, N Parikh, E Chu, B Peleato… - … and Trends® in …, 2011 - nowpublishers.com
Many problems of recent interest in statistics and machine learning can be posed in the
framework of convex optimization. Due to the explosion in size and complexity of modern …

Theseus: A library for differentiable nonlinear optimization

L Pineda, T Fan, M Monge… - Advances in …, 2022 - proceedings.neurips.cc
We present Theseus, an efficient application-agnostic open source library for differentiable
nonlinear least squares (DNLS) optimization built on PyTorch, providing a common …

A tutorial on dual decomposition and lagrangian relaxation for inference in natural language processing

AM Rush, MJ Collins - Journal of Artificial Intelligence Research, 2012 - jair.org
Dual decomposition, and more generally Lagrangian relaxation, is a classical method for
combinatorial optimization; it has recently been applied to several inference problems in …

Structured pruning of large language models

Z Wang, J Wohlwend, T Lei - arxiv preprint arxiv:1910.04732, 2019 - arxiv.org
Large language models have recently achieved state of the art performance across a wide
variety of natural language tasks. Meanwhile, the size of these models and their latency …

Joint extraction of events and entities within a document context

B Yang, T Mitchell - arxiv preprint arxiv:1609.03632, 2016 - arxiv.org
Events and entities are closely related; entities are often actors or participants in events and
events without entities are uncommon. The interpretation of events and entities is highly …

Frame-semantic parsing

D Das, D Chen, AFT Martins, N Schneider… - Computational …, 2014 - direct.mit.edu
Frame semantics is a linguistic theory that has been instantiated for English in the FrameNet
lexicon. We solve the problem of frame-semantic parsing using a two-stage statistical model …

A comparative study of modern inference techniques for structured discrete energy minimization problems

JH Kappes, B Andres, FA Hamprecht, C Schnörr… - International Journal of …, 2015 - Springer
Szeliski et al. published an influential study in 2006 on energy minimization methods for
Markov random fields. This study provided valuable insights in choosing the best …

Online alternating direction method (longer version)

H Wang, A Banerjee - arxiv preprint arxiv:1306.3721, 2013 - arxiv.org
Online optimization has emerged as powerful tool in large scale optimization. In this pa-per,
we introduce efficient online optimization algorithms based on the alternating direction …

End-to-end learning for structured prediction energy networks

D Belanger, B Yang… - … Conference on Machine …, 2017 - proceedings.mlr.press
Abstract Structured Prediction Energy Networks (SPENs) are a simple, yet expressive family
of structured prediction models (Belanger and McCallum, 2016). An energy function over …

A comparative study of modern inference techniques for discrete energy minimization problems

J Kappes, B Andres, F Hamprecht… - Proceedings of the …, 2013 - openaccess.thecvf.com
Seven years ago, Szeliski et al. published an influential study on energy minimization
methods for Markov random fields (MRF). This study provided valuable insights in choosing …