Machine learning methods that economists should know about

S Athey, GW Imbens - Annual Review of Economics, 2019 - annualreviews.org
We discuss the relevance of the recent machine learning (ML) literature for economics and
econometrics. First we discuss the differences in goals, methods, and settings between the …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

A brief overview of universal sentence representation methods: A linguistic view

R Li, X Zhao, MF Moens - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
How to transfer the semantic information in a sentence to a computable numerical
embedding form is a fundamental problem in natural language processing. An informative …

Network embedding as matrix factorization: Unifying deepwalk, line, pte, and node2vec

J Qiu, Y Dong, H Ma, J Li, K Wang, J Tang - Proceedings of the eleventh …, 2018 - dl.acm.org
Since the invention of word2vec, the skip-gram model has significantly advanced the
research of network embedding, such as the recent emergence of the DeepWalk, LINE, PTE …

Inference via interpolation: Contrastive representations provably enable planning and inference

B Eysenbach, V Myers… - Advances in Neural …, 2025 - proceedings.neurips.cc
Given time series data, how can we answer questions like what will happen in the
future?''and how did we get here?''These sorts of probabilistic inference questions are …

Unsupervised learning of sentence embeddings using compositional n-gram features

M Pagliardini, P Gupta, M Jaggi - arxiv preprint arxiv:1703.02507, 2017 - arxiv.org
The recent tremendous success of unsupervised word embeddings in a multitude of
applications raises the obvious question if similar methods could be derived to improve …

Diffusionshield: A watermark for copyright protection against generative diffusion models

Y Cui, J Ren, H Xu, P He, H Liu, L Sun, Y **ng… - arxiv preprint arxiv …, 2023 - arxiv.org
Recently, Generative Diffusion Models (GDMs) have showcased their remarkable
capabilities in learning and generating images. A large community of GDMs has naturally …

An introduction to neural information retrieval

B Mitra, N Craswell - Foundations and Trends® in Information …, 2018 - nowpublishers.com
Neural ranking models for information retrieval (IR) use shallow or deep neural networks to
rank search results in response to a query. Traditional learning to rank models employ …

Poincar\'e glove: Hyperbolic word embeddings

A Tifrea, G Bécigneul, OE Ganea - arxiv preprint arxiv:1810.06546, 2018 - arxiv.org
Words are not created equal. In fact, they form an aristocratic graph with a latent hierarchical
structure that the next generation of unsupervised learned word embeddings should reveal …

Less: Selecting influential data for targeted instruction tuning

M **a, S Malladi, S Gururangan, S Arora… - arxiv preprint arxiv …, 2024 - arxiv.org
Instruction tuning has unlocked powerful capabilities in large language models (LLMs),
effectively using combined datasets to develop generalpurpose chatbots. However, real …