Graph neural networks: foundation, frontiers and applications

L Wu, P Cui, J Pei, L Zhao, X Guo - … of the 28th ACM SIGKDD Conference …, 2022 - dl.acm.org
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …

Language-agnostic BERT sentence embedding

F Feng, Y Yang, D Cer, N Arivazhagan… - arxiv preprint arxiv …, 2020 - arxiv.org
While BERT is an effective method for learning monolingual sentence embeddings for
semantic similarity and embedding based transfer learning (Reimers and Gurevych, 2019) …

Multilingual universal sentence encoder for semantic retrieval

Y Yang, D Cer, A Ahmad, M Guo, J Law… - arxiv preprint arxiv …, 2019 - arxiv.org
We introduce two pre-trained retrieval focused multilingual sentence encoding models,
respectively based on the Transformer and CNN model architectures. The models embed …

Efficient intent detection with dual sentence encoders

I Casanueva, T Temčinas, D Gerz… - arxiv preprint arxiv …, 2020 - arxiv.org
Building conversational systems in new domains and with added functionality requires
resource-efficient models that work under low-data regimes (ie, in few-shot setups) …

ConveRT: Efficient and accurate conversational representations from transformers

M Henderson, I Casanueva, N Mrkšić, PH Su… - arxiv preprint arxiv …, 2019 - arxiv.org
General-purpose pretrained sentence encoders such as BERT are not ideal for real-world
conversational AI applications; they are computationally heavy, slow, and expensive to train …

Privacy issues in large language models: A survey

S Neel, P Chang - arxiv preprint arxiv:2312.06717, 2023 - arxiv.org
This is the first survey of the active area of AI research that focuses on privacy issues in
Large Language Models (LLMs). Specifically, we focus on work that red-teams models to …

Argument mining on Twitter: A survey

R Schaefer, M Stede - it-Information Technology, 2021 - degruyter.com
In the last decade, the field of argument mining has grown notably. However, only relatively
few studies have investigated argumentation in social media and specifically on Twitter …

Mixed negative sampling for learning two-tower neural networks in recommendations

J Yang, X Yi, D Zhiyuan Cheng, L Hong, Y Li… - … proceedings of the web …, 2020 - dl.acm.org
Learning query and item representations is important for building large scale
recommendation systems. In many real applications where there is a huge catalog of items …

Are pre-trained convolutions better than pre-trained transformers?

Y Tay, M Dehghani, J Gupta, D Bahri… - arxiv preprint arxiv …, 2021 - arxiv.org
In the era of pre-trained language models, Transformers are the de facto choice of model
architectures. While recent research has shown promise in entirely convolutional, or CNN …

Improving multilingual sentence embedding using bi-directional dual encoder with additive margin softmax

Y Yang, GH Abrego, S Yuan, M Guo, Q Shen… - arxiv preprint arxiv …, 2019 - arxiv.org
In this paper, we present an approach to learn multilingual sentence embeddings using a bi-
directional dual-encoder with additive margin softmax. The embeddings are able to achieve …