Graph neural networks: foundation, frontiers and applications
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …
recent years. Graph neural networks, also known as deep learning on graphs, graph …
Language-agnostic BERT sentence embedding
While BERT is an effective method for learning monolingual sentence embeddings for
semantic similarity and embedding based transfer learning (Reimers and Gurevych, 2019) …
semantic similarity and embedding based transfer learning (Reimers and Gurevych, 2019) …
Multilingual universal sentence encoder for semantic retrieval
We introduce two pre-trained retrieval focused multilingual sentence encoding models,
respectively based on the Transformer and CNN model architectures. The models embed …
respectively based on the Transformer and CNN model architectures. The models embed …
Efficient intent detection with dual sentence encoders
Building conversational systems in new domains and with added functionality requires
resource-efficient models that work under low-data regimes (ie, in few-shot setups) …
resource-efficient models that work under low-data regimes (ie, in few-shot setups) …
ConveRT: Efficient and accurate conversational representations from transformers
General-purpose pretrained sentence encoders such as BERT are not ideal for real-world
conversational AI applications; they are computationally heavy, slow, and expensive to train …
conversational AI applications; they are computationally heavy, slow, and expensive to train …
Privacy issues in large language models: A survey
S Neel, P Chang - arxiv preprint arxiv:2312.06717, 2023 - arxiv.org
This is the first survey of the active area of AI research that focuses on privacy issues in
Large Language Models (LLMs). Specifically, we focus on work that red-teams models to …
Large Language Models (LLMs). Specifically, we focus on work that red-teams models to …
Argument mining on Twitter: A survey
In the last decade, the field of argument mining has grown notably. However, only relatively
few studies have investigated argumentation in social media and specifically on Twitter …
few studies have investigated argumentation in social media and specifically on Twitter …
Mixed negative sampling for learning two-tower neural networks in recommendations
Learning query and item representations is important for building large scale
recommendation systems. In many real applications where there is a huge catalog of items …
recommendation systems. In many real applications where there is a huge catalog of items …
Are pre-trained convolutions better than pre-trained transformers?
In the era of pre-trained language models, Transformers are the de facto choice of model
architectures. While recent research has shown promise in entirely convolutional, or CNN …
architectures. While recent research has shown promise in entirely convolutional, or CNN …
Improving multilingual sentence embedding using bi-directional dual encoder with additive margin softmax
In this paper, we present an approach to learn multilingual sentence embeddings using a bi-
directional dual-encoder with additive margin softmax. The embeddings are able to achieve …
directional dual-encoder with additive margin softmax. The embeddings are able to achieve …