On llms-driven synthetic data generation, curation, and evaluation: A survey

L Long, R Wang, R **ao, J Zhao, X Ding… - arxiv preprint arxiv …, 2024 - arxiv.org
Within the evolving landscape of deep learning, the dilemma of data quantity and quality has
been a long-standing problem. The recent advent of Large Language Models (LLMs) offers …

[HTML][HTML] Information retrieval meets large language models: a strategic report from chinese ir community

Q Ai, T Bai, Z Cao, Y Chang, J Chen, Z Chen, Z Cheng… - AI Open, 2023 - Elsevier
The research field of Information Retrieval (IR) has evolved significantly, expanding beyond
traditional search to meet diverse user information needs. Recently, Large Language …

The goldilocks of pragmatic understanding: Fine-tuning strategy matters for implicature resolution by llms

L Ruis, A Khan, S Biderman, S Hooker… - Advances in …, 2024 - proceedings.neurips.cc
Despite widespread use of LLMs as conversational agents, evaluations of performance fail
to capture a crucial aspect of communication: interpreting language in context …

Large language models are not zero-shot communicators

LE Ruis, A Khan, S Biderman, S Hooker, T Rocktäschel… - 2022 - openreview.net
The recent success of large language models (LLMs) has drawn heavy attention and
investment in their use as conversational and embodied systems. Despite widespread use …

Shieldgemma: Generative ai content moderation based on gemma

W Zeng, Y Liu, R Mullins, L Peran, J Fernandez… - arxiv preprint arxiv …, 2024 - arxiv.org
We present ShieldGemma, a comprehensive suite of LLM-based safety content moderation
models built upon Gemma2. These models provide robust, state-of-the-art predictions of …

Kafa: Rethinking image ad understanding with knowledge-augmented feature adaptation of vision-language models

Z Jia, P Narayana, AR Akula, G Pruthi, H Su… - arxiv preprint arxiv …, 2023 - arxiv.org
Image ad understanding is a crucial task with wide real-world applications. Although highly
challenging with the involvement of diverse atypical scenes, real-world entities, and …

LLM aided semi-supervision for Extractive Dialog Summarization

N Mishra, G Sahu, I Calixto, A Abu-Hanna… - arxiv preprint arxiv …, 2023 - arxiv.org
Generating high-quality summaries for chat dialogs often requires large labeled datasets.
We propose a method to efficiently use unlabeled data for extractive summarization of …

Information Retrieval Performance in Text Generation using Knowledge from Generative Pre-trained Transformer (GPT-3)

KM Fitria - Jambura Journal of Mathematics, 2023 - ejurnal.ung.ac.id
The rise of advanced language models like GPT-3 and text generation has witnessed
remarkable progress. However, leveraging the vast amount of knowledge within these …

Advancing low resource information extraction and dialogue system using data efficient methods

B Ding - 2024 - dr.ntu.edu.sg
This thesis presents an extensive study aimed at improving the efficacy of language models
in situations characterized by limited data resources, a prevalent challenge in the field of …

Design of text generator application with OpenAI GPT-3

KM Fitria - Journal of Electrical Engineering and Computer …, 2023 - ejournal.unuja.ac.id
The increasing need for text content creation today challenges the development of systems
that can alleviate the need for text creation. Currently, text generation is done manually and …