A survey of controllable text generation using transformer-based pre-trained language models

H Zhang, H Song, S Li, M Zhou, D Song - ACM Computing Surveys, 2023‏ - dl.acm.org
Controllable Text Generation (CTG) is an emerging area in the field of natural language
generation (NLG). It is regarded as crucial for the development of advanced text generation …

Cross-modal text and visual generation: A systematic review. Part 1: Image to text

M Żelaszczyk, J Mańdziuk - Information Fusion, 2023‏ - Elsevier
We review the existing literature on generating text from visual data under the cross-modal
generation umbrella, which affords us to compare and contrast various approaches taking …

Neural natural language generation: A survey on multilinguality, multimodality, controllability and learning

E Erdem, M Kuyu, S Yagcioglu, A Frank… - Journal of Artificial …, 2022‏ - jair.org
Develo** artificial learning systems that can understand and generate natural language
has been one of the long-standing goals of artificial intelligence. Recent decades have …

Emotional video captioning with vision-based emotion interpretation network

P Song, D Guo, X Yang, S Tang… - IEEE Transactions on …, 2024‏ - ieeexplore.ieee.org
Effectively summarizing and re-expressing video content by natural languages in a more
human-like fashion is one of the key topics in the field of multimedia content understanding …

Memcap: Memorizing style knowledge for image captioning

W Zhao, X Wu, X Zhang - Proceedings of the AAAI Conference on Artificial …, 2020‏ - aaai.org
Generating stylized captions for images is a challenging task since it requires not only
describing the content of the image accurately but also expressing the desired linguistic …

Hooks in the headline: Learning to generate headlines with controlled styles

D **, Z **, JT Zhou, L Orii, P Szolovits - arxiv preprint arxiv:2004.01980, 2020‏ - arxiv.org
Current summarization systems only produce plain, factual headlines, but do not meet the
practical needs of creating memorable titles to increase exposure. We propose a new task …

Evolution of visual data captioning Methods, Datasets, and evaluation Metrics: A comprehensive survey

D Sharma, C Dhiman, D Kumar - Expert Systems with Applications, 2023‏ - Elsevier
Abstract Automatic Visual Captioning (AVC) generates syntactically and semantically correct
sentences by describing important objects, attributes, and their relationships with each other …

Semi-supervised text style transfer: Cross projection in latent space

M Shang, P Li, Z Fu, L Bing, D Zhao, S Shi… - arxiv preprint arxiv …, 2019‏ - arxiv.org
Text style transfer task requires the model to transfer a sentence of one style to another style
while retaining its original content meaning, which is a challenging problem that has long …

Similar scenes arouse similar emotions: Parallel data augmentation for stylized image captioning

G Li, Y Zhai, Z Lin, Y Zhang - Proceedings of the 29th ACM International …, 2021‏ - dl.acm.org
Stylized image captioning systems aim to generate a caption not only semantically related to
a given image but also consistent with a given style description. One of the biggest …

Sketch storytelling

Y Zhou - ICASSP 2022-2022 IEEE International Conference on …, 2022‏ - ieeexplore.ieee.org
Sketch storytelling aims to generate a story for a given sketch. Although image captioning
based on deep learning has great progress, describing the sketch in a story style is still a …