Audio-Language Datasets of Scenes and Events: A Survey
Audio-language models (ALMs) generate linguistic descriptions of sound-producing events
and scenes. Advances in dataset creation and computational power have led to significant …
EHR-based prediction modelling meets multimodal deep learning: A systematic review of structured and textual data fusion methods
Electronic Health Records (EHRs) have transformed healthcare by digitally
consolidating patient medical history, encompassing structured data (e.g., demographic data …
Clinical entity augmented retrieval for clinical information extraction
Large language models (LLMs) with retrieval-augmented generation (RAG) have improved
information extraction over previous methods, yet their reliance on embeddings often leads …
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Language model pretraining with next token prediction has proved effective for scaling
compute but is limited to the amount of available training data. Scaling reinforcement …
Vision-Language Models Represent Darker-Skinned Black Individuals as More Homogeneous than Lighter-Skinned Black Individuals
Vision-Language Models (VLMs) combine Large Language Model (LLM) capabilities with
image processing, enabling tasks like image captioning and text-to-image generation. Yet …
Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form Text Generation
This paper presents ICAT, an evaluation framework for measuring coverage of diverse
factual information in long-form text generation. ICAT breaks down a long output text into a …
A recent evaluation on the performance of LLMs on radiation oncology physics using questions of randomly shuffled options
Purpose: We present an updated study evaluating the performance of large language
models (LLMs) in answering radiation oncology physics questions, focusing on the recently …
Resource-efficient photonic networks for next-generation AI computing
Current trends in artificial intelligence toward larger models demand a rethinking of both
hardware and algorithms. Photonics-based systems offer high-speed, energy-efficient …
Lightweight safety classification using pruned language models
In this paper, we introduce a novel technique for content safety and prompt injection
classification for Large Language Models. Our technique, Layer Enhanced Classification …
Integrating personalized and contextual information in fine-grained emotion recognition in text: A multi-source fusion approach with explainability
Emotion recognition in textual data is a rapidly evolving field with diverse applications. While
the state-of-the-art (SOTA) models based on pre-trained large language models (LLMs) …