Sugarcrepe++ dataset: Vision-language model sensitivity to semantic and lexical alterations

SH Dumpala, A Jaiswal, C Sastry, E Milios… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite their remarkable successes, state-of-the-art large language models (LLMs),
including vision-and-language models (VLMs) and unimodal language models (ULMs), fail …

LegalVis: Exploring and inferring precedent citations in legal documents

LE Resck, JR Ponciano, LG Nonato… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
To reduce the number of pending cases and conflicting rulings in the Brazilian Judiciary, the
National Congress amended the Constitution, allowing the Brazilian Supreme Court (STF) to …

Détecter la réutilisation de texte avec Passim

M Romanello, S Hengchen - Programming Historian, 2021 - programminghistorian.org
In this lesson, you will be introduced to the automatic detection of text reuse
with the Passim library. You will learn how to install and run Passim and its …

ATR-Vis: Visual and interactive information retrieval for parliamentary discussions in Twitter

R Makki, E Carvalho, AJ Soto, S Brooks… - ACM Transactions on …, 2018 - dl.acm.org
The worldwide adoption of Twitter turned it into one of the most popular platforms for content
analysis as it serves as a gauge of the public's feeling and opinion on a variety of topics …

Automated classification of content components in technical communication

J Oevermann, W Ziegler - Computational Intelligence, 2018 - Wiley Online Library
Automated classification is usually not adjusted to specialized domains due to a lack of
suitable data collections and insufficient characterization of the domain‐specific content and …

Automated intrinsic text classification for component content management applications in technical communication

J Oevermann, W Ziegler - Proceedings of the 2016 ACM Symposium on …, 2016 - dl.acm.org
Classification models are used in component content management to identify content
components for retrieval, reuse and distribution. Intrinsic metadata, such as the assigned …

Evaluation of different text representation techniques and distance metrics using KNN for documents classification

LA Calvo-Valverde, JA Mena-Arias - Revista Tecnología en Marcha, 2020 - scielo.sa.cr
Currently, textual data constitute a fundamental part of databases
around the world, and one of the greatest challenges has been the extraction of useful information …

Evaluación de distintas técnicas de representación de texto y medidas de distancia de texto usando KNN para clasificación de documentos

LAC Valverde, JAM Arias - Tecnología en Marcha, 2020 - dialnet.unirioja.es
Currently, textual data constitute a fundamental part of databases
around the world, and one of the greatest challenges has been the extraction of useful information …

Semantic annotation of heterogeneous data sources: Towards an integrated information framework for service technicians

S Bader, J Oevermann - … of the 13th International Conference on …, 2017 - dl.acm.org
Service technicians in the domain of industrial maintenance require extensive technical
knowledge and experience to complete their tasks. Some of the needed knowledge is made …

Calculating Similarity of Javadoc Comments

DV Koznov, EY Ledeneva, DV Luciv… - … and Computer Software, 2024 - Springer
Code comments are an essential part of software documentation. Many software projects
suffer from the problem of low-quality comments that are often produced by copy-paste. In …