HunEmBERT: a fine-tuned BERT-model for classifying sentiment and emotion in political communication

I Üveges, O Ring - IEEE Access, 2023 - ieeexplore.ieee.org
The growing number of digitally accessible text corpora and the accelerating development of
NLP tools and methods (particularly the emergence of powerful large-scale language …

Mono-and multilingual GPT-3 models for Hungarian

ZG Yang, LJ Laki, T Váradi, G Prószéky - International Conference on Text …, 2023 - Springer
In recent years, the growth in size of Transformer-based language models has accelerated
significantly. Global technology companies are training larger and larger models that require …

A multimodal deep learning architecture for smoking detection with a small data approach

R Lakatos, P Pollner, A Hajdu, T Joó - Frontiers in Artificial …, 2024 - frontiersin.org
Covert tobacco advertisements often raise regulatory measures. This paper presents that
artificial intelligence, particularly deep learning, has great potential for detecting hidden …

Can Triplet Loss Be Used for Multi-Label Few-Shot Classification? A Case Study

GM Csányi, R Vági, A Megyeri, A Fülöp, D Nagy… - Information, 2023 - mdpi.com
Few-shot learning is a deep learning subfield that is the focus of research nowadays. This
paper addresses the research question of whether a triplet-trained Siamese network, initially …

[PDF][PDF] Analyzing Narratives of Patient Experiences: A BERT Topic Modeling Approach

M Osváth, ZG Yang, K Kósa - Acta Polytech. Hung, 2023 - researchgate.net
Due to healthcare systems increased focus on healthcare quality and patientcentered care,
the patients' perspective of delivered healthcare, has become an important part of …

Introducing the CURLICAT corpora: seven-language domain specific annotated corpora from curated sources

T Váradi, B Nyéki, S Koeva, M Tadić… - Proceedings of the …, 2022 - aclanthology.org
This article presents the current outcomes of the CURLICAT CEF Telecom project, which
aims to collect and deeply annotate a set of large corpora from selected domains. The …

End-to-end Multilingual Coreference Resolution with Mention Head Prediction

O Pražák, M Konopik - arxiv preprint arxiv:2209.12516, 2022 - arxiv.org
This paper describes our approach to the CRAC 2022 Shared Task on Multilingual
Coreference Resolution. Our model is based on a state-of-the-art end-to-end coreference …

Enhancing deep neural networks with morphological information

M Klemen, L Krsnik, M Robnik-Šikonja - Natural Language …, 2023 - cambridge.org
Deep learning approaches are superior in natural language processing due to their ability to
extract informative features and patterns from languages. The two most successful neural …

[HTML][HTML] From Fact Drafts to Operational Systems: Semantic Search in Legal Decisions Using Fact Drafts

GM Csányi, D Lakatos, I Üveges, A Megyeri… - Big Data and Cognitive …, 2024 - mdpi.com
This research paper presents findings from an investigation in the semantic similarity search
task within the legal domain, using a corpus of 1172 Hungarian court decisions. The study …

[PDF][PDF] Sentiment Analysis with Neural Models for Hungarian

LJ Laki, ZG Yang - Acta Polytechnica Hungarica, 2023 - researchgate.net
Sentiment analysis is a powerful tool to gain insight into the emotional polarity of
opinionated texts. Computerized applications can contribute to the establishment of …