The influence of preprocessing on text classification using a bag-of-words representation

Y HaCohen-Kerner, D Miller, Y Yigal - PloS one, 2020 - journals.plos.org
Text classification (TC) is the task of automatically assigning documents to a fixed number of
categories. TC is an important component in many text applications. Many of these …

Automatic extraction and learning of keyphrases from scientific articles

Y HaCohen-Kerner, Z Gross, A Masa - International conference on …, 2005 - Springer
Many academic journals and conferences require that each article include a list of
keyphrases. These keyphrases should provide general information about the contents and …

Automatic classification of complaint letters according to service provider categories

Y HaCohen-Kerner, R Dilmon, M Hone… - Information Processing …, 2019 - Elsevier
In the technological age, the phenomenon of complaint letters published on the Internet is
increasing. Therefore, it is important to automatically classify complaint letters according to …

Conjugation-based compression for Hebrew texts

Y Wiseman, I Gefner - ACM Transactions on Asian Language Information …, 2007 - dl.acm.org
Traditional compression techniques do not look deeply into the morphology of languages.
This can be less critical in languages like English where most of the sequences are illegal …

Detecting Hate Speech Spreaders on Twitter using LSTM and BERT in English and Spanish

M Uzan, Y HaCohen-Kerner - CLEF (Working Notes), 2021 - ceur-ws.org
In this paper, we describe our submissions for PAN at CLEF 2021 contest. We tackled the
subtask “Profiling Hate Speech Spreaders on Twitter”. We developed different models for …

JCT at SemEval-2020 Task 12: Offensive language detection in tweets using preprocessing methods, character and word n-grams

M Uzan, Y HaCohen-Kerner - Proceedings of the Fourteenth …, 2020 - aclanthology.org
In this paper, we describe our submissions to SemEval-2020 contest. We tackled subtask 12-
“Multilingual Offensive Language Identification in Social Media”. We developed different …

Detection of Anorexic Girls-In Blog Posts Written in Hebrew Using a Combined Heuristic AI and NLP Method

Y HaCohen-Kerner, N Manor, M Goldmeier… - IEEE …, 2022 - ieeexplore.ieee.org
In this study, we aim to detect in social media texts written in Hebrew girls who are
suspected of being anorexic. We constructed a dataset containing 100 blog posts written by …

Automatic machine learning of keyphrase extraction from short html documents written in Hebrew

Y HaCohen-Kerner, I Stern, D Korkus… - … and Systems: An …, 2007 - Taylor & Francis
Keyphrases extracted from documents may save precious time for tasks such as filtering,
summarization, and categorization. A few such systems are available for documents written …

Detecting Offensive Language in English Hindi and Marathi using Classical Supervised Machine Learning Methods and Word/Char N-grams

Y HaCohen-Kerner, M Uzan - FIRE (Working Notes), 2021 - researchgate.net
In this paper, we describe our submissions for the HASOC 2021 contest. We tackled subtask
1A that addresses the problem of hate speech and offensive language identification in three …

Automatic summarization

LH Belguith, M Ellouze, MH Maaloul, M Jaoua… - … language processing of …, 2014 - Springer
This chapter addresses automatic summarization of Semitic languages. After a presentation
of the theoretical background and current challenges of automatic summarization, we …