Automated identification of media bias in news articles: an interdisciplinary literature review
Media bias, i.e., slanted news coverage, can strongly impact the public perception of the
reported topics. In the social sciences, research over the past decades has developed …
Web data extraction, applications and techniques: A survey
Web Data Extraction is an important problem that has been studied by means of
different scientific tools and in a broad range of applications. Many approaches to extracting …
IndicNLPSuite: Monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages
In this paper, we introduce NLP resources for 11 major Indian languages from two major
language families. These resources include: (a) large-scale sentence-level monolingual …
We value your privacy... now take some cookies: Measuring the GDPR's impact on web privacy
M Degeling, C Utz, C Lentzsch, H Hosseini… - ar…
Trafilatura: A web scraping library and command-line tool for text discovery and extraction
A Barbaresi - Proceedings of the 59th Annual Meeting of the …, 2021 - aclanthology.org
An essential operation in web corpus construction consists in retaining the desired content
while discarding the rest. Another challenge is finding one's way through websites. This article …