Open-source high quality speech datasets for Basque, Catalan and Galician

O Kjartansson, A Gutkin, A Butryna… - Proceedings of the …, 2020 - aclanthology.org
This paper introduces new open speech datasets for three of the languages of Spain:
Basque, Catalan and Galician. Catalan is furthermore the official language of the Principality …

Buceador, a multi-language search engine for digital libraries

J Adell Mercado, A Bonafonte Cávez… - 2012 International …, 2012 - upcommons.upc.edu
This paper presents a web-based multimedia search engine built within the Buceador (www.
buceador. org) research project. A proof-of-concept tool has been implemented which is …

[PDF][PDF] Building high quality databases for minority languages such as Galician

F Campillo, D Braga, AB Mourín… - Proceedings of the 7th …, 2010 - lrec-conf.org
This paper describes the result of a joint R&D project between Microsoft Portugal and the
Signal Theory Group of the University of Vigo (Spain), where a set of language resources …

Language identification in multilingual, short and noisy texts using common N-grams

D Kosmajac, V Keselj - … Conference on Big Data (Big Data), 2017 - ieeexplore.ieee.org
The problem of Language Identification (LID) has been present in the Natural Language
Processing domain for a relatively long period of time. There is a number of approaches …

“España Verde”: Tourism destination image among German Facebook users

TM Schuh, D Agapito, P Pinto - Handbook of Research on …, 2018 - igi-global.com
This study aims at measuring the image of the tourism brand “España Verde” by using the
social media platform Facebook. The ever-increasing competition within the tourism industry …

Poetry as a continuous struggle between joy and hopelessness: On the Spanish and Galician translations of Adam Zagajewski's selected poems

A Jackiewicz - Hermēneus. Revista de traducción e interpretación, 2024 - revistas.uva.es
El objetivo del artículo es reflexionar sobre el papel de la creatividad en la traducción de la
poesía de Adam Zagajewski. Pretendemos realizar un análisis contrastivo entre dos …

[PDF][PDF] Nos Celtia-GL: an Open High-Quality Speech Synthesis Resource for Galician

NG Dıaz, MV Abuın, C Magarinos, AI Vladu… - isca-archive.org
Abstract We introduce Nos Celtia-GL, an open speech corpus for high-quality speech
synthesis (TTS) in Galician. The corpus consists of 25 hours of single-speaker recordings …

Automatic phonetic transcription by phonological derivation

M Garcia, IJ González - … Processing of the Portuguese Language: 10th …, 2012 - Springer
Automatic phonetic transcription tools usually perform phonetic transcriptions directly from
orthographic representations. Although these approaches often achieve good results …

VenPro: a morphological analyzer for Venetan

S Tonelli, E Pianta, R Delmonte, M Brunelli - Proceedings of the Seventh …, 2010 - iris.unive.it
This document reports the process of extending MorphoPro for Venetan, a lesser-used
language spoken in the Nort-Eastern part of Italy. MorphoPro is the morphological …

SEA_AP: a segmentation and labelling tool for prosodic analysis

PL Otero, LD Fernández, CG Mateo… - Dialectologia: revista …, 2016 - raco.cat
This paper introduces a tool that performs segmentation and labelling of sound chains in
phono units, syllables and/or words departing from a sound signal and its corresponding …