[PDF][PDF] The brWaC corpus: A new open resource for Brazilian Portuguese

JA Wagner Filho, R Wilkens, M Idiart… - Proceedings of the …, 2018 - aclanthology.org
In this work, we present the construction process of a large Web corpus for Brazilian
Portuguese, aiming to achieve a size comparable to the state of the art in other languages …

Cross-lingual induction and transfer of verb classes based on word vector space specialisation

I Vulić, N Mrkšić, A Korhonen - arxiv preprint arxiv:1707.06945, 2017 - arxiv.org
Existing approaches to automatic VerbNet-style verb classification are heavily dependent on
feature engineering and therefore limited to languages with mature NLP pipelines. In this …

Text complexity of open educational resources in Portuguese: mixing written and spoken registers in a multi-task approach

M Gazzola, S Leal, B Pedroni, F Theoto Rocha… - Language Resources …, 2022 - Springer
This paper presents a study on text complexity of Open Educational Resources (OER) in
Brazilian Portuguese. In a data analysis of the Brazilian Ministry of Education Integrated …

[PDF][PDF] Predição da complexidade textual de recursos educacionais abertos em português

MG Gazzola, SE Leal, SM Aluisio - Proceedings, 2019 - repositorio.usp.br
In 2016, UNESCO stated the priorities for the use of Open Educational Resources (OER),
highlighting the main research challenges. The lack of quality of OER is a challenge to be …

A lexical simplification tool for promoting health literacy

L Zilio, LB Paraguassu… - 1st Workshop on …, 2020 - openresearch.surrey.ac.uk
This paper presents MedSimples, an authoring tool that combines Natural Language
Processing, Corpus Linguistics and Terminology to help writers to convert health-related …

Passport: A dependency parsing model for portuguese

L Zilio, R WILkENS, C Fairon - International Conference on Computational …, 2018 - Springer
Parsers are essential tools for several NLP applications. Here we introduce PassPort, a
model for the dependency parsing of Portuguese trained with the Stanford Parser. For …

Automatic construction of large readability corpora

JA Wagner Filho, R Wilkens… - Proceedings of the …, 2016 - aclanthology.org
This work presents a framework for the automatic construction of large Web corpora
classified by readability level. We compare different Machine Learning classifiers for the task …

LexSubNC: A dataset of lexical substitution for nominal compounds

R Wilkens, L Zilio, S Cordeiro, FSF Paula… - Proceedings of the 12th …, 2017 - hal.science
In the context of NLP tasks such as text simplification, lexicons containing information about
semantically related words are an important resource for evaluating the quality of the system …

[PDF][PDF] Abordagem baseada em aumento de dados para avaliaçao automática de leiturabilidade

LC de MENEZES, P Aline, MJB FINATTO - Domínios de Lingu@ gem, 2023 - seer.ufu.br
Embora estudos sobre como medir a leiturabilidade de um texto remontem ao século
passado, ainda não há um consenso sobre quais seriam as melhores métricas …

Desarrollo de un Framework para la identificación del nivel de complejidad de texto para el entrenamiento de chatbots basado en Machine Learning

H Matos Rios - 2022 - tesis.pucp.edu.pe
La generación de diálogo implica diseñar un programa para generar una conversación
natural, esto requiere desarrollar algoritmos que puedan conversar con un ser humano y …