Lexicon enhanced Chinese sequence labeling using BERT adapter

W Liu, X Fu, Y Zhang, W Xiao - arXiv preprint arXiv:2105.07148, 2021 - arxiv.org
Lexicon information and pre-trained models, such as BERT, have been combined to explore
Chinese sequence labelling tasks due to their respective strengths. However, existing …

Hero: Hierarchical spatio-temporal reasoning with contrastive action correspondence for end-to-end video object grounding

M Li, T Wang, H Zhang, S Zhang, Z Zhao… - Proceedings of the 30th …, 2022 - dl.acm.org
Video Object Grounding (VOG) is the problem of associating spatial object regions in the
video to a descriptive natural language query. This is a challenging vision-language task …

Breaking the representation bottleneck of Chinese characters: Neural machine translation with stroke sequence modeling

Z Wang, X Liu, M Zhang - arXiv preprint arXiv:2211.12781, 2022 - arxiv.org
Existing research generally treats Chinese character as a minimum unit for representation.
However, such Chinese character representation will suffer two bottlenecks: 1) Learning …

Improving Automatic Forced Alignment for Phoneme Segmentation in Quranic Recitation

AMA Alqadasi, AM Zeki, MS Sunar, MSBH Salam… - IEEE …, 2023 - ieeexplore.ieee.org
Segmentation plays a crucial role in speech processing applications, where high accuracy is
essential. The quest for improved accuracy in automatic segmentation, particularly in the …

Enhancing Sindhi Word Segmentation using Subword Representation Learning and Position-aware Self-attention

W Ali, J Kumar, S Tumani, R Nour, A Noor, Z Xu - IEEE Access, 2024 - ieeexplore.ieee.org
Sindhi word segmentation is a challenging task due to space omission and insertion issues.
The Sindhi language itself adds to this complexity. It's cursive and consists of characters with …

NE–LP: normalized entropy- and loss prediction-based sampling for active learning in Chinese word segmentation on EHRs

T Cai, Z Ma, H Zheng, Y Zhou - Neural Computing and Applications, 2021 - Springer
Electronic health records (EHRs) in hospital information systems contain patients' diagnoses
and treatments, so EHRs are essential to clinical data mining. Of all the tasks in the mining …

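A Survey of Chinese Named Entity Recognition Research.

王颖洁, 张程烨, 白凤波, 汪祖民… - Journal of Frontiers of …, 2023 - search.ebscohost.com
With the rapid development of technologies in natural language processing, improving the
accuracy of named entity recognition, an upstream task of natural language processing, is of great significance for subsequent text processing tasks. However …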

Reassembling Fragmented Entity Names: A Novel Model for Chinese Compound Noun Processing

Y Pan, X Fu - Electronics, 2023 - mdpi.com
In the process of classifying intelligent assets, we encountered challenges with a limited
dataset dominated by complex compound noun phrases. Training classifiers directly on this …

Coupling distant annotation and adversarial training for cross-domain Chinese word segmentation

N Ding, D Long, G Xu, M Zhu, P Xie, X Wang… - arXiv preprint arXiv …, 2020 - arxiv.org
Fully supervised neural approaches have achieved significant progress in the task of
Chinese word segmentation (CWS). Nevertheless, the performance of supervised models …

Semi-Supervised Chinese Word Segmentation in Geological Domain Using Pseudo-Lexicon and Self-Training Strategy

B Wan, Z Tan, D Chu, Y Dai, F Fang, Y Wu - Applied Sciences, 2025 - mdpi.com
Featured Application: This study proposes a novel semi-supervised deep learning
framework, GeoCWS, which provides a solution for Chinese word segmentation in the …