A joint multiple criteria model in transfer learning for cross-domain chinese word segmentation

K Huang, D Huang, Z Liu, F Mo - Proceedings of the 2020 …, 2020 - aclanthology.org
Word-level information is important in natural language processing (NLP), especially for the
Chinese language due to its high linguistic complexity. Chinese word segmentation (CWS) …

Improving cross-domain Chinese word segmentation with word embeddings

Y Ye, Y Zhang, W Li, L Qiu, J Sun - arxiv preprint arxiv:1903.01698, 2019 - arxiv.org
Cross-domain Chinese Word Segmentation (CWS) remains a challenge despite recent
progress in neural-based CWS. The limited amount of annotated data in the target domain …

Out-domain Chinese new word detection with statistics-based character embedding

Y Liang, M Yang, J Zhu, SM Yiu - Natural Language Engineering, 2019 - cambridge.org
Unlike English and other Western languages, many Asian languages such as Chinese and
Japanese do not delimit words by space. Word segmentation and new word detection are …

[PDF][PDF] Leveraging rich linguistic features for cross-domain Chinese segmentation

G Wu, D He, K Zhong, X Zhou… - Proceedings of The Third …, 2014 - aclanthology.org
This paper describes the system that we use for Chinese segmentation task in the 3rd CIPS-
SIGHAN bakeoff. We use character sequence labeling method for segmentation, and in …

A feature-rich CRF segmenter for Chinese micro-blog

Y Leng, W Liu, S Wang, X Wang - International Conference on Computer …, 2016 - Springer
This paper describes our system for Chinese word segmentation of micro-blog text, one of
the NLPCC-ICCPOL 2016 Shared Tasks 1. The CRF (Conditional Random Field) model is …

Recurrent neural word segmentation with tag inference

Q Zhou, L Ma, Z Zheng, Y Wang, X Wang - … Language Understanding and …, 2016 - Springer
In this paper, we present a Long Short-Term Memory (LSTM) based model for the task of
Chinese Weibo word segmentation. The model adopts a LSTM layer to capture long-range …

A Chinese word segment model for energy literature based on neural networks with electricity user dictionary

B Song, B Chai, Q Zhang, Q Jia - … International Conference on …, 2019 - ieeexplore.ieee.org
Traditional Chinese word segmentation (CWS) methods are based on supervised machine
learning such as Condtional Random Fields (CRFs), Maximum Entropy (ME), whose …

A Chinese word segmentation model for energy literature based on conditional random fields

L Zhao, W Kong, B Chai - 2018 2nd IEEE Conference on …, 2018 - ieeexplore.ieee.org
Chinese word segmentation is one of the foundation and core tasks for Chinese natural
language processing. Although some achievements have been made for Chinese word …

[PDF][PDF] Integrating surface and abstract features for robust cross-domain Chinese word segmentation

X Li, K Wang, C Zong, KY Su - Proceedings of COLING 2012, 2012 - aclanthology.org
Current character-based approaches are not robust for cross domain Chinese word
segmentation. In this paper, we alleviate this problem by deriving a novel enhanced …

Contextual-and-semantic-information-based domain-adaptive chinese word segmentation

J Zhang, D Huang, D Tong - … : First CCF Conference, NLPCC 2012, Bei**g …, 2012 - Springer
This paper presents a new domain-adaptive Chinese Word Segmentation (CWS) method.
Considering the characteristics of the territorial Out-of–Vocabularies (OOVs), both the …