A joint multiple criteria model in transfer learning for cross-domain chinese word segmentation
Word-level information is important in natural language processing (NLP), especially for the
Chinese language due to its high linguistic complexity. Chinese word segmentation (CWS) …
Chinese language due to its high linguistic complexity. Chinese word segmentation (CWS) …
Improving cross-domain Chinese word segmentation with word embeddings
Cross-domain Chinese Word Segmentation (CWS) remains a challenge despite recent
progress in neural-based CWS. The limited amount of annotated data in the target domain …
progress in neural-based CWS. The limited amount of annotated data in the target domain …
Out-domain Chinese new word detection with statistics-based character embedding
Unlike English and other Western languages, many Asian languages such as Chinese and
Japanese do not delimit words by space. Word segmentation and new word detection are …
Japanese do not delimit words by space. Word segmentation and new word detection are …
[PDF][PDF] Leveraging rich linguistic features for cross-domain Chinese segmentation
G Wu, D He, K Zhong, X Zhou… - Proceedings of The Third …, 2014 - aclanthology.org
This paper describes the system that we use for Chinese segmentation task in the 3rd CIPS-
SIGHAN bakeoff. We use character sequence labeling method for segmentation, and in …
SIGHAN bakeoff. We use character sequence labeling method for segmentation, and in …
A feature-rich CRF segmenter for Chinese micro-blog
Y Leng, W Liu, S Wang, X Wang - International Conference on Computer …, 2016 - Springer
This paper describes our system for Chinese word segmentation of micro-blog text, one of
the NLPCC-ICCPOL 2016 Shared Tasks 1. The CRF (Conditional Random Field) model is …
the NLPCC-ICCPOL 2016 Shared Tasks 1. The CRF (Conditional Random Field) model is …
Recurrent neural word segmentation with tag inference
In this paper, we present a Long Short-Term Memory (LSTM) based model for the task of
Chinese Weibo word segmentation. The model adopts a LSTM layer to capture long-range …
Chinese Weibo word segmentation. The model adopts a LSTM layer to capture long-range …
A Chinese word segment model for energy literature based on neural networks with electricity user dictionary
B Song, B Chai, Q Zhang, Q Jia - … International Conference on …, 2019 - ieeexplore.ieee.org
Traditional Chinese word segmentation (CWS) methods are based on supervised machine
learning such as Condtional Random Fields (CRFs), Maximum Entropy (ME), whose …
learning such as Condtional Random Fields (CRFs), Maximum Entropy (ME), whose …
A Chinese word segmentation model for energy literature based on conditional random fields
L Zhao, W Kong, B Chai - 2018 2nd IEEE Conference on …, 2018 - ieeexplore.ieee.org
Chinese word segmentation is one of the foundation and core tasks for Chinese natural
language processing. Although some achievements have been made for Chinese word …
language processing. Although some achievements have been made for Chinese word …
[PDF][PDF] Integrating surface and abstract features for robust cross-domain Chinese word segmentation
X Li, K Wang, C Zong, KY Su - Proceedings of COLING 2012, 2012 - aclanthology.org
Current character-based approaches are not robust for cross domain Chinese word
segmentation. In this paper, we alleviate this problem by deriving a novel enhanced …
segmentation. In this paper, we alleviate this problem by deriving a novel enhanced …
Contextual-and-semantic-information-based domain-adaptive chinese word segmentation
J Zhang, D Huang, D Tong - … : First CCF Conference, NLPCC 2012, Bei**g …, 2012 - Springer
This paper presents a new domain-adaptive Chinese Word Segmentation (CWS) method.
Considering the characteristics of the territorial Out-of–Vocabularies (OOVs), both the …
Considering the characteristics of the territorial Out-of–Vocabularies (OOVs), both the …