GPT-NAS: Neural architecture search meets generative pre-trained transformer model

C Yu, X Liu, Y Wang, Y Liu, W Feng… - Big Data Mining and …, 2024 - ieeexplore.ieee.org
The pursuit of optimal neural network architectures is foundational to the progression of
Neural Architecture Search (NAS). However, the existing NAS methods suffer from the …

Differential privacy may have a potential optimization effect on some swarm intelligence algorithms besides privacy-preserving

Z Zhang, H Zhu, M **e - Information Sciences, 2024 - Elsevier
Differential privacy (DP), as a promising privacy-preserving model, has attracted great
interest from researchers in recent years. At present, research on the combination of deep …

GPT-NAS: Evolutionary neural architecture search with the generative pre-trained model

C Yu, X Liu, Y Wang, Y Liu, W Feng, X Deng… - arxiv preprint arxiv …, 2023 - arxiv.org
Neural Architecture Search (NAS) has emerged as one of the effective methods to design
the optimal neural network architecture automatically. Although neural architectures have …

[HTML][HTML] Advancing continual lifelong learning in neural information retrieval: definition, dataset, framework, and empirical evaluation

J Hou, G Cosma, A Finke - Information Sciences, 2025 - Elsevier
Continual learning refers to the capability of a machine learning model to learn and adapt to
new information, without compromising its performance on previously learned tasks …

[HTML][HTML] SecureTLM: Private inference for transformer-based large model with MPC

Y Chen, X Meng, Z Shi, Z Ning, J Lin - Information Sciences, 2024 - Elsevier
Abstract Transformer-based Large Models (TLM), such as generative pre-trained models
(GPT), have become increasingly popular for practical applications through Deep Learning …

Lifelong Sentiment Classification Based on Adaptive Parameter Updating

Z Zhang, J Wang, K Nie, X Wang, J Liu - International Conference on …, 2024 - Springer
A classifier with the ability to handle continuous streams of opinion information on the
Internet should have good lifelong learning ability. However, deep neural networks face the …

OL4TeX: Adaptive Online Learning for Text Classification under Distribution Shifts

MS Kim, L Liu, HY Kwon - 2024 IEEE International Conference …, 2024 - ieeexplore.ieee.org
This study presents an adaptive online learning method for text classification under
distribution shifts. We formulate a typical neural network-based text classification model as …

Advancing neural machine continual learning and unlearning for language models in information retrieval systems

J Hou - 2024 - repository.lboro.ac.uk
Information retrieval (IR) refers to obtaining relevant information from a repository, such as a
database or the internet, based on a user's query. The rise of deep learning has led to the …

Stability-Plasticity Trade-Off in Large Language Models for Health Chatbot Applications

IT Tapang - 2024 - search.proquest.com
This study investigated the intricate dynamics of the stability-plasticity trade-off within large
language models (LLMs) as applied to healthcare chatbot systems, specifically focusing on …