Llm-qat: Data-free quantization aware training for large language models

Z Liu, B Oguz, C Zhao, E Chang, P Stock… - arxiv preprint arxiv …, 2023 - arxiv.org
Several post-training quantization methods have been applied to large language models
(LLMs), and have been shown to perform well down to 8-bits. We find that these methods …

Learning to learn from apis: Black-box data-free meta-learning

Z Hu, L Shen, Z Wang, B Wu… - … on Machine Learning, 2023 - proceedings.mlr.press
Data-free meta-learning (DFML) aims to enable efficient learning of new tasks by meta-
learning from a collection of pre-trained models without access to the training data. Existing …

Architecture, dataset and model-scale agnostic data-free meta-learning

Z Hu, L Shen, Z Wang, T Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
The goal of data-free meta-learning is to learn useful prior knowledge from a collection of
pre-trained models without accessing their training data. However, existing works only solve …

Sparse model inversion: efficient inversion of vision transformers for data-free applications

Z Hu, Y Wei, L Shen, Z Wang, L Li… - Forty-first International …, 2024 - openreview.net
Model inversion, which aims to reconstruct the original training data from pre-trained
discriminative models, is especially useful when the original training data is unavailable due …

MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets

P Liao, XL Wang, Y **, WL Du - European Conference on Computer …, 2024 - Springer
Deploying models across diverse devices demands tradeoffs among multiple objectives due
to different resource constraints. Arguably, due to the small model trap problem in multi …

Multimodal Label Relevance Ranking via Reinforcement Learning

T Guo, T Zhang, H Wu, H Li, R Qiao, X Sun - European Conference on …, 2024 - Springer
Conventional multi-label recognition methods often focus on label confidence, frequently
overlooking the pivotal role of partial order relations consistent with human preference. To …

Leverage class-specific accuracy to guide data generation for improving image classification

J Gala, P **e - Forty-first International Conference on Machine …, 2024 - openreview.net
In many image classification applications, the number of labeled training images is limited,
which leads to model overfitting. To mitigate the lack of training data, deep generative …

Automated dominative subspace mining for efficient neural architecture search

Y Chen, Y Guo, D Liao, F Lv, H Song… - … on Circuits and …, 2024 - ieeexplore.ieee.org
Neural Architecture Search (NAS) aims to automatically find effective architectures within a
predefined search space. However, the search space is often extremely large. As a result …

Task-Distributionally Robust Data-Free Meta-Learning

Z Hu, L Shen, Z Wang, Y Wei, B Wu, C Yuan… - arxiv preprint arxiv …, 2023 - arxiv.org
Data-Free Meta-Learning (DFML) aims to efficiently learn new tasks by leveraging multiple
pre-trained models without requiring their original training data. Existing inversion-based …

基于最优架构搜索网络的液压泵故障诊断改进方法研究.

郑直, 刘彤谣, 赵文博, 刘伟民… - Machine Tool & …, 2024 - search.ebscohost.com
针对神经网络结构搜索方法(NAS) 在搜索最优结构时存在性能评估效率偏低,
以及由于模型泛化性能力不足导致液压泵故障诊断精度过低等问题, 提出一种改进的Data-free …