Conditional image-to-video generation with latent flow diffusion models
Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video
starting from an image (eg, a person's face) and a condition (eg, an action class label like …
starting from an image (eg, a person's face) and a condition (eg, an action class label like …
Active teacher for semi-supervised object detection
In this paper, we study teacher-student learning from the perspective of data initialization
and propose a novel algorithm called Active Teacher for semi-supervised object detection …
and propose a novel algorithm called Active Teacher for semi-supervised object detection …
Cantor: Inspiring multimodal chain-of-thought of mllm
With the advent of large language models (LLMs) enhanced by the chain-of-thought (CoT)
methodology, the visual reasoning problem is usually decomposed into manageable sub …
methodology, the visual reasoning problem is usually decomposed into manageable sub …
Ompq: Orthogonal mixed precision quantization
To bridge the ever-increasing gap between deep neural networks' complexity and hardware
capability, network quantization has attracted more and more research attention. The latest …
capability, network quantization has attracted more and more research attention. The latest …
Solving oscillation problem in post-training quantization through a theoretical perspective
Post-training quantization (PTQ) is widely regarded as one of the most efficient compression
methods practically, benefitting from its data privacy and low computation costs. We argue …
methods practically, benefitting from its data privacy and low computation costs. We argue …
Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance Constraints
Coreset selection is powerful in reducing computational costs and accelerating data
processing for deep learning algorithms. It strives to identify a small subset from large-scale …
processing for deep learning algorithms. It strives to identify a small subset from large-scale …
Training language model agents without modifying language models
Researchers and practitioners have recently reframed powerful Large Language Models
(LLMs) as agents, enabling them to automate complex tasks largely via the use of …
(LLMs) as agents, enabling them to automate complex tasks largely via the use of …
Coreset selection with prioritized multiple objectives
Coreset selection is powerful in reducing computational costs and accelerating data
processing for deep learning algorithms. It strives to identify a small subset from large-scale …
processing for deep learning algorithms. It strives to identify a small subset from large-scale …
Hypertime: Hyperparameter optimization for combating temporal distribution shifts
In this work, we propose a hyperparameter optimization method named HyperTime to find
hyperparameters robust to potential temporal distribution shifts in the unseen test data. Our …
hyperparameters robust to potential temporal distribution shifts in the unseen test data. Our …
Autogen: Enabling next-gen LLM applications via multi-agent conversations
We present AutoGen, an open-source framework that allows developers to build LLM
applications by composing multiple agents to converse with each other to accomplish tasks …
applications by composing multiple agents to converse with each other to accomplish tasks …