Disentangled counterfactual learning for physical audiovisual commonsense reasoning

C Lv, S Zhang, Y Tian, M Qi… - Advances in Neural …, 2023 - proceedings.neurips.cc
In this paper, we propose a Disentangled Counterfactual Learning (DCL) approach for
physical audiovisual commonsense reasoning. The task aims to infer objects' physics …

Semantics-aware spatial-temporal binaries for cross-modal video retrieval

M Qi, J Qin, Y Yang, Y Wang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
With the current exponential growth of video-based social networks, video retrieval using
natural language is receiving ever-increasing attention. Most existing approaches tackle this …

Intelligent small sample defect detection of water walls in power plants using novel deep learning integrating deep convolutional GAN

Z Geng, C Shi, Y Han - IEEE Transactions on Industrial …, 2022 - ieeexplore.ieee.org
Thermal power generation is one of the main forms of electricity generation in the world, and
the share of thermal power generation in total electricity generation has long been …

[HTML][HTML] Temperature forecasting by deep learning methods

B Gong, M Langguth, Y Ji, A Mozaffari… - Geoscientific model …, 2022 - gmd.copernicus.org
Numerical weather prediction (NWP) models solve a system of partial differential equations
based on physical laws to forecast the future state of the atmosphere. These models are …

Semi-supervised teacher-reference-student architecture for action quality assessment

W Yun, M Qi, F Peng, H Ma - European Conference on Computer Vision, 2024 - Springer
Existing action quality assessment (AQA) methods often require a large number of label
annotations for fully supervised learning, which are laborious and expensive. In practice, the …

Sgformer: Semantic graph transformer for point cloud-based 3d scene graph generation

C Lv, M Qi, X Li, Z Yang, H Ma - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
In this paper, we propose a novel model called SGFormer, Semantic Graph TransFormer for
point cloud-based 3D scene graph generation. The task aims to parse a point cloud-based …

Weakly-supervised temporal action localization by inferring salient snippet-feature

W Yun, M Qi, C Wang, H Ma - Proceedings of the AAAI conference on …, 2024 - ojs.aaai.org
Weakly-supervised temporal action localization aims to locate action regions and identify
action categories in untrimmed videos simultaneously by taking only video-level labels as …

[HTML][HTML] Metacognition as a consequence of competing evolutionary time scales

F Kuchling, C Fields, M Levin - Entropy, 2022 - mdpi.com
Evolution is full of coevolving systems characterized by complex spatio-temporal interactions
that lead to intertwined processes of adaptation. Yet, how adaptation across multiple levels …

MapGen-GAN: A fast translator for remote sensing image to map via unsupervised adversarial learning

J Song, J Li, H Chen, J Wu - IEEE Journal of Selected Topics in …, 2021 - ieeexplore.ieee.org
Map is an essential medium for people to understand our changing planet. Recently,
research on generating and updating maps through remote sensing images has been an …

Multi-stage contrastive regression for action quality assessment

Q An, M Qi, H Ma - ICASSP 2024-2024 IEEE International …, 2024 - ieeexplore.ieee.org
In recent years, there has been growing interest in the video-based action quality
assessment (AQA). Most existing methods typically solve AQA problem by considering the …