Google Scholar

Large-scale video classification with convolutional neural networks

Reinforcement learning algorithms: A brief survey

AK Shakya, G Pillai, S Chakrabarty - Expert Systems with Applications, 2023 - Elsevier

Reinforcement Learning (RL) is a machine learning (ML) technique to learn sequential
decision-making in complex problems. RL is inspired by trial-and-error based human/animal …

Cited by 219. All 2 versions.

[PDF] cell.com Full View

Artificial intelligence for multimodal data integration in oncology

J Lipkova, RJ Chen, B Chen, MY Lu, M Barbieri… - Cancer cell, 2022 - cell.com

In oncology, the patient state is characterized by a whole spectrum of modalities, ranging
from radiology, histology, and genomics to electronic health records. Current artificial …

Cited by 339. All 8 versions.

[PDF] neurips.cc

GhostNetV2: Enhance cheap operation with long-range attention

Y Tang, K Han, J Guo, C Xu, C Xu… - Advances in Neural …, 2022 - proceedings.neurips.cc

Light-weight convolutional neural networks (CNNs) are specially designed for applications
on mobile devices with faster inference speed. The convolutional operation can only capture …

Cited by 340. All 6 versions. HTML version.

[PDF] arxiv.org

MotionDiffuse: Text-driven human motion generation with diffusion model

M Zhang, Z Cai, L Pan, F Hong, X Guo… - IEEE transactions on …, 2024 - ieeexplore.ieee.org

Human motion modeling is important for many modern graphics applications, which typically
require professional skills. In order to remove the skill barriers for laymen, recent motion …

Cited by 514. All 8 versions.

[PDF] thecvf.com

Video probabilistic diffusion models in projected latent space

S Yu, K Sohn, S Kim, J Shin - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Despite the remarkable progress in deep generative models, synthesizing high-resolution
and temporally coherent videos still remains a challenge due to their high-dimensionality …

Cited by 175. All 10 versions. HTML version.

[PDF] arxiv.org

LanguageBind: Extending video-language pretraining to n-modality by language-based semantic alignment

B Zhu, B Lin, M Ning, Y Yan, J Cui, HF Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

The video-language (VL) pretraining has achieved remarkable improvement in multiple
downstream tasks. However, the current VL pretraining framework is hard to extend to …

Cited by 158. All 4 versions. HTML version.

[PDF] arxiv.org

Expanding language-image pretrained models for general video recognition

B Ni, H Peng, M Chen, S Zhang, G Meng, J Fu… - European conference on …, 2022 - Springer

Contrastive language-image pretraining has shown great success in learning visual-textual
joint representation from web-scale data, demonstrating remarkable “zero-shot” …

Cited by 348. All 8 versions.

[PDF] arxiv.org

CogVideo: Large-scale pretraining for text-to-video generation via transformers

W Hong, M Ding, W Zheng, X Liu, J Tang - arXiv preprint arXiv:2205.15868, 2022 - arxiv.org

Large-scale pretrained transformers have created milestones in text (GPT-3) and
text-to-image (DALL-E and CogView) generation. Its application to video generation is still facing …

Cited by 487. All 4 versions. HTML version.

[PDF] neurips.cc

S4ND: Modeling images and videos as multidimensional signals with state spaces

E Nguyen, K Goel, A Gu, G Downs… - Advances in neural …, 2022 - proceedings.neurips.cc

Visual data such as images and videos are typically modeled as discretizations of inherently
continuous, multidimensional signals. Existing continuous-signal models attempt to exploit …

Cited by 199. All 6 versions. HTML version.

[PDF] neurips.cc

ST-Adapter: Parameter-efficient image-to-video transfer learning

J Pan, Z Lin, X Zhu, J Shao, H Li - Advances in Neural …, 2022 - proceedings.neurips.cc

Capitalizing on large pre-trained models for various downstream tasks of interest have
recently emerged with promising performance. Due to the ever-growing model size, the …

Cited by 261. All 8 versions. HTML version.
