Google Scholar

Foundation models for generalist medical artificial intelligence

M Moor, O Banerjee, ZSH Abad, HM Krumholz… - Nature, 2023 - nature.com

The exceptionally rapid development of highly flexible, reusable artificial intelligence (AI)
models is likely to usher in newfound capabilities in medicine. We propose a new paradigm …

Cited by 1024 - All 17 versions

[PDF] arxiv.org

A comprehensive survey on pretrained foundation models: A history from BERT to ChatGPT

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - International Journal of …, 2024 - Springer

Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks across different data modalities. A PFM (e.g., BERT, ChatGPT, GPT-4) is …

Cited by 609 - All 2 versions

[PDF] thecvf.com

Segment anything

A Kirillov, E Mintun, N Ravi, H Mao… - Proceedings of the …, 2023 - openaccess.thecvf.com

We introduce the Segment Anything (SA) project: a new task, model, and dataset for
image segmentation. Using our efficient model in a data collection loop, we built the largest …

[PDF] arxiv.org

DINOv2: Learning robust visual features without supervision

M Oquab, T Darcet, T Moutakanni, H Vo… - arXiv preprint arXiv …, 2023 - arxiv.org

The recent breakthroughs in natural language processing for model pretraining on large
quantities of data have opened the way for similar foundation models in computer vision …

[PDF] nature.com

Segment anything in medical images

J Ma, Y He, F Li, L Han, C You, B Wang - Nature Communications, 2024 - nature.com

Medical image segmentation is a critical component in clinical practice, facilitating accurate
diagnosis, treatment planning, and disease monitoring. However, existing methods, often …

Cited by 1321 - All 11 versions

[PDF] thecvf.com

Image as a foreign language: BEiT pretraining for vision and vision-language tasks

W Wang, H Bao, L Dong, J Bjorck… - Proceedings of the …, 2023 - openaccess.thecvf.com

A big convergence of language, vision, and multimodal pretraining is emerging. In this work,
we introduce a general-purpose multimodal foundation model BEiT-3, which achieves …

Cited by 450 - All 5 versions

[PDF] arxiv.org

SpectralGPT: Spectral remote sensing foundation model

D Hong, B Zhang, X Li, Y Li, C Li, J Yao… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

The foundation model has recently garnered significant attention due to its potential to
revolutionize the field of visual representation learning in a self-supervised manner. While …

Cited by 442 - All 6 versions

[PDF] arxiv.org

Video-LLaVA: Learning united visual representation by alignment before projection

B Lin, Y Ye, B Zhu, J Cui, M Ning, P Jin… - arXiv preprint arXiv …, 2023 - arxiv.org

The Large Vision-Language Model (LVLM) has enhanced the performance of various
downstream tasks in visual-language understanding. Most existing approaches encode …

Cited by 431 - All 3 versions

[PDF] thecvf.com

Open-vocabulary panoptic segmentation with text-to-image diffusion models

J Xu, S Liu, A Vahdat, W Byeon… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present ODISE: Open-vocabulary DIffusion-based panoptic SEgmentation, which unifies
pre-trained text-image diffusion and discriminative models to perform open-vocabulary …

Cited by 426 - All 6 versions

[PDF] nature.com

A foundation model for clinical-grade computational pathology and rare cancers detection

E Vorontsov, A Bozkurt, A Casson, G Shaikovski… - Nature Medicine, 2024 - nature.com

The analysis of histopathology images with artificial intelligence aims to enable clinical
decision support systems and precision medicine. The success of such applications …

Cited by 87 - All 4 versions

Masked autoencoders are scalable vision learners

Foundation models for generalist medical artificial intelligence