- Academic Search

Y Qin, S Hu, Y Lin, W Chen, N Ding, G Cui… - ACM Computing …, 2024 - dl.acm.org

Humans possess an extraordinary ability to create and utilize tools. With the advent of
foundation models, artificial intelligence systems have the potential to be equally adept in …

Uložit Citovat Počet citací tohoto článku: 255 Související články Všechny verze (počet: 6)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of embodied ai: From simulators to research tasks

J Duan, S Yu, HL Tan, H Zhu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

There has been an emerging paradigm shift from the era of “internet AI” to “embodied AI,”
where AI algorithms and agents no longer learn from datasets of images, videos or text …

Uložit Citovat Počet citací tohoto článku: 324 Související články Všechny verze (počet: 8)

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Objaverse: A universe of annotated 3d objects

M Deitke, D Schwenk, J Salvador… - Proceedings of the …, 2023 - openaccess.thecvf.com

Massive data corpora like WebText, Wikipedia, Conceptual Captions, WebImageText, and
LAION have propelled recent dramatic progress in AI. Large neural models trained on such …

Uložit Citovat Počet citací tohoto článku: 763 Související články Všechny verze (počet: 5) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Synthetic data from diffusion models improves imagenet classification

S Azizi, S Kornblith, C Saharia, M Norouzi… - arxiv preprint arxiv …, 2023 - arxiv.org

Deep generative models are becoming increasingly powerful, now generating diverse high
fidelity photo-realistic samples given text prompts. Have they reached the point where …

Uložit Citovat Počet citací tohoto článku: 313 Související články Všechny verze (počet: 3) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org

AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Uložit Citovat Počet citací tohoto článku: 4768 Související články Všechny verze (počet: 2) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Habitat 2.0: Training home assistants to rearrange their habitat

A Szot, A Clegg, E Undersander… - Advances in neural …, 2021 - proceedings.neurips.cc

Abstract We introduce Habitat 2.0 (H2. 0), a simulation platform for training virtual robots in
interactive 3D environments and complex physics-enabled scenarios. We make …

Uložit Citovat Počet citací tohoto článku: 531 Související články Všechny verze (počet: 6) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

M Deitke, E VanderBilt, A Herrasti… - Advances in …, 2022 - proceedings.neurips.cc

Massive datasets and high-capacity models have driven many recent advancements in
computer vision and natural language understanding. This work presents a platform to …

Uložit Citovat Počet citací tohoto článku: 207 Související články Všechny verze (počet: 5) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Improving multimodal datasets with image captioning

T Nguyen, SY Gadre, G Ilharco… - Advances in Neural …, 2024 - proceedings.neurips.cc

Massive web datasets play a key role in the success of large vision-language models like
CLIP and Flamingo. However, the raw web data is noisy, and existing filtering methods to …

Uložit Citovat Počet citací tohoto článku: 67 Související články Všechny verze (počet: 6) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Kubric: A scalable dataset generator

K Greff, F Belletti, L Beyer, C Doersch… - Proceedings of the …, 2022 - openaccess.thecvf.com

Data is the driving force of machine learning, with the amount and quality of training data
often being more important for the performance of a system than architecture and training …

Uložit Citovat Počet citací tohoto článku: 216 Související články Všechny verze (počet: 5) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Ai2-thor: An interactive 3d environment for visual ai

E Kolve, R Mottaghi, W Han, E VanderBilt… - arxiv preprint arxiv …, 2017 - arxiv.org

We introduce The House Of inteRactions (THOR), a framework for visual AI research,
available at http://ai2thor. allenai. org. AI2-THOR consists of near photo-realistic 3D indoor …

Uložit Citovat Počet citací tohoto článku: 984 Související články Všechny verze (počet: 2) Zobrazit jako HTML

Vytvořit upozornění

Citovat

Rozšířené vyhledávání

Uloženo do Mojí knihovny

Threedworld: A platform for interactive multi-modal physical simulation

Tool learning with foundation models

A survey of embodied ai: From simulators to research tasks

Objaverse: A universe of annotated 3d objects

Synthetic data from diffusion models improves imagenet classification

On the opportunities and risks of foundation models

Habitat 2.0: Training home assistants to rearrange their habitat

🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

Improving multimodal datasets with image captioning

Kubric: A scalable dataset generator

Ai2-thor: An interactive 3d environment for visual ai