Perceiver-actor: A multi-task transformer for robotic manipulation

M Shridhar, L Manuelli, D Fox - Conference on Robot …, 2023 - proceedings.mlr.press
Transformers have revolutionized vision and natural language processing with their ability to
scale with large datasets. But in robotic manipulation, data is both limited and expensive …

Blind image quality assessment via vision-language correspondence: A multitask learning perspective

W Zhang, G Zhai, Y Wei, X Yang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We aim at advancing blind image quality assessment (BIQA), which predicts the human
perception of image quality without any reference information. We develop a general and …

Instruction-driven history-aware policies for robotic manipulations

PL Guhur, S Chen, RG Pinel… - … on Robot Learning, 2023 - proceedings.mlr.press
In human environments, robots are expected to accomplish a variety of manipulation tasks
given simple natural language instructions. Yet, robotic manipulation is extremely …

ManiGaussian: Dynamic Gaussian splatting for multi-task robotic manipulation

G Lu, S Zhang, Z Wang, C Liu, J Lu, Y Tang - European Conference on …, 2024 - Springer
Performing language-conditioned robotic manipulation tasks in unstructured environments
is in high demand for general intelligent robots. Conventional robotic manipulation …

ChainedDiffuser: Unifying trajectory diffusion and keypose prediction for robotic manipulation

Z Xian, N Gkanatsios, T Gervet, TW Ke… - … Annual Conference on …, 2023 - openreview.net
We present ChainedDiffuser, a policy architecture that unifies action keypose prediction and
trajectory diffusion generation for learning robot manipulation from demonstrations. Our …

Prismer: A vision-language model with an ensemble of experts

S Liu, L Fan, E Johns, Z Yu, C Xiao… - arXiv preprint arXiv …, 2023 - authors.library.caltech.edu
Recent vision-language models have shown impressive multi-modal generation
capabilities. However, typically they require training huge models on massive datasets. As a …

Act3D: Infinite resolution action detection transformer for robotic manipulation

T Gervet, Z Xian, N Gkanatsios… - arXiv preprint arXiv …, 2023 - arxiv.org
3D perceptual representations are well suited for robot manipulation as they easily encode
occlusions and simplify spatial reasoning. Many manipulation tasks require high spatial …

Act3D: 3D feature field transformers for multi-task robotic manipulation

T Gervet, Z Xian, N Gkanatsios… - 7th Annual Conference …, 2023 - openreview.net
3D perceptual representations are well suited for robot manipulation as they easily encode
occlusions and simplify spatial reasoning. Many manipulation tasks require high spatial …

Image quality-aware diagnosis via meta-knowledge co-embedding

H Che, S Chen, H Chen - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Medical images usually suffer from image degradation in clinical practice, leading to
decreased performance of deep learning-based models. To resolve this problem, most …

ForkMerge: Mitigating negative transfer in auxiliary-task learning

J Jiang, B Chen, J Pan, X Wang, D Liu… - Advances in …, 2024 - proceedings.neurips.cc
Auxiliary-Task Learning (ATL) aims to improve the performance of the target task by
leveraging the knowledge obtained from related tasks. Occasionally, learning multiple tasks …