Urmăriți
Meera Hahn
Meera Hahn
Research Scientist, Google
Adresă de e-mail confirmată pe google.com - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Videopoet: A large language model for zero-shot video generation
D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ...
arXiv preprint arXiv:2312.14125, 2023
2002023
Photorealistic video generation with diffusion models
A Gupta, L Yu, K Sohn, X Gu, M Hahn, FF Li, I Essa, L Jiang, J Lezama
European Conference on Computer Vision, 393-411, 2024
1422024
Tripping through time: Efficient Localization of Activities in Videos
M Hahn, A Kadav, JM Rehg, HP Graf
British Machine Vision Conference (BMVC), 2020
1082020
No RL, No Simulation: Learning to Navigate without Navigating
M Hahn, D Chaplot, S Tulsiani, M Mukadam, JM Rehg, A Gupta
Advances in Neural Information Processing Systems, 2021
882021
Action2Vec: A Crossmodal Embedding Approach to Action Learning
M Hahn, A Silva, JM Rehg
The IEEE Conference Conference on Computer Vision and Pattern Recognition …, 2018
692018
Situated bayesian reasoning framework for robots operating in diverse everyday environments
S Chernova, V Chu, A Daruna, H Garrison, M Hahn, P Khante, W Liu, ...
Robotics Research: The 18th International Symposium ISRR, 353-369, 2020
312020
Where are you? localization from embodied dialog
M Hahn, J Krantz, D Batra, D Parikh, JM Rehg, S Lee, P Anderson
Empirical Methods in Natural Language Processing (EMNLP), 2020
272020
Deep tracking: Visual tracking using deep convolutional networks
M Hahn, S Chen, A Dehghan
arXiv preprint arXiv:1512.03993, 2015
112015
Learning to localize and align fine-grained actions to sparse instructions
M Hahn, N Ruiz, JB Alayrac, I Laptev, JM Rehg
The IEEE Conference Conference on Computer Vision and Pattern Recognition …, 2017
92017
Videopoet: A large language model for zero-shot video generation, 2024
D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ...
URL https://arxiv. org/abs/2312.14125 1, 2024
6*2024
Efficient and fine-grained video retrieval
A Kadav, I Melvin, HP Graf, M Hahn
US Patent 11,568,247, 2023
62023
Transformer-based Localization from Embodied Dialog with Large-scale Pre-training
M Hahn, JM Rehg
Conference of the Asia-Pacific Association for Computational Linguistics …, 2022
62022
SiRoK: Situated Robot Knowledge-Understanding the Balance Between Situated Knowledge and Variability.
AA Daruna, V Chu, W Liu, M Hahn, P Khante, S Chernova, A Thomaz
AAAI Spring Symposia, 2018
42018
Which way is right?: Uncovering limitations of Vision-and-Language Navigation model
M Hahn, A Raj, JM Rehg
arXiv preprint arXiv:2312.00151, 2023
22023
FineStyle: Fine-grained Controllable Style Personalization for Text-to-image Models
G Zhang, K Sohn, M Hahn, H Shi, I Essa
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
12024
Text and Click inputs for unambiguous open vocabulary instance segmentation
N Warner, M Hahn, J Huang, I Essa, V Birodkar
arXiv preprint arXiv:2311.14822, 2023
12023
Learning a Visually Grounded Memory Assistant
M Hahn, K Carlberg, R Desai, J Hillis
arXiv preprint arXiv:2210.03787, 2022
12022
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
S Yu, M Hahn, D Kondratyuk, J Shin, A Gupta, J Lezama, I Essa, D Ross, ...
arXiv preprint arXiv:2502.12632, 2025
2025
Learning Complex Non-Rigid Image Edits from Multimodal Conditioning
N Warner, J Kolb, M Hahn, V Birodkar, J Huang, I Essa
arXiv preprint arXiv:2412.10219, 2024
2024
Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty
M Hahn, W Zeng, N Kannen, R Galt, K Badola, B Kim, Z Wang
arXiv preprint arXiv:2412.06771, 2024
2024
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20