Segueix
An Yan
Títol
Citada per
Citada per
Any
CosRec: 2D convolutional neural networks for sequential recommendation
A Yan, S Cheng, WC Kang, M Wan, J McAuley
Proceedings of the 28th ACM international conference on information and …, 2019
1372019
RadBERT: adapting transformer-based language models to radiology
A Yan, J McAuley, X Lu, J Du, EY Chang, A Gentili, CN Hsu
Radiology: Artificial Intelligence 4 (4), e210258, 2022
1292022
PA3D: Pose-action 3D machine for video recognition
A Yan, Y Wang, Z Li, Y Qiao
Proceedings of the ieee/cvf conference on computer vision and pattern …, 2019
1072019
Weakly supervised contrastive learning for chest x-ray report generation
A Yan, Z He, X Lu, J Du, E Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2109.12242, 2021
812021
Gpt-4v in wonderland: Large multimodal models for zero-shot smartphone gui navigation
A Yan, Z Yang, W Zhu, K Lin, L Li, J Wang, J Yang, Y Zhong, J McAuley, ...
arXiv preprint arXiv:2311.07562, 2023
802023
Bridging language and items for retrieval and recommendation
Y Hou, J Li, Z He, A Yan, X Chen, J McAuley
arXiv preprint arXiv:2403.03952, 2024
752024
Learning concise and descriptive attributes for visual recognition
A Yan, Y Wang, Y Zhong, C Dong, Z He, Y Lu, WY Wang, J Shang, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
682023
Gpt-4v (ision) as a generalist evaluator for vision-language tasks
X Zhang, Y Lu, W Wang, A Yan, J Yan, L Qin, H Wang, X Yan, WY Wang, ...
arXiv preprint arXiv:2311.01361, 2023
582023
xgen-mm (blip-3): A family of open large multimodal models
L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...
arXiv preprint arXiv:2408.08872, 2024
422024
Personalized showcases: Generating multi-modal explanations for recommendations
A Yan, Z He, J Li, T Zhang, J McAuley
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
352023
Visualize before you write: Imagination-guided open-ended text generation
W Zhu, A Yan, Y Lu, W Xu, XE Wang, M Eckstein, WY Wang
arXiv preprint arXiv:2210.03765, 2022
352022
Multimodal text style transfer for outdoor vision-and-language navigation
W Zhu, XE Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang
arXiv preprint arXiv:2007.00229, 2020
332020
Robust and interpretable medical image classifiers via concept bottleneck models
A Yan, Y Wang, Y Zhong, Z He, P Karypis, Z Wang, C Dong, A Gentili, ...
arXiv preprint arXiv:2310.03182, 2023
322023
Personalized complementary product recommendation
A Yan, C Dong, Y Gao, J Fu, T Zhao, Y Sun, J McAuley
The ACM Web Conference, 2022
322022
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law
ZZ Chen, J Ma, X Zhang, N Hao, A Yan, A Nourbakhsh, X Yang, ...
arXiv preprint arXiv:2405.01769, 2024
212024
Cross-lingual vision-language navigation
A Yan, XE Wang, J Feng, L Li, WY Wang
arXiv preprint arXiv:1910.11301, 2019
212019
Medeval: A multi-level, multi-task, and multi-domain medical benchmark for language model evaluation
Z He, Y Wang, A Yan, Y Liu, EY Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2310.14088, 2023
132023
L2c: Describing visual differences needs semantic understanding of individuals
A Yan, XE Wang, TJ Fu, WY Wang
arXiv preprint arXiv:2102.01860, 2021
132021
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
A Yan, Z Yang, J Wu, W Zhu, J Yang, L Li, K Lin, J Wang, J McAuley, ...
arXiv preprint arXiv:2404.16375, 2024
102024
Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving
J Echterhoff, A Yan, K Han, A Abdelraouf, R Gupta, J McAuley
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
102024
En aquests moments el sistema no pot dur a terme l'operació. Torneu-ho a provar més tard.
Articles 1–20