Prati
An Yan
Naslov
Citirano
Citirano
Godina
CosRec: 2D convolutional neural networks for sequential recommendation
A Yan, S Cheng, WC Kang, M Wan, J McAuley
Proceedings of the 28th ACM international conference on information and …, 2019
1362019
RadBERT: adapting transformer-based language models to radiology
A Yan, J McAuley, X Lu, J Du, EY Chang, A Gentili, CN Hsu
Radiology: Artificial Intelligence 4 (4), e210258, 2022
1302022
PA3D: Pose-action 3D machine for video recognition
A Yan, Y Wang, Z Li, Y Qiao
Proceedings of the ieee/cvf conference on computer vision and pattern …, 2019
1092019
Bridging language and items for retrieval and recommendation
Y Hou, J Li, Z He, A Yan, X Chen, J McAuley
arXiv preprint arXiv:2403.03952, 2024
832024
Weakly supervised contrastive learning for chest x-ray report generation
A Yan, Z He, X Lu, J Du, E Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2109.12242, 2021
812021
Gpt-4v in wonderland: Large multimodal models for zero-shot smartphone gui navigation
A Yan, Z Yang, W Zhu, K Lin, L Li, J Wang, J Yang, Y Zhong, J McAuley, ...
arXiv preprint arXiv:2311.07562, 2023
802023
Learning concise and descriptive attributes for visual recognition
A Yan, Y Wang, Y Zhong, C Dong, Z He, Y Lu, WY Wang, J Shang, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
682023
Gpt-4v (ision) as a generalist evaluator for vision-language tasks
X Zhang, Y Lu, W Wang, A Yan, J Yan, L Qin, H Wang, X Yan, WY Wang, ...
arXiv preprint arXiv:2311.01361, 2023
602023
xgen-mm (blip-3): A family of open large multimodal models
L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...
arXiv preprint arXiv:2408.08872, 2024
472024
Personalized showcases: Generating multi-modal explanations for recommendations
A Yan, Z He, J Li, T Zhang, J McAuley
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
402023
Visualize before you write: Imagination-guided open-ended text generation
W Zhu, A Yan, Y Lu, W Xu, XE Wang, M Eckstein, WY Wang
arXiv preprint arXiv:2210.03765, 2022
352022
Personalized complementary product recommendation
A Yan, C Dong, Y Gao, J Fu, T Zhao, Y Sun, J McAuley
The ACM Web Conference, 2022
332022
Multimodal text style transfer for outdoor vision-and-language navigation
W Zhu, XE Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang
arXiv preprint arXiv:2007.00229, 2020
322020
Robust and interpretable medical image classifiers via concept bottleneck models
A Yan, Y Wang, Y Zhong, Z He, P Karypis, Z Wang, C Dong, A Gentili, ...
arXiv preprint arXiv:2310.03182, 2023
312023
A survey on large language models for critical societal domains: Finance, healthcare, and law
ZZ Chen, J Ma, X Zhang, N Hao, A Yan, A Nourbakhsh, X Yang, ...
arXiv preprint arXiv:2405.01769, 2024
252024
Cross-lingual vision-language navigation
A Yan, XE Wang, J Feng, L Li, WY Wang
arXiv preprint arXiv:1910.11301, 2019
212019
List items one by one: A new data source and learning paradigm for multimodal llms
A Yan, Z Yang, J Wu, W Zhu, J Yang, L Li, K Lin, J Wang, J McAuley, ...
arXiv preprint arXiv:2404.16375, 2024
132024
L2c: Describing visual differences needs semantic understanding of individuals
A Yan, XE Wang, TJ Fu, WY Wang
arXiv preprint arXiv:2102.01860, 2021
132021
MedEval: a multi-level, multi-task, and multi-domain medical benchmark for language model evaluation
Z He, Y Wang, A Yan, Y Liu, EY Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2310.14088, 2023
122023
Driving through the concept gridlock: Unraveling explainability bottlenecks in automated driving
J Echterhoff, A Yan, K Han, A Abdelraouf, R Gupta, J McAuley
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
112024
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20