Explainability for large language models: A survey H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai, S Wang, D Yin, M Du ACM Transactions on Intelligent Systems and Technology 15 (2), 1-38, 2024 | 418 | 2024 |
The softness of tumour-cell-derived microparticles regulates their drug-delivery efficiency Q Liang, N Bie, T Yong, K Tang, X Shi, Z Wei, H Jia, X Zhang, H Zhao, ... Nature biomedical engineering 3 (9), 729-740, 2019 | 193 | 2019 |
The impact of reasoning step length on large language models M Jin, Q Yu, D Shu, H Zhao, W Hua, Y Meng, Y Zhang, M Du arXiv preprint arXiv:2401.04925, 2024 | 64 | 2024 |
Usable XAI: 10 strategies towards exploiting explainability in the LLM era X Wu, H Zhao, Y Zhu, Y Shi, F Yang, T Liu, X Zhai, W Yao, J Li, M Du, ... arXiv preprint arXiv:2403.08946, 2024 | 33 | 2024 |
Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? M Jin, Q Yu, J Huang, Q Zeng, Z Wang, W Hua, H Zhao, K Mei, Y Meng, ... arXiv preprint arXiv:2404.07066, 2024 | 31* | 2024 |
Towards Uncovering How Large Language Model Works: An Explainability Perspective H Zhao, F Yang, S Bo, H Lakkaraju, M Du arXiv preprint arXiv:2402.10688, 2024 | 21* | 2024 |
Highly biocompatible drug-delivery systems based on DNA nanotechnology X Li, L Hong, T Song, A Rodríguez-Patón, C Chen, H Zhao, X Shi Journal of Biomedical Nanotechnology 13 (7), 747-757, 2017 | 18 | 2017 |
DNA origami frame filled with two types of single-stranded tiles C Chen, J Xu, L Ruan, H Zhao, X Li, X Shi Nanoscale 14 (14), 5340-5346, 2022 | 6 | 2022 |
Beyond single concept vector: Modeling concept subspace in llms with gaussian distribution H Zhao, H Zhao, B Shen, A Payani, F Yang, M Du ICLR 2025, 2025 | 2 | 2025 |
Isothermal approach to assemble spatial DNA nanotubes for drug delivery X Shi, H Zhao, X Li, T Song Oncotarget 5, 2018 | 2 | 2018 |
Exploring multilingual probing in large language models: A cross-language analysis D Li, M Jin, Q Zeng, H Zhao, M Du arXiv preprint arXiv:2409.14459, 2024 | 1 | 2024 |
Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability D Shu, H Zhao, J Hu, W Liu, L Cheng, M Du arXiv preprint arXiv:2501.01346, 2025 | | 2025 |
Mitigating Shortcuts in Language Models with Soft Label Encoding Z He, H Deng, H Zhao, N Liu, M Du arXiv preprint arXiv:2309.09380, 2023 | | 2023 |