Towards urban general intelligence: A review and outlook of urban foundation models

W Zhang, J Han, Z Xu, H Ni, H Liu, H **ong - arxiv preprint arxiv …, 2024‏ - arxiv.org
Machine learning techniques are now integral to the advancement of intelligent urban
services, playing a crucial role in elevating the efficiency, sustainability, and livability of …

Deep learning for cross-domain data fusion in urban computing: Taxonomy, advances, and outlook

X Zou, Y Yan, X Hao, Y Hu, H Wen, E Liu, J Zhang… - Information …, 2025‏ - Elsevier
As cities continue to burgeon, Urban Computing emerges as a pivotal discipline for
sustainable development by harnessing the power of cross-domain data fusion from diverse …

Urbanclip: Learning text-enhanced urban region profiling with contrastive language-image pretraining from the web

Y Yan, H Wen, S Zhong, W Chen, H Chen… - Proceedings of the …, 2024‏ - dl.acm.org
Urban region profiling from web-sourced data is of utmost importance for urban computing.
We are witnessing a blossom of LLMs for various fields, especially in multi-modal data …

Urban foundation models: A survey

W Zhang, J Han, Z Xu, H Ni, H Liu… - Proceedings of the 30th …, 2024‏ - dl.acm.org
Machine learning techniques are now integral to the advancement of intelligent urban
services, playing a crucial role in elevating the efficiency, sustainability, and livability of …

Knowledge-infused contrastive learning for urban imagery-based socioeconomic prediction

Y Liu, X Zhang, J Ding, Y **, Y Li - … of the ACM web conference 2023, 2023‏ - dl.acm.org
Monitoring sustainable development goals requires accurate and timely socioeconomic
statistics, while ubiquitous and frequently-updated urban imagery in web like satellite/street …

Urban region embedding via multi-view contrastive prediction

Z Li, W Huang, K Zhao, M Yang, Y Gong… - Proceedings of the AAAI …, 2024‏ - ojs.aaai.org
Recently, learning urban region representations utilizing multi-modal data (information
views) has become increasingly popular, for deep understanding of the distributions of …

Multi-temporal image analysis of wetland dynamics using machine learning algorithms

RW Aslam, I Naz, H Shu, J Yan, A Quddoos… - Journal of …, 2024‏ - Elsevier
Wetlands play a crucial role in enhancing groundwater quality, mitigating natural hazards,
controlling erosion, and providing essential habitats for unique flora and wildlife. Despite …

Profiling urban streets: A semi-supervised prediction model based on street view imagery and spatial topology

M Chen, Z Li, W Huang, Y Gong, Y Yin - Proceedings of the 30th ACM …, 2024‏ - dl.acm.org
With the expansion and growth of cities, profiling urban areas with the advent of multi-modal
urban datasets (eg, points-of-interest and street view imagery) has become increasingly …

Geolocation representation from large language models are generic enhancers for spatio-temporal learning

J He, T Nie, W Ma - arxiv preprint arxiv:2408.12116, 2024‏ - arxiv.org
In the geospatial domain, universal representation models are significantly less prevalent
than their extensive use in natural language processing and computer vision. This …

ReFound: Crafting a Foundation Model for Urban Region Understanding upon Language and Visual Foundations

C **ao, J Zhou, Y **ao, J Huang, H **ong - Proceedings of the 30th ACM …, 2024‏ - dl.acm.org
Understanding urban regional characteristics is pivotal in driving critical insights for urban
planning and management. We have witnessed the successful application of pre-trained …