A hierarchical indexing strategy for optimizing Apache Spark with HDFS to efficiently query big geospatial raster data

F Hu, C Yang, Y Jiang, Y Li, W Song… - … Journal of Digital …, 2020 - Taylor & Francis
Earth observations and model simulations are generating big multidimensional array-based
raster data. However, it is difficult to efficiently query these big raster data due to the …

A cloud-based framework for large-scale log mining through apache spark and elasticsearch

Y Li, Y Jiang, J Gu, M Lu, M Yu, EM Armstrong… - Applied Sciences, 2019 - mdpi.com
The volume, variety, and velocity of different data, eg, simulation data, observation data, and
social media data, are growing ever faster, posing grand challenges for data discovery. An …

An open-source framework unifying stream and batch processing

K Deshpande, M Rao - Inventive Computation and Information …, 2022 - Springer
Log monitoring and analysis plays critical role in identifying events and traces to understand
system behaviour at that point in time and to ensure predictive, corrective actions if required …

Improving search ranking of geospatial data based on deep learning using user behavior data

Y Li, Y Jiang, C Yang, M Yu, L Kamal… - Computers & …, 2020 - Elsevier
Finding geospatial data has been a big challenge regarding the data size and heterogeneity
across various domains. Previous work has explored using machine learning to improve …

A smart web-based geospatial data discovery system with oceanographic data as an example

Y Jiang, Y Li, C Yang, F Hu, EM Armstrong… - … International Journal of …, 2018 - mdpi.com
Discovering and accessing geospatial data presents a significant challenge for the Earth
sciences community as massive amounts of data are being produced on a daily basis. In this …

Modelling auto-scalable big data enabled log analytic framework

D Kiran, M Rao - … Technologies: Proceedings of Fifth ICCNCT 2022, 2022 - Springer
Log generation is a continuous process that generates large amounts of log data in various
forms and rates that may be analysed to acquire valuable insights. Various open-source and …

A Distributed Computing Framework to Manage, Query, and Analyze Big Geospatial Data for Urban Studies-Case Studies with Urban Heat Island and Tourist …

F Hu - 2018 - search.proquest.com
Urban system, as a sub-component of the Earth system, is complex and dynamic being
composed of numerous interactions among natural, human-built, and social entities (**g et …

Improving Geospatial Data Search Ranking Using Deep Learning and User Behaviour Data

Y Jiang - 2018 - search.proquest.com
Finding Earth science data has been a challenging problem given both the quantity of data
available and the heterogeneity of the data across a wide variety of domains. Current search …