Neurdb: On the design and implementation of an ai-powered autonomous database

Z Zhao, S Cai, H Gao, H Pan, S **ang, N **ng… - arxiv preprint arxiv …, 2024 - arxiv.org
Databases are increasingly embracing AI to provide autonomous system optimization and
intelligent in-database analytics, aiming to relieve end-user burdens across various industry …

A Comparison of End-to-End Decision Forest Inference Pipelines

H Guan, S Masood, M Dwarampudi, V Gunda… - Proceedings of the …, 2023 - dl.acm.org
Decision forest, including RandomForest, XGBoost, and LightGBM, dominates the machine
learning tasks over tabular data. Recently, several frameworks were developed for decision …

Chunk2vec: A novel resemblance detection scheme based on Sentence‐BERT for post‐deduplication delta compression in network transmission

C Wang, K Wang, M Li, F Wei, N **ong - IET Communications, 2024 - Wiley Online Library
Delta compression, as a complementary technique for data deduplication, has gained
widespread attention in network storage systems. It can eliminate redundant data between …

An efficient learning based approach for automatic record deduplication with benchmark datasets

M Ravikanth, S Korra, G Mamidisetti, M Goutham… - Scientific Reports, 2024 - nature.com
With technological innovations, enterprises in the real world are managing every iota of data
as it can be mined to derive business intelligence (BI). However, when data comes from …

A comparison of decision forest inference platforms from a database perspective

H Guan, MR Dwarampudi, V Gunda, H Min… - arxiv preprint arxiv …, 2023 - arxiv.org
Decision forest, including RandomForest, XGBoost, and LightGBM, is one of the most
popular machine learning techniques used in many industrial scenarios, such as credit card …

Serving Deep Learning Model in Relational Databases

L Zhou, Q Lin, K Chowdhury, S Masood… - arxiv preprint arxiv …, 2023 - arxiv.org
Serving deep learning (DL) models on relational data has become a critical requirement
across diverse commercial and scientific domains, sparking growing interest recently. In this …

CheckBullet: A Lightweight Checkpointing System for Robust Model Training on Mobile Networks

Y Jeon, H Choi, H Jeong, D Jung… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Training on time-series data generated from mobile networks is a resource-intensive and
time-consuming task that encounters various training failures. To cope with this issue, we …