Large models for time series and spatio-temporal data: A survey and outlook
Temporal data, notably time series and spatio-temporal data, are prevalent in real-world
applications. They capture dynamic system measurements and are produced in vast …
applications. They capture dynamic system measurements and are produced in vast …
Hyperimpute: Generalized iterative imputation with automatic model selection
Consider the problem of imputing missing values in a dataset. One the one hand,
conventional approaches using iterative imputation benefit from the simplicity and …
conventional approaches using iterative imputation benefit from the simplicity and …
Event Stream GPT: a data pre-processing and modeling library for generative, pre-trained transformers over continuous-time sequences of complex events
Generative, pre-trained transformers (GPTs, a type of" Foundation Models") have reshaped
natural language processing (NLP) through their versatility in diverse downstream tasks …
natural language processing (NLP) through their versatility in diverse downstream tasks …
[HTML][HTML] Lifting hospital electronic health record data treasures: challenges and opportunities
Electronic health records (EHRs) have been successfully used in data science and machine
learning projects. However, most of these data are collected for clinical use rather than for …
learning projects. However, most of these data are collected for clinical use rather than for …
What'sa good imputation to predict with missing values?
How to learn a good predictor on data with missing values? Most efforts focus on first
imputing as well as possible and second learning on the completed data to predict the …
imputing as well as possible and second learning on the completed data to predict the …
An extensive data processing pipeline for mimic-iv
M Gupta, B Gallamoza, N Cutrona… - … Learning for Health, 2022 - proceedings.mlr.press
An increasing amount of research is being devoted to applying machine learning methods to
electronic health record (EHR) data for various clinical purposes. This growing area of …
electronic health record (EHR) data for various clinical purposes. This growing area of …
WOODS: Benchmarks for out-of-distribution generalization in time series
Machine learning models often fail to generalize well under distributional shifts.
Understanding and overcoming these failures have led to a research field of Out-of …
Understanding and overcoming these failures have led to a research field of Out-of …
Energy forecasting with robust, flexible, and explainable machine learning algorithms
Energy forecasting is crucial in scheduling and planning future electric load, so as to
improve the reliability and safeness of the power grid. Despite recent developments of …
improve the reliability and safeness of the power grid. Despite recent developments of …
TradeMaster: a holistic quantitative trading platform empowered by reinforcement learning
The financial markets, which involve over\$90 trillion market capitals, attract the attention of
innumerable profit-seeking investors globally. Recent explosion of reinforcement learning in …
innumerable profit-seeking investors globally. Recent explosion of reinforcement learning in …
HiRID-ICU-Benchmark--A Comprehensive Machine Learning Benchmark on High-resolution ICU Data
The recent success of machine learning methods applied to time series collected from
Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks …
Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks …