Interactive and visual prompt engineering for ad-hoc task adaptation with large language models

H Strobelt, A Webson, V Sanh, B Hoover… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
State-of-the-art neural language models can now be used to solve ad-hoc language tasks
through zero-shot prompting without the need for supervised training. This approach has …

Big data visualization and analytics: Future research challenges and emerging applications

G Andrienko, N Andrienko, S Drucker, JD Fekete… - 2021 - dspace.mit.edu
In the context of data visualization and analytics, this report outlines some of the challenges
and emerging applications that arise in the Big Data era. In particularly, fourteen …

The art and practice of data science pipelines: A comprehensive study of data science pipelines in theory, in-the-small, and in-the-large

S Biswas, M Wardat, H Rajan - … of the 44th International Conference on …, 2022 - dl.acm.org
Increasingly larger number of software systems today are including data science
components for descriptive, predictive, and prescriptive analytics. The collection of data …

Incremental permutation feature importance (iPFI): Towards online explanations on data streams

F Fumagalli, M Muschalik, E Hüllermeier, B Hammer - Machine Learning, 2023 - Springer
Explainable artificial intelligence has mainly focused on static learning scenarios so far. We
are interested in dynamic scenarios where data is sampled progressively, and learning is …

An incremental dimensionality reduction method for visualizing streaming multidimensional data

T Fujiwara, JK Chou, S Shilpika, P Xu… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Dimensionality reduction (DR) methods are commonly used for analyzing and visualizing
multidimensional data. However, when data is a live streaming feed, conventional DR …

GPGPU linear complexity t-SNE optimization

N Pezzotti, J Thijssen, A Mordvintsev… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
In recent years the t-distributed Stochastic Neighbor Embedding (t-SNE) algorithm has
become one of the most used and insightful techniques for exploratory data analysis of high …

KD-Box: Line-segment-based KD-tree for interactive exploration of large-scale time-series data

Y Zhao, Y Wang, J Zhang, CW Fu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Time-series data-usually presented in the form of lines-plays an important role in many
domains such as finance, meteorology, health, and urban informatics. Yet, little has been …

Report on the first and second interdisciplinary time series analysis workshop (ITISA)

T Palpanas, V Beckmann - ACM SIGMOD Record, 2019 - dl.acm.org
The analysis of time-series data associated with modernday industrial operations and
scientific experiments is now pushing both computational power and resources to their …

Dbpal: A fully pluggable nl2sql training pipeline

N Weir, P Utama, A Galakatos, A Crotty… - Proceedings of the …, 2020 - dl.acm.org
Natural language is a promising alternative interface to DBMSs because it enables non-
technical users to formulate complex questions in a more concise manner than SQL …

Featureenvi: Visual analytics for feature engineering using stepwise selection and semi-automatic extraction approaches

A Chatzimparmpas, RM Martins… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The machine learning (ML) life cycle involves a series of iterative steps, from the effective
gathering and preparation of the data—including complex feature engineering processes …