Human-AI collaboration in data science: Exploring data scientists' perceptions of automated AI

D Wang, JD Weisz, M Muller, P Ram, W Geyer… - Proceedings of the …, 2019 - dl.acm.org
The rapid advancement of artificial intelligence (AI) is changing our lives in many ways. One
application domain is data science. New techniques in automating the creation of AI, known …

How ai developers overcome communication challenges in a multidisciplinary team: A case study

D Piorkowski, S Park, AY Wang, D Wang… - Proceedings of the …, 2021 - dl.acm.org
The development of AI applications is a multidisciplinary effort, involving multiple roles
collaborating with the AI developers, an umbrella term we use to include data scientists and …

Trust in AutoML: exploring information needs for establishing trust in automated machine learning systems

J Drozdal, J Weisz, D Wang, G Dass, B Yao… - Proceedings of the 25th …, 2020 - dl.acm.org
We explore trust in a relatively new area of data science: Automated Machine Learning
(AutoML). In AutoML, AI methods are used to generate and optimize machine learning …

How much automation does a data scientist want?

D Wang, QV Liao, Y Zhang, U Khurana… - arxiv preprint arxiv …, 2021 - arxiv.org
Data science and machine learning (DS/ML) are at the heart of the recent advancements of
many Artificial Intelligence (AI) applications. There is an active research thread in AI,\autoai …

Characterizing practices, limitations, and opportunities related to text information extraction workflows: a human-in-the-loop perspective

S Rahman, E Kandogan - Proceedings of the 2022 CHI Conference on …, 2022 - dl.acm.org
Information extraction (IE) approaches often play a pivotal role in text analysis and require
significant human intervention. Therefore, a deeper understanding of existing IE practices …

Eligibility of BPMN models for business process redesign

G Tsakalidis, K Vergidis, G Kougka, A Gounaris - Information, 2019 - mdpi.com
Business process redesign (BPR) is an organizational initiative for achieving competitive
multi-faceted advantages regarding business processes, in terms of cycle time, quality, cost …

[PDF][PDF] The History, Present, and Future of ETL Technology

A Simitsis, S Skiadopoulos, P Vassiliadis - DOLAP, 2023 - cs.uoi.gr
There is an abundance of data, but a large volume of it is unusable. Data may be noisy,
unstructured, stored in incompatible for direct analysis medium or format, and often …

Hyppo: using equivalences to optimize pipelines in exploratory machine learning

A Kontaxakis, D Sacharidis, A Simitsis… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
We present HYPPO, a novel system to optimize pipelines encountered in exploratory
machine learning. HYPPO exploits alternative computational paths of artifacts from past …

Towards Feature Engineering with Human and AI's Knowledge: Understanding Data Science Practitioners' Perceptions in Human&AI-Assisted Feature Engineering …

Q Zhu, D Wang, S Ma, AY Wang, Z Chen… - Proceedings of the …, 2024 - dl.acm.org
As AI technology continues to advance, the importance of human-AI collaboration becomes
increasingly evident, with numerous studies exploring its potential in various fields. One vital …

Toward building edge learning pipelines

A Gounaris, AV Michailidou… - IEEE Internet …, 2023 - ieeexplore.ieee.org
From a bird's eye point of view, large-scale data analytics workflows, eg, those executed in
popular tools, such as Apache Spark and Flink, are typically represented by directed acyclic …