Challenges in deploying machine learning: a survey of case studies
In recent years, machine learning has transitioned from a field of academic research interest
to a field capable of solving real-world business problems. However, the deployment of …
to a field capable of solving real-world business problems. However, the deployment of …
A survey of open source tools for machine learning with big data in the Hadoop ecosystem
With an ever-increasing amount of options, the task of selecting machine learning tools for
big data can be difficult. The available tools have advantages and drawbacks, and many …
big data can be difficult. The available tools have advantages and drawbacks, and many …
“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI
AI models are increasingly applied in high-stakes domains like health and conservation.
Data quality carries an elevated significance in high-stakes AI due to its heightened …
Data quality carries an elevated significance in high-stakes AI due to its heightened …
A survey on FinTech
As a new term in the financial industry, FinTech has become a popular term that describes
novel technologies adopted by the financial service institutions. This term covers a large …
novel technologies adopted by the financial service institutions. This term covers a large …
A multi-level typology of abstract visualization tasks
The considerable previous work characterizing visualization usage has focused on low-level
tasks or interactions and high-level tasks, leaving a gap between them that is not addressed …
tasks or interactions and high-level tasks, leaving a gap between them that is not addressed …
Collaboration challenges in building ml-enabled systems: Communication, documentation, engineering, and process
The introduction of machine learning (ML) components in software projects has created the
need for software engineers to collaborate with data scientists and other specialists. While …
need for software engineers to collaborate with data scientists and other specialists. While …
A survey of data partitioning and sampling methods to support big data analysis
Computer clusters with the shared-nothing architecture are the major computing platforms
for big data processing and analysis. In cluster computing, data partitioning and sampling …
for big data processing and analysis. In cluster computing, data partitioning and sampling …
Exploration and explanation in computational notebooks
Computational notebooks combine code, visualizations, and text in a single document.
Researchers, data analysts, and even journalists are rapidly adopting this new medium. We …
Researchers, data analysts, and even journalists are rapidly adopting this new medium. We …
Human factors in model interpretability: Industry practices, challenges, and needs
As the use of machine learning (ML) models in product development and data-driven
decision-making processes became pervasive in many domains, people's focus on building …
decision-making processes became pervasive in many domains, people's focus on building …
How does machine learning change software development practices?
Adding an ability for a system to learn inherently adds uncertainty into the system. Given the
rising popularity of incorporating machine learning into systems, we wondered how the …
rising popularity of incorporating machine learning into systems, we wondered how the …