[SÁCH][B] Data cleaning

IF Ilyas, X Chu - 2019 - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

Program synthesis

S Gulwani, O Polozov, R Singh - Foundations and Trends® in …, 2017 - nowpublishers.com
Program synthesis is the task of automatically finding a program in the underlying
programming language that satisfies the user intent expressed in the form of some …

Flashmeta: A framework for inductive program synthesis

O Polozov, S Gulwani - Proceedings of the 2015 ACM SIGPLAN …, 2015 - dl.acm.org
Inductive synthesis, or programming-by-examples (PBE) is gaining prominence with
disruptive applications for automating repetitive tasks in end-user programming. However …

Synthesizing highly expressive SQL queries from input-output examples

C Wang, A Cheung, R Bodik - Proceedings of the 38th ACM SIGPLAN …, 2017 - dl.acm.org
SQL is the de facto language for manipulating relational data. Though powerful, many users
find it difficult to write SQL queries due to highly expressive constructs. While using the …

Wrex: A unified programming-by-example interaction for synthesizing readable code for data scientists

I Drosos, T Barik, PJ Guo, R DeLine… - Proceedings of the 2020 …, 2020 - dl.acm.org
Data wrangling is a difficult and time-consuming activity in computational notebooks, and
existing wrangling tools do not fit the exploratory workflow for data scientists in these …

Automated feedback generation for introductory programming assignments

R Singh, S Gulwani, A Solar-Lezama - Proceedings of the 34th ACM …, 2013 - dl.acm.org
We present a new method for automatically providing feedback for introductory
programming problems. In order to use this method, we need a reference implementation of …

Spreadsheet data manipulation using examples

S Gulwani, WR Harris, R Singh - Communications of the ACM, 2012 - dl.acm.org
Millions of computer end users need to perform tasks over large spreadsheet data, yet lack
the programming knowledge to do such tasks automatically. We present a programming by …

Flashextract: A framework for data extraction by examples

V Le, S Gulwani - Proceedings of the 35th ACM SIGPLAN Conference …, 2014 - dl.acm.org
Various document types that combine model and view (eg, text files, webpages,
spreadsheets) make it easy to organize (possibly hierarchical) data, but make it difficult to …

Component-based synthesis for complex APIs

Y Feng, R Martins, Y Wang, I Dillig… - Proceedings of the 44th …, 2017 - dl.acm.org
Component-based approaches to program synthesis assemble programs from a database
of existing components, such as methods provided by an API. In this paper, we present a …

Foofah: Transforming data by example

Z **, MR Anderson, M Cafarella… - Proceedings of the 2017 …, 2017 - dl.acm.org
Data transformation is a critical first step in modern data analysis: before any analysis can be
done, data from a variety of sources must be wrangled into a uniform format that is amenable …