Landscape of High-Performance Python to Develop Data Science and Machine Learning Applications
Python has become the prime language for application development in the data science and
machine learning domains. However, data scientists are not necessarily experienced …
machine learning domains. However, data scientists are not necessarily experienced …
Efficient execution of user-defined functions in SQL queries
User-defined functions (UDFs) have been widely used to overcome the expressivity
limitations of SQL and complement its declarative nature with functional capabilities. UDFs …
limitations of SQL and complement its declarative nature with functional capabilities. UDFs …
YeSQL: " you extend SQL" with rich and highly performant user-defined functions in relational databases
The diversity and complexity of modern data management applications have led to the
extension of the relational paradigm with syntactic and semantic support for User-Defined …
extension of the relational paradigm with syntactic and semantic support for User-Defined …
[PDF][PDF] Daphne: An open and extensible system infrastructure for integrated data analysis pipelines
Integrated data analysis (IDA) pipelines---that combine data management (DM) and query
processing, high-performance computing (HPC), and machine learning (ML) training and …
processing, high-performance computing (HPC), and machine learning (ML) training and …
Bladedisc: Optimizing dynamic shape machine learning workloads via compiler approach
Compiler optimization plays an increasingly important role to boost the performance of
machine learning models for data processing and management. With increasingly complex …
machine learning models for data processing and management. With increasingly complex …
Building a compiled query engine in python
The simplicity of Python and its rich set of libraries has made it the most popular language
for data science. Moreover, the interpreted nature of Python offers an easy debugging …
for data science. Moreover, the interpreted nature of Python offers an easy debugging …
User-defined functions in modern data engines
Modern data management applications involve complex processing tasks over large
volumes of data. Although this falls naturally within the scope of relational databases, many …
volumes of data. Although this falls naturally within the scope of relational databases, many …
Predicate pushdown for data science pipelines
Predicate pushdown is a widely adopted query optimization. Existing systems and prior work
mostly use pattern-matching rules to decide when a predicate can be pushed through …
mostly use pattern-matching rules to decide when a predicate can be pushed through …
Containerized execution of UDFs: an experimental evaluation
User-defined functions (UDFs) have long been used as the de facto way to extend the
capabilities of data management systems. However, they are restricted to the specificities of …
capabilities of data management systems. However, they are restricted to the specificities of …
[PDF][PDF] The key to effective udf optimization: Before inlining, first perform outlining
Although user-defined functions (UDFs) are a popular way to augment SQL's declarative
approach with procedural code, the mismatch between programming paradigms creates a …
approach with procedural code, the mismatch between programming paradigms creates a …