Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions
Nanopore sequencing technology has the potential to render other sequencing
technologies obsolete with its ability to generate long reads and provide portability …
technologies obsolete with its ability to generate long reads and provide portability …
{TVM}: An automated {End-to-End} optimizing compiler for deep learning
There is an increasing need to bring machine learning to a wide diversity of hardware
devices. Current frameworks rely on vendor-specific operator libraries and optimize for a …
devices. Current frameworks rely on vendor-specific operator libraries and optimize for a …
A survey of processors with explicit multithreading
T Ungerer, B Robič, J Šilc - ACM Computing Surveys (CSUR), 2003 - dl.acm.org
Hardware multithreading is becoming a generally applied technique in the next generation
of microprocessors. Several multithreaded processors are announced by industry or already …
of microprocessors. Several multithreaded processors are announced by industry or already …
[PDF][PDF] TVM: end-to-end optimization stack for deep learning
Scalable frameworks, such as TensorFlow, MXNet, Caffe, and PyTorch drive the current
popularity and utility of deep learning. However, these frameworks are optimized for a …
popularity and utility of deep learning. However, these frameworks are optimized for a …
Piranha: A scalable architecture based on single-chip multiprocessing
The microprocessor industry is currently struggling with higher development costs and
longer design times that arise from exceedingly complex processors that are pushing the …
longer design times that arise from exceedingly complex processors that are pushing the …
[PDF][PDF] Dynamic partitioning of shared cache memory
This paper proposes dynamic cache partitioning amongst simultaneously executing
processes/threads. We present a general partitioning scheme that can be applied to set …
processes/threads. We present a general partitioning scheme that can be applied to set …
Locality exists in graph processing: Workload characterization on an ivy bridge server
Graph processing is an increasingly important application domain and is typically
communication-bound. In this work, we analyze the performance characteristics of three …
communication-bound. In this work, we analyze the performance characteristics of three …
Balancing thoughput and fairness in SMT processors
K Luo, J Gummaraju, M Franklin - 2001 IEEE International …, 2001 - computer.org
Simultaneous Multithreading (SMT) is an execution model that executes multiple threads in
parallel within a single processor pipeline. Usually, an SMT processor uses shared …
parallel within a single processor pipeline. Usually, an SMT processor uses shared …
Memristor for computing: Myth or reality?
CMOS technology and its sustainable scaling have been the enablers for the design and
manufacturing of computer architectures that have been fuelling a wider range of …
manufacturing of computer architectures that have been fuelling a wider range of …
AGORA: Attributed goal-oriented requirements analysis method
This paper presents an extended version of the goal-oriented requirements analysis method
called AGORA, where attribute values, eg contribution values and preference matrices, are …
called AGORA, where attribute values, eg contribution values and preference matrices, are …