[PDF][PDF] Distributional semantics resources for biomedical text processing

S Moen, TSS Ananiadou - Proceedings of LBM, 2013 - bio.nlplab.org
The openly available biomedical literature contains over 5 billion words in publication
abstracts and full texts. Recent advances in unsupervised language processing methods …

HOT: A height optimized trie index for main-memory database systems

R Binna, E Zangerle, M Pichl, G Specht… - Proceedings of the 2018 …, 2018 - dl.acm.org
We present the Height Optimized Trie (HOT), a fast and space-efficient in-memory index
structure. The core algorithmic idea of HOT is to dynamically vary the number of bits …

Dictionary-based order-preserving string compression for main memory column stores

C Binnig, S Hildenbrand, F Färber - Proceedings of the 2009 ACM …, 2009 - dl.acm.org
Column-oriented database systems [19, 23] perform better than traditional row-oriented
database systems on analytical workloads such as those found in decision support and …

Designing far memory data structures: Think outside the box

MK Aguilera, K Keeton, S Novakovic… - Proceedings of the …, 2019 - dl.acm.org
Technologies like RDMA and Gen-Z, which give access to memory outside the box, are
gaining in popularity. These technologies provide the abstraction of far memory, where …

PIM-trie: A Skew-resistant Trie for Processing-in-Memory

H Kang, Y Zhao, GE Blelloch, L Dhulipala… - Proceedings of the 35th …, 2023 - dl.acm.org
Memory latency and bandwidth are significant bottlenecks in designing in-memory indexes.
Processing-in-memory (PIM), an emerging hardware design approach, alleviates this …

Confluo: Distributed monitoring and diagnosis stack for high-speed networks

A Khandelwal, R Agarwal, I Stoica - 16th USENIX Symposium on …, 2019 - usenix.org
Confluo is an end-host stack that can be integrated with existing network management tools
to enable monitoring and diagnosis of network-wide events using telemetry data distributed …

Blink-hash: An Adaptive Hybrid Index for In-Memory Time-Series Databases

H Cha, X Hao, T Wang, H Zhang, A Akella… - Proceedings of the VLDB …, 2023 - dl.acm.org
High-speed data ingestion is critical in time-series workloads that are driven by the growth of
Internet of Things (IoT) applications. We observe that traditional tree-based indexes …

Adaptive hybrid indexes

C Anneser, A Kipf, H Zhang, T Neumann… - Proceedings of the 2022 …, 2022 - dl.acm.org
While index structures are crucial components in high-performance query processing
systems, they occupy a large fraction of the available memory. Recently-proposed compact …

Dictionary-based order-preserving string compression for main memory column stores

C Binnig, F Faerber, S Hildenbrand - US Patent 7,868,789, 2011 - Google Patents
Methods and systems are described that involve usage of dictionaries for compressing a
large set of variable-length string values with fixed-length integer keys in column stores. The …

Efficient in-memory indexing with generalized prefix trees

M Boehm, B Schlegel, PB Volk, U Fischer… - … , Technologie und Web …, 2011 - dl.gi.de
Efficient data structures for in-memory indexing gain in importance due to (1) the
exponentially increasing amount of data,(2) the growing main-memory capacity, and (3) the …