Dynatrace banner image
Otmar Ertl

Otmar Ertl

Lead Researcher

Otmar is a Senior Principal Software Mathematician at Dynatrace, with a current research focus on algorithms and technologies for real-time analysis of high-volume data. He is particularly interested in any kind of data reduction techniques such as sampling or data sketching for cardinality, quantile and similarity estimation.

He received his master’s degree in physics in 2005 and a PhD in technical sciences in 2010 – both from the Vienna University of Technology.

Authored publications

A Comprehensive Benchmarking Analysis of Fault Recovery in Stream Processing Frameworks

Nowadays, several software systems rely on stream processing architectures to deliver scalable performance and handle large volumes of data in near real time. Stream processing frameworks facilitate scalable computing by distributing the application's execution across multiple machines. Despite performance being extensively studied, the measurement...

Adriano Vogel, Sören Henning, Esteban Perez-Wohlfeil, Otmar Ertl, Rick Rabiser

| 18th ACM International Conference on Distributed and Event-Based Systems (DEBS'24) | 2024

ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale

This work introduces ExaLogLog, a new data structure for approximate distinct counting, which has the same practical properties as the popular HyperLogLog algorithm. It is commutative, idempotent, mergeable, reducible, has a constant-time insert operation, and supports distinct counts up to the exa-scale. At the same time, as theoretically derived ...

Otmar Ertl 

| Data Structures and Algorithms arXiv:2402.13726 | 2024

High-level Stream Processing: A Complementary Analysis of Fault Recovery

Parallel computing is very important to accelerate the performance of software systems. Additionally, considering that a recurring challenge is to process high data volumes continuously, stream processing emerged as a paradigm and software architectural style. Several software systems rely on stream processing to deliver scalable performance, where...

Adriano Vogel, Sören Henning, Esteban Perez-Wohlfeil, Otmar Ertl, Rick Rabiser

| Distributed, Parallel, and Cluster Computing, arXiv:2405.07917 | 2024

JumpBackHash: Say Goodbye to the Modulo Operation to Distribute Keys Uniformly to Buckets

The distribution of keys to a given number of buckets is a fundamental task in distributed data processing and storage. A simple, fast, and therefore popular approach is to map the hash values of keys to buckets based on the remainder after dividing by the number of buckets. Unfortunately, these mappings are not stable when the number of buckets ch...

Otmar Ertl

| Data Structures and Algorithms arXiv:2403.18682 | 2024

ShuffleBench: A Benchmark for Large-Scale Data Shuffling Operations with Distributed Stream Processing Frameworks

Distributed stream processing frameworks help building scalable and reliable applications that perform transformations and aggregations on continuous data streams. This paper introduces ShuffleBench, a novel benchmark to evaluate the performance of modern stream processing frameworks. In contrast to other benchmarks, it focuses on use cases where s...

Sören Henning, Adriano Vogel, Michael Leichtfried, Otmar Ertl, Rick Rabiser

| ICPE '24: Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering | 2024

A systematic mapping of performance in distributed stream processing systems

Several software systems are built upon stream processing architectures to process large amounts of data in near real-time. Today's distributed stream processing systems (DSPSs) spread the processing among multiple machines to provide scalable performance. However, high-performance and Quality of Service (QoS) in distributed stream processing are c...

Adriano Vogel, Sören Henning,Otmar Ertl, Rick Rabiser

| Euromicro Conference on Software Engineering and Advanced Applications | 2023

Benchmarking Stream Processing Frameworks for Large Scale Data Shuffling

Distributed stream processing frameworks help building scalable and reliable applications that perform transformations and aggregations on continuous data streams. We outline our ongoing research on designing a new benchmark for distributed stream processing frameworks. In contrast to other benchmarks, it focuses on use cases where stream processin...

Sören HenningAdriano Vogel, Michael Leichtfried, Otmar Ertl, Rick Rabiser 

| Softwaretechnik-Trends | 2023

Cardinality Estimation Adaptive Cuckoo Filters (CE-ACF): Approximate Membership Check and Distinct Query Count for High-Speed Network Monitoring

In network monitoring applications, it is often beneficial to employ a fast approximate set-membership filter to check if a given packet belongs to a monitored flow. Recent adaptive filter designs, such as the Adaptive Cuckoo Filter, are especially promising for such use cases as they adapt fingerprints to eliminate recurring false positives. In ma...

Pedro Reviriego, Jim Apple, Alvaro Alonso, Otmar Ertl, Niv Dayan

| IEEE/ACM Transactions on Networking | 2023

UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting

Since its invention HyperLogLog has become the standard algorithm for approximate distinct counting. Due to its space efficiency and suitability for distributed systems, it is widely used and also implemented in numerous databases. This work presents UltraLogLog, which shares the same practical properties as HyperLogLog. It is commutative, idempote...

Otmar Ertl

| arXiv preprint arXiv:2308.16862 | 2023

Computing the Similarity Estimate Using Approximate Memory

In many computing applications there is a need to compute the similarity of sets of elements. When the sets have many elements or comparison involve many sets, computing the similarity requires significant computational effort and storage capacity. As in most cases, a reasonably accurate estimate is sufficient, many algorithms for similarity estima...

Pedro Reviriego, Shanshan Liu, Otmar Ertl, Fabrizio Lombardi

| IEEE Transactions on Emerging Topics in Computing | 2021