Asynchronous nested parallelism for dynamic applications in distributed memory | Conference Paper individual record
abstract

© Springer International Publishing Switzerland 2016. Nested parallelism is of increasing interest for both expressivity and performance. Many problems are naturally expressed with this divide-and-conquer software design approach. In addition, programmers with target architecture knowledge employ nested parallelism for performance, imposing a hierarchy in the application to increase locality and resource utilization, often at the cost of implementation complexity. While dynamic applications are a natural fit for the approach, support for nested parallelism in distributed systems is generally limited to well-structured applications engineered with distinct phases of intranode computation and inter-node communication. This model makes expressing irregular applications difficult and also hurts performance by introducing unnecessary latency and synchronizations. In this paper we describe an approach to asynchronous nested parallelism which provides uniform treatment of nested computation across distributed memory. This approach allows efficient execution while supporting dynamic applications which cannot be mapped onto the machine in the rigid manner of regular applications. We use several graph algorithms as examples to demonstrate our library’s expressivity, flexibility, and performance.

author list (cited authors)
Papadopoulos, I., Thomas, N., Fidel, A., Hoxha, D., Amato, N. M., & Rauchwerger, L.
editor list (cited editors)
Shen, X., Mueller, F., & Tuck, J.
publication date
2016
published in
citation count

1