Euro-Par 2015: Parallel Processing
21st International Conference on Parallel and Distributed Computing, Vienna, Austria, August 24-28, 2015, Proceedings
Samenvatting
This book constitutes the refereed proceedings of the 21st International Conference on Parallel and Distributed Computing, Euro-Par 2015, held in Vienna, Austria, in August 2015. The 51 revised full papers presented together with 2 invited papers were carefully reviewed and selected from 190 submissions. The papers are organized in the following topical sections: support tools and environments; performance modeling, prediction and evaluation; scheduling and load balancing; architecture and compilers; parallel and distributed data management; grid, cluster and cloud computing; distributed systems and algorithms; parallel and distributed programming, interfaces and languages; multi- and many-core programming; theory and algorithms for parallel computation; numerical methods and applications; and accelerator computing.
Specificaties
Inhoudsopgave
Leveraging MPI-3 Shared-Memory Extensions for Efficient PGAS Runtime Systems.- A Practical Transactional Memory Interface.- A Multicore Parallelization of Continuous Skyline Queries on Data Streams.- A Fast and Scalable Graph Coloring Algorithm for Multi-core and Many-core Architectures.- A Composable Deadlock-Free Approach to Object-Based Isolation.- Scalable Data-Driven PageRank: Algorithms, System Issues & Lessons Learned.- How Many Threads Will Be Too Many? On the Scalability of OpenMP Implementations.- Efficient Nested Dissection for Multicore Architectures.- Scheduling Trees of Malleable Tasks for Sparse Linear Algebra.- Elastic Tasks: Unifying Task Parallelism and SPMD Parallelism with an Adaptive Runtime.- Semi-discrete Matrix-Free Formulation of 3D Elastic Full Waveform Inversion Modeling.- 10,000 Performance Models per Minute - Scalability of the UG4 Simulation Framework.- Exploiting Task-Based Parallelism in Bayesian Uncertainty Quantification.- Parallelization of an Advection-Diffusion Problem Arising in Edge Plasma Physics Using Hybrid MPI/OpenMP Programming.- Behavioral Non-Portability in Scientific Numeric Computing.- Fast Parallel Suffix Array on the GPU.- Effective Barrier Synchronization on Intel Xeon Phi Coprocessor.- High Performance Multi-GPU SpMV for Multi-component PDE-based Applications.- Accelerating Lattice Boltzmann Applications with OpenACC.- High-Performance and Scalable Design of MPI-3 RMA on Xeon Phi Clusters.- Improving Performance of Convolutional Neural Networks by Separable Filters on GPU.- Iterative Sparse Triangular Solves for Preconditioning.- Targeting the Parallella.- Systematic Fusion of CUDA Kernels for Iterative Sparse Linear System Solvers.- Efficient Execution of Multiple CUDA Applications using Transparent Suspend, Resume and Migration.