General IPDPS Info



2019 Advance Program

Please visit the IPDPS website regularly for updates, since there may be schedule revisions. Authors who have corrections should send email to giving full details.

MONDAY - 20 May 2019




Monday Workships
To be announced

TUESDAY - 21 May 2019


Opening Session
8:00 AM - 8:30 AM


Keynote Session
8:30 AM - 9:30 AM

KEYNOTE SPEAKER – To be announced

Morning Break 9:30 AM -10:00 AM

PhD Forum
All day

PhD Forum Posters

On Display All Day Tuesday and Wednesday

Parallel Technical
Sessions 1, 2, 3, & 4

10:00 AM - 12:00 PM

SESSION 1: Graph Algorithms 1

LACC: A Linear-Algebraic Algorithm for Finding Connected Components in Distributed Memory
Ariful Azad (Indiana University), Aydin Buluc (Lawrence Berkeley National Lab, UC Berkeley)

Shared-memory Exact Minimum Cuts
Monika Henzinger, Alexander Noe, and Christian Schulz (University of Vienna)

Distributed Weighted All Pairs Shortest Paths Through Pipelining
Udit Agarwal and Vijaya Ramachandran (UT Austin)

Local Distributed Algorithms in Highly Dynamic Networks
Philipp Bamberger, Fabian Kuhn, and Yannic Maus (University of Freiburg)


SESSION 2: HPC Systems


Effects and benefits of node sharing strategies in HPC batch systems
Alvaro Frank, Tim Süss, and André Brinkmann (Johannes Gutenberg University Mainz)


Design Space Exploration of Next-Generation HPC Machines
Constantino Gomez, Francesc Martinez, Adria Armejach, Marc Casas, Filippo Mantovani, and Miquel Moreto (Barcelona Supercomputing Center)


A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning
Tal Ben-Nun, Maciej Besta, Simon Huber, Alexandros Nikolaos Ziogas, Daniel Peter, Torsten Hoefler (ETH Zurich)


Double-precision FPUs in High-Performance Computing: an Embarrassment of Riches?
Jens Domke and Kazuaki Matsumura (Tokyo Institute of Technology), Mohamed Wahib (AIST-TokyoTech Real World Big-Data Computation Open Innovation Laboratory), Haoyu Zhang, Keita Yashima, Toshiki Tsuchikawa, Yohei Tsuji, and Artur Podobas (Tokyo Institute of Technology), Satoshi Matsuoka (RIKEN Center for Computational Science/ R-CCS)


SESSION 3: Numerical Algorithms


Communication-avoiding CholeskyQR2 for rectangular matrices
Edward Hutter and Edgar Solomonik (University of Illinois at Urbana-Champaign)


Asynchronous Multigrid Methods
Jordi Wolfson-Pou and Edmond Chow (Georgia Institute of Technology) -


Fast Batched Matrix Multiplication for Small Sizes using Half Precision Arithmetic on GPUs
Ahmad Abdelfattah, Stanimire Tomov, and Jack Dongarra (University of Tennessee) 

Load-Balanced Sparse MTTKRP on GPUs
Israt Nisa (The Ohio State University), Jiajia Li (Pacific Northwest National Laboratory), Aravind Sukumaran Rajam (The Ohio State University), Richard Vuduc (Georgia Institute of Technology), and P. (Saday) Sadayappan (The Ohio State University)

SESSION 4: Scheduling and Load Balancing I


Practically Efficient Scheduler for Minimizing Average Flow Time of Parallel Jobs
Kunal Agrawal and I-Ting Angelina Lee (Washington University in St. Louis), Jing Li (New Jersey Institute of Technology), Kefu Lu (Washington University in St. Louis), and Benjamin Moseley (Carnegie Mellon University)

Scheduling on (Un-)Related Machines with Setup Times

Klaus Jansen and Marten Maack (University of Kiel) and Alexander Mäcker (Paderborn University)

A scalable clustering-based task scheduler for homogeneous processors using DAG partitioning
M. Yusuf Özkaya (Georgia Institute of Technology), Anne Benoit (LIP, ENS Lyon), Bora Ucar (CNRS), Julien Herrmann and Ümit V. Çatalyürek (Georgia Institute of Technology)


Reservation Strategies for Stochastic Jobs
Guillaume Aupy (French Institute for Research in Computer Science and Automation (INRIA), University of Bordeaux), Ana Gainaru (Department of EECS, Vanderbilt University, Nashville; Labri, Univ. of Bordeaux), Valentin Honoré (Labri, Univ. of Bordeaux; Inria), Padma Raghavan (Department of EECS, Vanderbilt University, Nashville; University of Tennessee Knoxville), Yves Robert (Laboratoire LIP, ENS Lyon; University of Tennessee Knoxville), and Hongyang Sun (Department of EECS, Vanderbilt University, Nashville)

Parallel Technical Sessions 5, 6, 7, & 8
1:30 PM - 3:30 PM

SESSION 5: Accelerating Neural Networks


Exploiting Flow Graph of System of ODEs to Accelerate the Simulation of Biologically-Detailed Neural Networks
Bruno Magalhaes (Blue Brain Project, École polytechnique fédérale de Lausanne), Michael Hines (Yale School of Medicine, Yale University), Thomas Sterling (Indiana university), and Felix Schuermann (Blue Brain Project, École polytechnique fédérale de Lausanne)


Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training
Jiawen Liu and Dong Li (University of California, Merced), Gokcen Kestor (Pacific Northwest National Laboratory), and Jeffrey Vetter (Oak Ridge National Laboratory)


Dynamic Memory Management for GPU-based training of Deep Neural Networks
Shriram S B, Anshuj Garg, and Purushottam Kulkarni (Indian Institute of Technology Bombay)


Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained Parallelism
Nikoli Dryden (University of Illinois at Urbana-Champaign, Lawrence Livermore National Laboratory), Naoya Maruyama, Tom Benson, and Tim Moon (Lawrence Livermore National Laboratory), Marc Snir (University of Illinois at Urbana-Champaign), and Brian Van Essen (Lawrence Livermore National Laboratory)



SESSION 6: GPU Computing I


Excavating the Potential of GPU for Accelerating Graph Traversal
Pengyu Wang, Lu Zhang, Chao Li, and Minyi Guo (Shanghai Jiao Tong University)


ParILUT -  A Parallel Threshold ILU for GPUs 
Hartwig Anzt (Karlsruhe Institute of Technology, University of Tennessee), Tobias Ribizel (Karlsruhe Institute of Technology), Goran Flegar (Universidad Jaume I), Edmond Chow (Georgia Institute of Technology, Oak Ridge National Lab), and Jack Dongarra (University of Tennessee, Oak Ridge National Lab)


C-GDR: High-Performance Container-aware GPUDirect MPI Communication Schemes on RDMA Networks
Jie Zhang, Xiaoyi Lu, Ching-Hsiang Chu, and Dhabaleswar K. (D.K.) Panda (The Ohio State University)


Slate: Enabling Workload-Aware Efficient Multiprocessing for Modern GPGPUs
Tyler Allen, Rong Ge, and Xizhou Feng (Clemson University)



SESSION 7: Learning and Prediction Systems


A Deep Recurrent Neural Network Based Predictive Control Framework for Reliable Distributed Stream Data Processing
Jielong Xu, Jian Tang, Zhiyuan Xu, and Chengxiang Yin (Syracuse University), Kevin Kwiat and Charles Kamhoua (ARL)


Architecting Racetrack Memory preshift through pattern-based prediction mechanisms
Adrian Colaso, Pablo Prieto, Pablo Abad, Valentin Puente, and Jose Angel Gregorio (University of Cantabria)


DLHub: Model and Data Serving for Science
Ryan Chard (Argonne National Laboratory), Zhuozhao Li, Kyle Chard, Logan Ward, Yadu Babuji, Anna Woodard, Steven Tuecke, Ben Blaiszik, and Michael Franklin (University of Chicago), Ian Foster (Argonne National Laboratory)


Identifying Latent Reduced Models to Precondition Lossy Compression
Huizhang Luo and Qing Liu (New Jersey Institute of Technology), Hong Jiang (University of Texas at Arlington), and Mengchu Zhou (New Jersey Institute of Technology)



SESSION 8: Multicore Computing


QoS-Driven Coordinated Management of Resources to Save Energy in Multicore Systems
Mehrzad Nejat, Miquel Pericas, and Per Stenström (Chalmers University of Technology)


Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems
Md Vasimuddin and Sanchit Misra (Intel Corporation), Heng Li (Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute; Department of Biomedical Informatics, Harvard Medical School), and Srinivas Aluru (Georgia Institute of Technology)


Power and Performance Tradeoffs for Visualization Algorithms
Stephanie Labasan (Lawrence Livermore National Laboratory, University of Oregon), Matthew Larsen (Lawrence Livermore National Laboratory), Hank Childs (University of Oregon), and Barry Rountree (Lawrence Livermore National Laboratory)


Northup: Divide-and-Conquer Programming in Systems with Heterogeneous Memories and Processors
Shuai Che (Alibaba) and Jieming Yin (AMD Research)


Afternoon Break 3:30 PM - 4:00 PM

Best Papers

4:00 PM - 6:00 PM

Best Paper Nominees - Plenary


Distributed Approximate k-Core Decomposition and Min-Max Edge Orientation: Breaking the Diameter Barrier
T-H. Hubert Chan (University of Hong Kong), Mauro Sozio (Telecom ParisTech), and Bintao Sun (University of Hong Kong)


FALCON: Efficient Designs for Zero-copy MPI Datatype Processing on Emerging Architectures
Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni, and Dhabaleswar K. (DK) Panda (The Ohio State University)


Two Elementary Instructions make Compare-and-Swap
Pankaj Khanchandani and Roger Wattenhofer (ETH Zurich)


Robust Dynamic Resource Allocation via Probabilistic Task Pruning in Heterogeneous Computing Systems
James Gentry, Chavit Denninnart, and Mohsen Amini Salehi (University of Louisiana at Lafayette)

WEDNESDAY - 22 May 2019


Keynote Session
8:30 AM – 9:30 AM

KEYNOTE SPEAKER – To be announced

Morning Break 9:30 AM - 10:00 AM

PhD Forum
All day

PhD Forum Posters

On Display All Day Tuesday and Wednesday

Parallel Technical
Sessions 9, 10, 11, & 12

10:00 AM - 12:00 PM

SESSION 9: Cloud Computing


Z-Dedup:A Case for Deduplicating Compressed Contents in Cloud
Zhichao Yan and Hong Jiang (University of Texas at Arlington), Yujuan Tan (Chongqing University), and Stan Skelton (NetApp)


An Architecture and Stochastic Method for Database Container Placement in the Edge-Fog-Cloud Continuum
Petar Kochovski (University of Ljubljana), Rizos Sakellariou (University of Manchester), Marko Bajec (University of Ljubljana) Pavel Drobintsev (Peter the Great St. Petersburg Polytechnic University), and Vlado Stankovski (University of Ljubljana)


Online Live VM Migration Algorithms to Minimize Total Migration Time and Downtime
Nikos Tziritas and Cheng-Zhong Xu (Shenzhen Institutes of Advanced Technology), Samee Khan (North Dakota State University), Thanasis Loukopoulos (University of Thessaly), and Albert Zomaya (University of Sydney)


Semantics-aware Virtual Machine Image Management in IaaS Clouds
Nishant Saurabh (University of Innsbruck, Klagenfurt University), Julian Remmers (University of Innsbruck), Dragi Kimovski (Klagenfurt University), Radu Prodan (Klagenfurt University, University of Innsbruck), and Jorge G. Barbosa (LIACC, Faculdade de Engenharia da Universidade do Porto)



SESSION 10: Graph Algorithms II


Composing Optimization Techniques for Vertex-Centric Graph Processing via Communication Channels
Yongzhe Zhang (National Institute of Informatics, SOKENDAI) and Zhenjiang Hu (National Institute of Informatics, University of Tokyo)


CuSP: A Customizable Streaming Edge Partitioner for Distributed Graph Analytics
Loc D. Hoang, Roshan Dathathri, Gurbinder Gill, and Keshav Pingali (The University of Texas at Austin)


Accelerating Sequence Alignment to Graphs
Chirag Jain (Georgia Institute of Technology), Sanchit Misra (Intel Corporation), Haowen Zhang (Georgia Institute of Technology), Alexander Dilthey (University Hospital of Dusseldorf), and Srinivas Aluru (Georgia Institute of Technology)


Accurate, Efficient and Scalable Graph Embedding
Hanqing Zeng, Hongkuan Zhou, and Ajitesh Srivastava (University of Southern California), Rajgopal Kannan (US Army Research Lab), and Viktor Prasanna (University of Southern California)



SESSION 11: Linear Algebra


Matrix Powers Kernels for Thick-restart Lanczos with Explicit External Deflation
Ichitaro Yamazaki (University of Tennessee), Zhaojun Bai (University of California, Davis), and Jack Dongarra (University of Tennessee)


Revisiting the I/O Complexity of Fast Matrix Multiplication with Recomputations
Roy Nissim and Oded Schwartz (The Hebrew University of Jerusalem)


Computation of Matrix Chain Products on Parallel Machines
Elad Weiss and Oded Schwartz (The Hebrew University Jerusalem)


Overlapping Communications with Other Communications and its Application to Distributed Dense Matrix Computations
Hua Huang (Georgia Institute of Technology) and Edmond Chow (Georgia Institute of Technology)



SESSION 12: Storage Systems


Data Jockey: Automatic Data Management for HPC Multi-Tiered Storage Systems
Woong Shin, Christopher D. Brumgard, Bing Xie, and Sudharshan S. Vazhkudai (Oak Ridge National Laboratory), Devarshi Ghoshal (Lawrence Berkeley National Laboratory), Sarp Oral (Oak Ridge National Laboratory), and Lavanya Ramakrishnan (Lawrence Berkeley National Laboratory)


NCQ-Aware I/O Scheduling for Conventional Solid State Drives
Hao Fan and Song Wu (HUST), Shadi Ibrahim (INRIA), Ximing Chen, Hai Jin, and Jiang Xiao (HUST)


Optimizing the Parity-Check Matrix for Efficient Decoding of RS-based Cloud Storage Systems
Junqing Gu, Chentao Wu, Xin Xie, Han Qiu, Jie Li, and Minyi Guo (Shanghai Jiao Tong University), Xubin He (Temple University), Yuanyuan Dong and Yafei Zhao (Alibaba Group)


D3: Deterministic Data Distribution for Efficient Data Reconstruction in Erasure-Coded Distributed Storage Systems
Zhipeng Li, Min Lv, Yinlong Xu, Yongkun Li, and Liangliang Xu (University of Science and Technology of China)

Parallel Technical Sessions 13, 14, 15, & 16
1:30 PM – 3:30 PM

SESSION 13: Applications I


SunwayLB: Enabling Extreme-Scale Lattice Boltzmann Method Based Computing Fluid Dynamics Simulations on Sunway TaihuLight
Zhao Liu (Tsinghua University, National Supercomputing Center in Wuxi), Xuesen Chu (Tsinghua University, China Ship Scientific Research Center), Xiaojing Lv, Hongsong Meng, and Shupeng Shi (National Supercomputing Center in Wuxi), Wenji Han (China Ship Scientific Research Center), Haohuan Fu and Guangwen Yang (Tsinghua University, National Supercomputing Center in Wuxi)


Containers in HPC: A Scalability and Portability Study in Production Biological Simulations
Oleksandr Rudyy, Marta Garcia-Gasulla, Raül Sirvent, Filippo Mantovani, Alfonso Santiago, and Mariano Vazquez (Barcelona Supercomputing Center)


PaKman: Scalable Assembly of Large Genomes on Distributed Memory Machines
Priyanka Ghosh (Washington State University), Sriram Krishnamoorthy (Pacific Northwest National Laboratory), and Ananth Kalyanaraman (Washington State University)


Language Modeling at Scale
Mostofa Patwary (Baidu), Milind Chabbi (Unaffiliated), Heewoo Jun, Jiaji Huang, Greg Diamos, and Kenneth Church (Baidu)



SESSION 14: File Systems


DYRS: Bandwidth-Aware Disk-to-Memory Migration of Cold Data in Big-Data File Systems
Simbarashe Dzinamarira (Rice University), Florin Dinu (University of Sydney), and S. Eugene Ng (Rice University)


iez: Resource Contention Aware Load Balancing for Large-Scale Parallel File Systems
Bharti Wadhwa and Arnab K. Paul (Virginia Tech), Sarah Neuwirth (University of Heidelberg), Feiyi Wang and Sarp Oral (Oak Ridge National Laboratory), Ali R. Butt, Jon Bernard, and Kirk W. Cameron (Virginia Tech)


SimFS: A Simulation Data Virtualizing File System Interface
Salvatore Di Girolamo, Pirmin Schmid, Thomas Shulthess, and Torsten Hoefler (ETH Zurich)


Sizing and Partitioning Strategies for Burst-Buffers to Reduce IO Contention
Guillaume Aupy, Olivier Beaumont, and Lionel Eyraud-Dubois (Inria, University of Bordeaux)


SESSION 15: GPU Computing II


On Optimizing Complex Stencils on GPUs
Prashant S. Rawat, Miheer Vaidya, Aravind Sukumaran-Rajam, and Atanas Rountev (The Ohio State University), Louis-Noel Pouchet (Colorado State University), and P. Sadayappan (The Ohio State University)


Themis: Predicting and Reining in Application-Level Slowdown on Spatial Multitasking GPUs
Wenyi Zhao and Quan Chen (Shanghai Jiao Tong University), Hao Lin and Jianfeng Zhang (Alibaba Group), Jingwen Leng, Chao Li, Wenli Zheng, Li Li, and Minyi Guo (Shanghai Jiao Tong University)


Exploiting Adaptive Data Compression to Improve Performance and Energy-efficiency of Compute Workloads in Multi-GPU Systems
Mohammad Khavari, Yifan Sun, Nicolas Bohm Agostini, and David Kaeli (Northeastern University)


Dual Pattern Compression Using Data-Preprocessing for Large-Scale GPU Architectures
Kyung Hoon Kim, Priyank Devpura, Abhishek Nayyar, Andrew Doolittle, Kihwan Yum, and Eun Jung Kim (Texas A&M University)



SESSION 16: Scheduling and Load Balancing II


Adapting Batch Scheduling to Workload Characteristics: What can we Expect From Online Learning?
Arnaud Legrand (CNRS), Denis Trystram (Grenoble INP), and Salah Zrigui (Université Grenoble Alpes)


Aladdin: Optimized Maximum Flow Management for Shared Production Clusters
WU Heng, Zhang Wenbo, Xu Yuanjia, and Tao Huang (Institute of Software, Chinese Academy of Sciences), Haiyang Ding and Zheng Zhang (Alibaba Group)


mmWave Wireless Backhaul Scheduling of Stochastic Packet Arrivals
Pawel Garncarek and Tomasz Jurdzinski (University of Wroclaw), Dariusz R. Kowalski (University of Liverpool), and Miguel A. Mosteiro (Pace University)


Tight & Simple Load Balancing
Petra Berenbrink (Universität Hamburg), Tom Friedetzky (Durham University), Dominik Kaaser and Peter Kling (Universität Hamburg)

Afternoon Break 3:30 PM - 4:00 PM

Plenary Panel

4:00 PM – 5:15 PM






THURSDAY - 23 May 2019


Keynote Session
8:30 AM - 9:30 AM

KEYNOTE SPEAKER – To be announced

Morning Break 9:30 AM - 10:00 AM

Parallel Technical Sessions 17, 18, 19, & 20
10:00 AM - 12:00 PM

SESSION 17: Managing Data


An Error-Reflective Consistency Model for Distributed Data Stores
Philip Dexter and Kenneth Chiu (SUNY Binghamton), Bedri Sendir (IBM Research)


A High-Performance Distributed Relational Database System for Scalable OLAP Processing
Jason Arnold, Boris Glavic, and Ioan Raicu (Illinois Institute of Technology)


An Approach for Parallel Loading and Pre-Processing of Unstructured Meshes Stored in Spatially Scattered Fashion
Ondrej Meca, Lubomir Riha, and Tomas Brzobohaty (IT4Innovations National Supercomputing Centre, Ostrava, Czech Republic)



SESSION 18: Message Passing


Exploring MPI Communication Models for Graph Applications Using Graph Matching as a Case Study
Sayan Ghosh (Washington State University), Mahantesh Halappanavar (Pacific Northwest National Laboratory),
Ananth Kalyanaraman (Washington State University), Arif Khan (Pacific Northwest National Laboratory), and Assefaw Gebremedhin (Washington State University)


BigSpa: An Efficient Interprocedural Static Analysis Engine in the Cloud
Zhiqiang Zuo, Rong Gu, Xi Jiang, Zhaokang Wang, Yihua Huang, Linzhang Wang, and Xuandong Li (Nanjing University)


An Efficient Collaborative Communication Mechanism for MPI Neighborhood Collectives
S. Mahdieh Ghazimirsaeed, Seyed H. Mirsadeghi, and Ahmad Afsahi (Queen's University)


SESSION 19: Managing Power and Energy


Understanding the Impact of Dynamic Power-Capping on Application Progress
Srinivasan Ramesh (University of Oregon), Swann Perarnau and Sridutt Bhalachandra (Argonne National Laboratory), Allen Malony (University of Oregon), and Pete Beckman (Argonne National Laboratory)


Modelling DVFS and UFS for Region-Based Energy Aware Tuning of HPC Applications
Mohak Chadha and Michael Gerndt (Technische Universität München)


SprintCon: Controllable and Efficient Computational Sprinting for Data Center Servers
Wenli Zheng (Shanghai Jiao Tong University), Xiaorui Wang (The Ohio State University), Yue Ma and Chao Li (Shanghai Jiao Tong University), Hao Lin (Alibaba Group), Bin Yao (Shanghai Jiao Tong University), Jianfeng Zhang (Alibaba Group), Minyi Guo (Shanghai Jiao Tong University)


Drowsy-DC: Data center power management system
Mathieu Bacou (IRIT, Université de Toulouse, CNRS, Toulouse; Atos Intégration, Toulouse), Grégoire Todeschi (IRIT, Université de Toulouse, CNRS, Toulouse, France), Alain Tchana (I3S), Daniel Hagimont (IRIT, Université de Toulouse, CNRS, Toulouse, France), Baptiste Lepers and Willy Zwaenepoel (EPFL)



SESSION 20: Networks


Distributed Dominating Set and Connected Dominating Set Construction in the Dynamic SINR Model
Dongxiao Yu (Institute of Intelligent Computing, School of Computer Science and Technology, Shandong University), Yifei Zou (Department of Computer Science, The University of Hong Kong), Yong Zhang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences), Feng Li (Institute of Intelligent Computing, School of Computer Science and Technology, Shandong University), Jiguo Yu (Qilu University of Technology, Shandong Computer Science Center), Yu Wu (School of Computer Science and Technology, Dongguan University of Technology), Xiuzhen Cheng (Institute of Intelligent Computing, School of Computer Science and Technology, Shandong University), and Francis C.M. Lau (Department of Computer Science, The University of Hong Kong)


MULTISKIPGRAPH: A Self-stabilizing Overlay Network that Maintains Monotonic Searchability
Linghui Luo (Paderborn University, Heinz Nixdorf Institut), Christian Scheideler and Thim Strothmann (Paderborn University)


Network Size Estimation in Small-World Networks under Byzantine Faults
Soumyottam Chatterjee and Gopal Pandurangan (University of Houston), Peter Robinson (McMaster University)


MD-GAN: Multi-Discriminator Generative Adversarial Networks for Distributed Datasets
Corentin Hardy (Technicolor, Inria), Erwan Le Merrer (Technicolor), and Bruno Sericola (Inria)


Parallel Technical Sessions 21, 22, 23
1:30 PM - 3:30 PM

SESSION 21: Dealing with Faults


MOARD: Modeling Application Resilience to Transient Faults on Data Objects
Luanzheng Guo and Dong Li (University of California, Merced)


SAFIRE: Scalable and Accurate Fault Injection for Parallel Multithreaded Applications
Giorgis Georgakoudis (Lawrence Livermore National Laboratory, Queen's Unversity Belfast), Ignacio Laguna (Lawrence Livermore National Laboratory), Hans Vandierendonck and Dimitrios S. Nikolopoulos (Queen's University Belfast), Martin Schulz (Technische Universität München)


Optimal Placement of In-Memory Checkpoints Under Heterogeneous Failure Likelihoods
Zaeem Hussain, Taieb Znati, and Rami Melhem (University of Pittsburgh)


VeloC: Towards High Performance Adaptive Asynchronous Checkpointing at Large Scale
Bogdan Nicolae (Argonne National Laboratory), Adam Moody, Elsa Gonsiorowski, and Kathryn Mohror (Lawrence Livermore National Laboratory), Franck Cappello (Argonne National Laboratory)



SESSION 22: Optimizing Memory Behavior


HART: A Concurrent Hash-Assisted Radix Tree for DRAM-PM Hybrid Memory Systems
Wen Pan, Tao Xie, and Xiaojia Song (San Diego State University)


LLC-guided Data Migration in Hybrid Memory Systems
Evangelos Vasilakis (Chalmers University of Technology, CSE Dept.), Vassilis Papaefstathiou (Foundation for Research and Technology – Hellas/FORTH), Pedro Trancoso and Ioannis Sourdis (Chalmers University of Technology, CSE Dept)


Software-based Buffering of Associative Operations on Random Memory Addresses
Matthias Hauck (Heidelberg University/SAP, SAP SE), Marcus Paradies (Deutsches Zentrum für Luft- und Raumfahrt), and Holger Fröning (Heidelberg University)


Combining Prefetch Control and Cache Partitioning to Improve Multicore Performance
Gongjin Sun, Junjie Shen, and Alex Veidenbaum (University of California, Irvine)



SESSION 23: Programming Languages


UPC++: A High-Performance Communication Framework for Asynchronous Computation
John Bachan, Scott B. Baden, Steven Hofmeyr, Mathias Jacquelin, Amir Kamil, Dan Bonachea, Paul H. Hargrove, and Hadia Ahmed (Lawrence Berkeley National Laboratory)


Cpp-Taskflow: Fast Task-based Parallel Programming using Modern C++
Tsung-Wei Huang, Chun-Xun Lin, Guannan Guo, and Martin D. F. Wong (University of Illinois Urbana-Champaign)


Portal: A High-Performance Language and Compiler for Parallel N-body Problems
Laleh Aghababaie Beni, Saikiran Ramanan, and Aparna Chandramowlishwaran (University of California, Irvine)


SAC Goes Cluster: Fully Implicit Distributed Computing
Thomas Macht (University of Amsterdam, VU University Amsterdam) and Clemens Grelck (University of Amsterdam)

Afternoon Break 3:30 PM - 4:00 PM

Parallel Technical Sessions 24, 25, & 26
4:00 PM - 5:30 PM

SESSION 24: Accelerating Graph Processing


Incremental Graph Processing for On-Line Analytics
Scott Sallinen and Matei Ripeanu (University of British Columbia), Roger Pearce (Lawrence Livermore National Labs)


Incrementalization of Vertex-Centric Programs
Timothy A. K. Zakian (University of Oxford), Ludovic A. R. Capelli (University of Edinburgh), and Zhenjiang Hu (National Institute of Informatics, University of Tokyo)


GraphTinker : A High Performance Data Structure for Dynamic Graph Processing
Wole Jaiyeoba and Kevin Skadron (University of Virginia)



SESSION 25: Applications II


FastJoin: A Skewness-Aware Distributed Stream Join System
Shunjie Zhou, Fan Zhang, Hanhua Chen, and Hai Jin (Huazhong University of Science and Technology), Bing Bing Zhou (The University of Sydney)


A Bin-Based Bitstream Partitioning Approach for Parallel CABAC Decoding in Next Generation Video Coding
Philipp Habermann (Technische Universität Berlin), Chi Ching Chi and Mauricio Alvarez-Mesa (Spin Digital Video Technologies GmbH), Ben Juurlink (Technische Universität Berlin)


Stochastic Gradient Descent on Modern Hardware: Multi-core CPU or GPU? Synchronous or Asynchronous?
Yujing Ma, Florin Rusu, and Martin Torres (University of California Merced)



SESSION 26: Security and Reliability


Always be Two Steps Ahead of Your Enemy
Thorsten Götte (Paderborn University), Vipin Ravindran Vijayalakshmi (RWTH Aachen), and Christian Scheideler (Paderborn University)


Peace Through Superior Puzzling: An Asymmetric Sybil Defense
Diksha Gupta and Jared Saia (University of New Mexico), Maxwell Young (Mississippi State University)


Rethinking Support for Region Conflict Exceptions
Swarnendu Biswas (IIT Kanpur), Rui Zhang and Michael D. Bond (Ohio State University), Brandon Lucia (Carnegie Mellon University)

FRIDAY - 24 May 2019



Friday Workships
To be announced

March 22nd Deadline
for Advance Registration

Registration Details

Search IPDPS


Follow IPDPS


Tweets by @IPDPS

IPDPS 2018 Report

32nd IEEE International Parallel &
Distributed Processing Symposium 
May 21 – May 25, 2018
JW Marriott Parq Vancouver
Vancouver, British Columbia CANADA