default search action
IPDPS 2016: Chicago, IL, USA - Workshops
- 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2016, Chicago, IL, USA, May 23-27, 2016. IEEE Computer Society 2016, ISBN 978-1-5090-3682-0
Workshop 1-HCW - Heterogeneity in Computing Workshop
- Denis Trystram, Erik Saule:
HCW Introduction. 1-2 - Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 3 - Denis Trystram:
Message from the HCW General Chair. 4 - Erik Saule:
Message from the HCW Program Committee Chair. 5 - Mahmut T. Kandemir:
HCW 2016 Keynote Talk. 6
Session 1: Heterogeneity in the Cloud
- Julio Proaño, Carmen Carrión, María Blanca Caminero:
Towards a Green, QoS-Enabled Heterogeneous Cloud Infrastructure. 7-16 - Rekha Singhal, Abhishek Verma:
Predicting Job Completion Time in Heterogeneous MapReduce Environments. 17-27 - Fouad Hanna, Loris Marchal, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo, Hala Sabbah:
Minimizing Rental Cost for Multiple Recipe Applications in the Cloud. 28-37
Session 2: Heterogeneity in Single Node Systems
- Saeid Barati, Hank Hoffmann:
Providing Fairness in Heterogeneous Multicores with a Predictive, Adaptive Scheduler. 38-49 - Jeremy Bottleson, SungYe Kim, Jeff Andrews, Preeti Bindu, Deepak N. Murthy, Jingyi Jin:
clCaffe: OpenCL Accelerated Caffe for Convolutional Neural Networks. 50-57 - Bahareh Goodarzi, Martin Burtscher, Dhrubajyoti Goswami:
Parallel Graph Partitioning on a CPU-GPU Architecture. 58-66
Session 3: Heterogeneity and Energy
- Dylan Machovec, Bhavesh Khemka, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel, Gregory A. Koenig, Michael Wright, Marcia Hilton, Rajendra Rambharos, Neena Imam:
Dynamic Resource Management for Parallel Tasks in an Oversubscribed Energy-Constrained Heterogeneous Environment. 67-78 - JeeWhan Choi, Richard W. Vuduc:
Analyzing the Energy Efficiency of the Fast Multipole Method Using a DVFS-Aware Energy Model. 79-88 - John E. Stone, Michael J. Hallock, James C. Phillips, Joseph R. Peterson, Zaida Luthey-Schulten, Klaus Schulten:
Evaluation of Emerging Energy-Efficient Heterogeneous Computing Platforms for Biomolecular and Cellular Simulation Workloads. 89-100
Workshop 2-RAW - Reconfigurable Architectures Workshop
- Marco D. Santambrogio, Ramachandran Vaidyanathan, Diana Goehringer, Steven J. E. Wilton:
RAW Introduction and Committees. 101-102 - H. Peter Hofstee, Patrick Lysaght, Dirk van den Heuvel:
RAW 2016 Keynotes. 103-104
Session 1: Application Mapping and Design Space Exploration
- Lester Kalms, Diana Göhringer:
Clustering and Mapping Algorithm for Application Distribution on a Scalable FPGA Cluster. 105-113 - Syed Waqar Nabi, Wim Vanderbauwhede:
A Fast and Accurate Cost Model for FPGA Design Space Exploration in HPC Applications. 114-123 - Hyunsuk Nam, Roman Lysecky:
Latency, Power, and Security Optimization in Distributed Reconfigurable Embedded Systems. 124-131
Session 2: Applications
- Daniel Llamocca, Daniel N. Aloi:
A Reconfigurable Fixed-Point Architecture for Adaptive Beamforming. 132-138 - Aaron Mills, Phillip H. Jones, Joseph Zambreno:
Parameterizable FPGA-Based Kalman Filter Coprocessor Using Piecewise Affine Modeling. 139-147 - Chi Zhang, Ren Chen, Viktor K. Prasanna:
High Throughput Large Scale Sorting on a CPU-FPGA Heterogeneous Platform. 148-155 - Juan Andrés Pérez-Celis, José Martínez-Carranza, Alicia Morales-Reyes, Claudia Feregrino Uribe, René Cumplido:
An FPGA Architecture to Accelerate the Burrows Wheeler Transform by Using a Linear Sorter. 156-161
Session 3: Processor Architectures
- Mohamed El-Hadedy, Hristina Mihajloska, Danilo Gligoroski, Amit Kulkarni, Dirk Stroobandt, Kevin Skadron:
A 16-Bit Reconfigurable Encryption Processor for p-Cipher. 162-171 - Stephan Nolting, Guillermo Payá Vayá, Florian Giesemann, Holger Blume, Sebastian Niemann, Christian Müller-Schloer:
Dynamic Self-Reconfiguration of a MIPS-Based Soft-Processor Architecture. 172-180 - Steffen Vaas, Marc Reichenbach, Dietmar Fey:
An Application-Specific Instruction Set Processor for Power Quality Monitoring. 181-188
Session 4: Scheduler and Runtime Systems
- Andrea Purgato, Davide Tantillo, Marco Rabozzi, Donatella Sciuto, Marco D. Santambrogio:
Resource-Efficient Scheduling for Partially-Reconfigurable FPGA-Based Systems. 189-197 - Tajas Ruschke, Lukas Johannes Jung, Dennis Wolf, Christian Hochberger:
Scheduler for Inhomogeneous and Irregular CGRAs with Support for Complex Control Flow. 198-207 - Jens Rettkowski, Philipp Wehner, Evgheni Cutiscev, Diana Göhringer:
LinROS: A Linux-Based Runtime System for Reconfigurable MPSoCs. 208-216
Session 5: High Level Synthesis and Object-Oriented Programming
- Emanuele Del Sozzo, Andrea Solazzo, Antonio Miele, Marco D. Santambrogio:
On the Automation of High Level Synthesis of Convolutional Neural Networks. 217-224 - Gianluca C. Durelli, Fabrizio Spada, Christian Pilato, Marco D. Santambrogio:
Scala-Based Domain-Specific Language for Creating Accelerator-Based SoCs. 225-232 - Hongyuan Ding, Sen Ma, Miaoqing Huang, David Andrews:
OOGen: An Automated Generation Tool for Custom MPSoC Architectures Based on Object-Oriented Programming Methods. 233-240
Short Papers
- Benedikt Janßen, Moataz Naserddin, Michael Hübner:
A Hardware/Software Co-Design Approach for Control Applications with Static Real-Time Reallocation. 241-246 - Giulia Guidi, Enrico Reggiani, Lorenzo Di Tucci, Gianluca Durelli, Michaela Blott, Marco D. Santambrogio:
On How to Improve FPGA-Based Systems Design Productivity via SDAccel. 247-252 - Jones Yudi Mori, André Werner, Florian Fricke, Michael Hübner:
A Rapid Prototyping Method to Reduce the Design Time in Commercial High-Level Synthesis Tools. 253-258 - Salma Hesham, Diana Göhringer, Mohamed A. Abd El Ghany:
ARTNoCs: An Evaluation Framework for Hardware Architectures of Real-Time NoCs. 259-264 - Amit Kulkarni, Elias Vansteenkiste, Dirk Stroobandt, Andreas Brokalakis, Antonis Nikitakis:
A Fully Parameterized Virtual Coarse Grained Reconfigurable Array for High Performance Computing Applications. 265-270 - Anita Tino, Kaamran Raahemifar:
Assessing Multi-task Placement Algorithms in RCUs. 271-276 - Alexandra Kourfali, Dirk Stroobandt:
Efficient Hardware Debugging Using Parameterized FPGA Reconfiguration. 277-282 - Fynn Schwiegelshohn, Florian Kastner, Michael Hübner:
Enabling Dynamic Reconfiguration of Numerical Methods for the Robotic Motion Control Task. 283-288 - Martín Letras, Raudel Hernández-León, René Cumplido:
Hardware Architectures for Frequent Itemset Mining Based on Equivalence Classes Partitioning. 289-294 - Fabiola Casasopra, Gea Bianchi, Gianluca C. Durelli, Marco D. Santambrogio:
Parallel Protein Identification Using an FPGA-Based Solution. 295-299 - Nikolaos Stekas, Dirk van den Heuvel:
Face Recognition Using Local Binary Patterns Histograms (LBPH) on an FPGA-Based System on Chip (SoC). 300-304
Workshop 3-HIPS - High-Level Parallel Programming Models and Supportive Environments
- David Böhme, Xu Liu:
HIPS Introduction and Committees. 305-306 - Tim Mattson:
HIPS 2016 Keynote. 307
Session 1: Debugging and Optimization
- Faheem Ullah, Thomas R. Gross:
Detecting Anomalies in Concurrent Programs Based on Dynamic Control Flow Changes. 308-317 - Marc Sergent, David Goudin, Samuel Thibault, Olivier Aumage:
Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System. 318-327 - Shingo Okuno, Tasuku Hiraishi, Hiroshi Nakashima, Masahiro Yasugi, Jun Sese:
Reducing Redundant Search in Parallel Graph Mining Using Exceptions. 328-337
Session 2: Heterogeneous Computing
- Matt Martineau, Simon McIntosh-Smith, Wayne P. Gaudin:
Evaluating OpenMP 4.0's Effectiveness as a Heterogeneous Parallel Programming Model. 338-347 - Ebad Salehi, Ahmad Lashgar, Amirali Baniasadi:
Employing Compression Solutions under OpenACC. 348-356 - Craig Edward Rasmussen, Matthew J. Sottile, Søren Rasmussen, Daniel Nagle, William Dumas:
CAFe: Coarray Fortran Extensions for Heterogeneous Computing. 357-365
Session 3: Parallel Algorithms and Systems
- Peter Mills, Clinton Jeffery:
Embedding Concurrent Generators. 366-375 - Josef Weidendorfer, Jens Breitbart:
The Case for Binary Rewriting at Runtime for Efficient Implementation of High-Level Programming Models in HPC. 376-385 - Seyed Hessam Mirsadeghi, Ahmad Afsahi:
PTRAM: A Parallel Topology-and Routing-Aware Mapping Framework for Large-Scale HPC Systems. 386-396 - Joshua Dennis Booth, Kyungjoo Kim, Sivasankaran Rajamanickam:
A Comparison of High-Level Programming Choices for Incomplete Sparse Factorization Across Different Architectures. 397-406
Workshop 4-HiCOMB - High Performance Computational Biology
- Srinivas Aluru, David A. Bader, Ananth Kalyanaraman, Jaroslaw Zola:
HiCOMB Introduction and Committees. 407
Session I
- Constantin Scholl, Kassian Kobert, Tomás Flouri, Alexandros Stamatakis:
The Divisible Load Balance Problem with Shared Cost and Its Application to Phylogenetic Inference. 408-417 - Nikolaos Alachiotis, Doru-Thom Popovici, Tze Meng Low:
Efficient Computation of Linkage Disequilibria as Dense Linear Algebra Operations. 418-427 - Michael J. Hallock, Zaida Luthey-Schulten:
Improving Reaction Kernel Performance in Lattice Microbes: Particle-Wise Propensities and Run-Time Generated Code. 428-434
Session II
- Amir Bahmani, Alexander B. Sibley, Mahmoud Parsian, Kouros Owzar, Frank Mueller:
SparkScore: Leveraging Apache Spark for Distributed Genomic Inference. 435-442 - Shayan Shams, Nayong Kim, Xiandong Meng, Ming Tai Ha, Shantenu Jha, Zhong Wang, Joohyun Kim:
A Scalable Pipeline for Transcriptome Profiling Tasks with On-Demand Computing Clouds. 443-452 - Vipin Sachdeva, Srinivas Aluru, David A. Bader:
A Memory and Time Scalable Parallelization of the Reptile Error-Correction Code. 453-462
Session III
- Nuttiiya Seekhao, Caroline Shung, Joseph F. JáJá, Luc Mongeau, Nicole Y. K. Li-Jessen:
Real-Time Agent-Based Modeling Simulation with in-Situ Visualization of Complex Biological Systems: A Case Study on Vocal Fold Inflammation and Healing. 463-472 - M. Ali Mirzaei, Francesco Crescioli, Sebastien Viret, William Tromeur, Giovanni Calderini, Giovanni Marchiori, Guillaume Baulieu, Geoffrey Galbit:
A Novel Associative Memory Based Architecture for Sequence Alignment. 473-478
Workshop 5-APDCM - Advances in Parallel and Distributed Computational Models
- Oscar H. Ibarra, Koji Nakano, Akihiro Fujiwara, Susumu Matsumae:
APDCM Introduction and Committees. 479
Session 1: Graph Algorithms
- Jie Wu:
Stable Matching Beyond Bipartite Graphs. 480-488 - Paula Aguilera, Dong Ping Zhang, Nam Sung Kim, Nuwan Jayasena:
Fine-Grained Task Migration for Graph Algorithms Using Processing in Memory. 489-498
Session 2: Wireless Networks and Distributed Computing
- Wei Chen, Liang Hong, Sachin Shetty, Dan Chia-Tien Lo, Reginald Cooper:
Cross-Layered Security Approach with Compromised Nodes Detection in Cooperative Sensor Networks. 499-508 - Hideharu Kojima, Yuta Nagashima, Tatsuhiro Tsuchiya:
Model Checking Techniques for State Space Reduction in MANET Protocol Verification. 509-516 - Feng Luo, Pradip K. Srimani:
New Biology Inspired Anonymous Distributed Algorithms to Compute Dominating and Total Dominating Sets in Network Graphs. 517-524
Session 3: Distributed Computing and Models
- Ta Yuan Hsu, Ajay D. Kshemkalyani:
Performance of Causal Consistency Algorithms for Partially Replicated Systems. 525-534 - Hassan Nawaz, Gideon Juve, Rafael Ferreira da Silva, Ewa Deelman:
Performance Analysis of an I/O-Intensive Workflow Executing on Google Cloud and Amazon Web Services. 535-544 - Travis S. Humble, Alexander J. McCaskey, Jonathan Schrock, Hadayat Seddiqi, Keith A. Britt, Neena Imam:
Performance Models for Split-Execution Computing Systems. 545-554 - Ernesto Gomez, Keith E. Schubert, Zongqi Ritchie Cai:
A Model for Entropy of Parallel Execution. 555-560
Session 4: Parallel Computing
- James Alexander Edwards, Uzi Vishkin:
FFT on XMT: Case Study of a Bandwidth-Intensive Regular Algorithm on a Highly-Parallel Many Core. 561-569 - Makoto Nakayama, Kenichi Yamazaki, Satoshi Tanaka:
Parallelization of Recursive Preorder Traversal Based on Building and Winding Call Stacks. 570-579 - P. B. Jayaraj, K. Rahamathulla, G. Gopakumar:
A GPU Based Maximum Common Subgraph Algorithm for Drug Discovery Applications. 580-588 - Toru Fujita, Koji Nakano, Yasuaki Ito:
Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars. 589-598 - Xin Zhou, Yasuaki Ito, Koji Nakano:
An Efficient Implementation of LZW Decompression in the FPGA. 599-607
Workshop 6-ASHES - Accelerators and Hybrid Exascale Systems
- James Dinan:
AsHES Introduction and Committees. 608-609 - Wen-mei W. Hwu:
AsHES 2016 Keynote. 610
Session 1: Programming Models and Tools
- Chris J. Newburn, Gaurav Bansal, Michael Wood, Luis Crivelli, Judit Planas, Alejandro Duran, Paulo Souza, Leonardo Borges, Piotr Luszczek, Stanimire Tomov, Jack J. Dongarra, Hartwig Anzt, Mark Gates, Azzam Haidar, Yulu Jia, Khairul Kabir, Ichitaro Yamazaki, Jesús Labarta:
Heterogeneous Streaming. 611-620 - John D. Leidel, Yong Chen:
HMC-Sim-2.0: A Simulation Platform for Exploring Custom Memory Cube Operations. 621-630 - Erik Zenker, Benjamin Worpitz, René Widera, Axel Huebl, Guido Juckeland, Andreas Knüpfer, Wolfgang E. Nagel, Michael Bussmann:
Alpaka - An Abstraction Library for Parallel Kernel Acceleration. 631-640 - Souley Madougou, Ana Lucia Varbanescu, Cees de Laat, Rob van Nieuwpoort:
A Tool for Bottleneck Analysis and Performance Prediction for GPU-Accelerated Applications. 641-652
Session 2: Algorithms and Applications
- Yulu Jia, Piotr Luszczek, Jack J. Dongarra:
Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures. 653-662 - Ryan Eberhardt, Mark Hoemmen:
Optimization of Block Sparse Matrix-Vector Multiplication on Shared-Memory Parallel Architectures. 663-672 - Joshua Dennis Booth, Sivasankaran Rajamanickam, Heidi Thornquist:
Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts. 673-682 - Hartwig Anzt, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler:
Efficiency of General Krylov Methods on GPUs - An Experimental Study. 683-691
Session 3: Workload Scheduling
- Luis Costero, Francisco D. Igual, Katzalin Olcoz, Sandra Catalán, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí:
Refactoring Conventional Task Schedulers to Exploit Asymmetric ARM big.LITTLE Architectures in Dense Linear Algebra. 692-701 - Valeria Cardellini, Alessandro Fanfarillo, Salvatore Filippone:
Heterogeneous CAF-Based Load Balancing on Intel Xeon Phi. 702-711 - Iman Faraji, Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Topology-Aware GPU Selection on Multi-GPU Nodes. 712-720
Workshop 7-PCO - Parallel Computing and Optimization
- Didier El Baz, Bora Uçar:
PCO Introduction and Committees. 721
Session I: Parallel Computing and Optimization
- Kevin Ryan, Deepak Rajan, Shabbir Ahmed:
Scenario Decomposition for 0-1 Stochastic Programs: Improvements and Asynchronous Implementation. 722-729 - Lluís-Miquel Munguía, Geoffrey Oxberry, Deepak Rajan:
PIPS-SBB: A Parallel Distributed-Memory Branch-and-Bound Algorithm for Stochastic Mixed-Integer Programs. 730-739 - Adam Polak:
Counting Triangles in Large Graphs on GPU. 740-746 - Adel Dabah, Ahcène Bendjoudi, Didier El Baz, Abdelhakim AitZai:
GPU-Based Two Level Parallel B&B for the Blocking Job Shop Scheduling Problem. 747-755
Session II: Parallel Algorithms for Scheduling problems GPU-Based Two Level Parallel B&B for the Blocking Job Shop Scheduling
- Yumei Huo, Jun Xiong Huang:
Parallel Ant Colony Optimization for Flow Shop Scheduling Subject to Limited Machine Availability. 756-765 - Abhishek Awasthi, Jörg Lässig, Jens Leuschner, Thomas Weise:
GPGPU-Based Parallel Algorithms for Scheduling Against Due Date. 766-775 - Ali Al Buhussain, Robson Eduardo De Grande, Azzedine Boukerche:
Performance Analysis of Bio-Inspired Scheduling Algorithms for Cloud Environments. 776-785
Session III: Parallel Heuristics and Metaheuristics
- José-Matías Cutillas-Lozano, Domingo Giménez, Luis-Pedro García:
Optimizing Metaheuristics and Hyperheuristics through Multi-level Parallelism on a Many-Core System. 786-795 - Didier El Baz, Mhand Hifi, Lei Wu, Xiaochuan Shi:
A Parallel Ant Colony Optimization for the Maximum-Weight Clique Problem. 796-800 - Giovanni Cammarata, Antonella Di Stefano, Giovanni Morana, Daniele Zito:
Evaluating the Performance of A4SDN on Various Network Topologies. 801-808 - Ania Kaci, Huy-Nam Nguyen, Amir Nakib, Patrick Siarry:
Hybrid Heuristics for Mapping Task Problem on Large Scale Heterogeneous Platforms. 809-816 - Karl-Eduard Berger, François Galea, Bertrand Le Cun, Renaud Sirdey:
A Semi-Greedy Heuristic for the Mapping of Large Task Graphs. 817-824
Session IV: Combinatorial Scientific Computing
- Yu Jin, Joseph F. JáJá:
A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms. 825-834 - Ning Hao, AmirReza Oghbaee, Mohammad Rostami, Nate Derbinsky, José Bento:
Testing Fine-Grained Parallelism for the ADMM on a Factor-Graph. 835-844 - Pingfan Li, Xuhao Chen, Zhe Quan, Jianbin Fang, Huayou Su, Tao Tang, Canqun Yang:
High Performance Parallel Graph Coloring on GPGPUs. 845-854
Workshop 8-GABB - Graph Algorithms Building Blocks
- Tim Mattson:
GABB Introduction and Committees. 855 - David A. Bader:
GABB 2016 Keynote. 856 - Mark Tullsen, Matthew J. Sottile:
Array Types for a Graph Processing Language. 857-866 - Jiahao Chen, Weijian Zhang:
The Right Way to Search Evolving Graphs. 867-876 - E. Jason Riedy:
Updating PageRank for Streaming Graphs. 877-884 - Sriram Srinivasan, Sanjukta Bhowmick, Sajal K. Das:
Application of Graph Sparsification in Developing Parallel Algorithms for Updating Connected Components. 885-891 - Keita Iwabuchi, Scott Sallinen, Roger A. Pearce, Brian Van Essen, Maya B. Gokhale, Satoshi Matsuoka:
Towards a Distributed Large-Scale Dynamic Graph Data Store. 892-901 - Brendan Gavin, Vijay Gadepally, Jeremy Kepner:
Enforced Sparse Non-negative Matrix Factorization. 902-911 - Peter Zhang, Marcin Zalewski, Andrew Lumsdaine, Samantha Misurda, Scott McMillan:
GBTL-CUDA: Graph Algorithms and Primitives for GPUs. 912-920 - Peter M. Kogge:
Jaccard Coefficients as a Potential Graph Benchmark. 921-928 - Patrick Dreher, Chansup Byun, Chris Hill, Vijay Gadepally, Bradley C. Kuszmaul, Jeremy Kepner:
PageRank Pipeline Benchmark: Proposal for a Holistic System Benchmark for Big-Data Platforms. 929-937
Workshop 9-EduPar - NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Ramachandran Vaidyanathan, Sushil K. Prasad, Satish Puri:
EduPar Introduction and Committees. 938-940 - Randal E. Bryant:
EduPar 2016 Keynote. 941
Session 1: Programming Framework and Tools
- Abdul Dakkak, Carl Pearson, Wen-mei W. Hwu:
WebGPU: A Scalable Online Development Platform for GPU Programming Courses. 942-949 - Annette C. Feng, Wu-chun Feng:
Parallel Programming with Pictures in a Snap! 950-957 - José R. Ortiz-Ubarri, Rafael A. Arce-Nazario, Edusmildo Orozco:
Modules to Teach Parallel and Distributed Computing Using MPI for Python and Disco. 958-962 - Yinong Chen, Gennaro De Luca:
VIPLE: Visual IoT/Robotics Programming Language Environment for Computer Science Education. 963-971
Session 2: Instruction Techniques and Experiences
- Joel C. Adams, Patrick A. Crain, Christopher P. Dilley:
Seeing Multithreaded Behavior Using TSGL. 972-977 - Barry Wilkinson, Clayton Ferner:
The Suzaku Pattern Programming Framework. 978-986 - Shirley Moore, Steven R. Dunlop:
A Flipped Classroom Approach to Teaching Concurrency and Parallelism. 987-995 - Javier Cuenca, Domingo Giménez:
A Parallel Programming Course Based on an Execution Time-Energy Consumption Optimization Problem. 996-1003
Workshop 10-HPDAV - High Performance Data Analysis and Visualization
- Wes Bethel:
HPDAV Introduction and Committees. 1004-1005 - Jim Jeffers:
HPDAV 2016 Keynote. 1006
Full Papers Session I
- David Pugmire, James Kress, Jong Youl Choi, Scott Klasky, Tahsin M. Kurç, Michael Churchill, Matthew Wolf, Greg Eisenhauer, Hank Childs, Kesheng Wu, Alexander Sim, Junmin Gu, Jonathan Low:
Visualization and Analysis for Near-Real-Time Decision Making in Distributed Workflows. 1007-1013 - John E. Stone, Peter Messmer, Robert Sisneros, Klaus Schulten:
High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL. 1014-1023 - Miyuru Dayarathna, Isuru Herath, Yasima Dewmini, Gayan Mettananda, Sameera Nandasiri, Sanath Jayasena, Toyotaro Suzumura:
Introducing Acacia-RDF: An X10-Based Scalable Distributed RDF Graph Database Engine. 1024-1032
Short Papers Session
- Philippe P. Pébay, Janine C. Bennett, David S. Hollman, Sean Treichler, Patrick S. McCormick, Christine Sweeney, Hemanth Kolla, Alex Aiken:
Towards Asynchronous Many-Task in Situ Data Analysis Using Legion. 1033-1037 - Silvio Rizzi, Mark Hereld, Joseph A. Insley, Preeti Malakar, Michael E. Papka, Thomas D. Uram, Venkatram Vishwanath:
Coupling LAMMPS and the vl3 Framework for Co-Visualization of Atomistic Simulations. 1038-1042 - Krishna Bharadwaj, Samuel Flores, Joshua Rodriguez, Lance Long, G. Elisabeta Marai:
Developing a Scalable SNMP Monitor. 1043-1047
Full Papers Session II
- John E. Stone, William R. Sherman, Klaus Schulten:
Immersive Molecular Visualization with Omnidirectional Stereoscopic Ray Tracing and Remote Rendering. 1048-1057 - Robert Sisneros, David Pugmire:
Tuned to Terrible: A Study of Parallel Particle Advection State of the Practice. 1058-1067
Workshop 11-VarSys - Variability in Parallel and Distributed Systems
- Kirk W. Cameron, Todd Gamblin, Dimitrios S. Nikolopoulos:
VarSys Introduction. 1068 - Allan Porterfield, Sridutt Bhalachandra, Wei Wang, Rob Fowler:
Variability: A Tuning Headache. 1069-1072 - Bilge Acun, Laxmikant V. Kalé:
Mitigating Processor Variation through Dynamic Load Balancing. 1073-1076 - Ivo Jimenez, Carlos Maltzahn, Jay F. Lofstead, Adam Moody, Kathryn M. Mohror, Remzi H. Arpaci-Dusseau, Andrea C. Arpaci-Dusseau:
Characterizing and Reducing Cross-Platform Performance Variability Using OS-Level Virtualization. 1077-1080 - Ali Anwar, Yue Cheng, Ali Raza Butt:
Towards Managing Variability in the Cloud. 1081-1084 - Jin-Seong Kim, Jae J. Jang, Im Young Jung:
Near Real-Time Tracking of IoT Device Users. 1085-1088
Workshop 12-HPPAC - High-Performance, Power-Aware Computing
- Barry Rountree, Shuaiwen Leon Song:
HPPAC Introduction and Committees. 1089
Lightning Talks A
- Chung-Hsing Hsu, Wu-chun Feng:
The Right Metric for Efficient Supercomputing: A Ten-Year Retrospective. 1090-1093 - Ryan E. Grant, Michael J. Levenhagen, Stephen L. Olivier, David Debonis, Kevin T. Pedretti, James H. Laros III:
Overcoming Challenges in Scalable Power Monitoring with the Power API. 1094-1097 - Shirley Moore:
Achieving Safety for Power Shifting in Overprovisioned High Performance Computing Systems. 1098-1101 - Rogelio Long, Shirley Moore:
POSITION PAPER: Countering the Noise-Induced Critical Path Problem. 1102-1105
Lightning Talks B
- Natalie J. Bates, Chung-Hsing Hsu, Neena Imam, Torsten Wilde, Dale Sartor:
Re-Examining HPC Energy Efficiency Dashboard Elements. 1106-1109 - Neha Gholkar, Frank Mueller, Barry Rountree:
A Power-Aware Cost Model for HPC Procurement. 1110-1113 - Christopher Eibel, Timo Hönig, Wolfgang Schröder-Preikschat:
Energy Claims at Scale: Decreasing the Energy Demand of HPC Workloads at OS Level. 1114-1117 - Daniel A. Ellsworth, Tapasya Patki, Swann Perarnau, Sangmin Seo, Abdelhalim Amer, Judicael A. Zounmevo, Rinku Gupta, Kazutomo Yoshii, Henry Hoffmann, Allen D. Malony, Martin Schulz, Peter H. Beckman:
Systemwide Power Management with Argo. 1118-1121
Regular Papers A
- Scott Walker, Marty McFadden:
Best Practices for Scalable Power Measurement and Control. 1122-1131 - Aniruddha Marathe, Hormozd Gahvari, Jae-Seung Yeom, Abhinav Bhatele:
LibPowerMon: A Lightweight Profiling Framework to Profile Program Context and System-Level Metrics. 1132-1141 - Matthias Maiterth, Martin Schulz, Dieter Kranzlmüller, Barry Rountree:
Power Balancing in an Emulated Exascale Environment. 1142-1149 - Sand Luz Correa, Mariam Umar, Kirk W. Cameron:
Combining Power and Performance Modeling for Application Analysis: A Case Study Using Aspen. 1150-1159 - Ryan S. Luley, Qinru Qiu:
Effective Utilization of CUDA Hyper-Q for Improved Power and Performance Efficiency. 1160-1169
Regular Papers B
- Nidhi Tiwari, Umesh Bellur, Santonu Sarkar, Maria Indrawan:
Identification of critical parameters for MapReduce energy efficiency using statistical Design of Experiments. 1170-1179 - Xingfu Wu, Valerie E. Taylor:
Utilizing Hardware Performance Counters to Model and Optimize the Energy and Performance of Large Scale Scientific Applications on Power-Aware Supercomputers. 1180-1189 - Jared Coplin, Martin Burtscher:
Energy, Power, and Performance Characterization of GPGPU Benchmark Programs. 1190-1199
Workshop 13-PDSEC - Parallel and Distributed Scientific and Engineering Computing
- Peter Strazdins, Raphaël Couturier, Keita Teranishi, Alan Gray, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
PDSEC Introduction and Committees. 1200-1201
Session 1: Application and Task Parallelism
- Yuta Hirokawa, Taisuke Boku, Shunsuke A. Sato, Kazuhiro Yabana:
Electron Dynamics Simulation with Time-Dependent Density Functional Theory on Large Scale Symmetric Mode Xeon Phi Cluster. 1202-1211 - Jean Marie Couteyen Carpaye, Jean Roman, Pierre Brenner:
Towards an Efficient Task-Based Parallelization over a Runtime System of an Explicit Finite-Volume CFD Code with Adaptive Time Stepping. 1212-1221 - Alan Humphrey, Daniel Sunderland, Todd Harman, Martin Berzins:
Radiative Heat Transfer Calculation on 16384 GPUs Using a Reverse Monte Carlo Ray Tracing Approach with Adaptive Mesh Refinement. 1222-1231
Session 2: Resilience
- Peter E. Strazdins, Md. Mohsin Ali, Bert J. Debusschere:
Application Fault Tolerance for Shrinking Resources via the Sparse Grid Combination Technique. 1232-1238 - Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun:
Two-Level Checkpointing and Verifications for Linear Task Graphs. 1239-1248
Session 3: Performance
- Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov, Jack J. Dongarra:
On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures. 1249-1258 - André Merzky, Shantenu Jha:
Synapse: Synthetic Application Profiler and Emulator. 1259-1268
Workshop 14-DPDNS - Dependable Parallel, Distributed and Network-Centric Systems
- Dimiter Avresky, Erik Maehle, Roberto Palmieri:
DPDNS Introduction and Committees. 1269 - Shlomi Dolev:
DPDNS 2016 Keynote. 1270
Session 1: Distributed Services
- Brendan Benshoof, Andrew Rosen, Anu G. Bourgeois, Robert W. Harrison:
Distributed Decentralized Domain Name Service. 1279-1287 - Kaliappa Ravindran:
Management Software for Protocol-level Adaptations in Dependable Network Services. 1288-1297
Session 2: Cloud and Fault Tolerance
- Soham Sinha, Di Niu, Zhi Wang, Paul Lu:
Mitigating Routing Inefficiencies to Cloud-Storage Providers: A Case Study. 1298-1306 - Roberto Palmieri:
Leaderless Consensus: The State of the Art. 1307-1310 - Alessandro Pellegrini, Pierangelo di Sanzo, Dimiter R. Avresky:
Proactive Cloud Management for Highly Heterogeneous Multi-cloud Infrastructures. 1311-1318
Session 3: Multicore Computing
- Vishal Chandra Sharma, Ganesh Gopalakrishnan, Sriram Krishnamoorthy:
Towards Resiliency Evaluation of Vector Programs. 1319-1328 - Gilles Bizot, Dimiter Avresky, Fabien Chaix:
Analysis of Adaptive Mapping of Parallelized Application on Multicore System. 1329-1338
Workshop 15-LSPP - Large-Scale Parallel Processing
- Kevin J. Barker, Christopher D. Carothers, Eric Van Hensbergen:
LSPP Introduction and Committees. 1339 - Michael E. Papka:
LSPP 2016 Keynote. 1340
Session 1: Making Efficient Use of Advanced Architectures
- Zhaokui Li, Jianbin Fang, Tao Tang, Xuhao Chen, Cheng Chen, Canqun Yang:
Evaluating the Performance Impact of Multiple Streams on the MIC-Based Heterogeneous Platform. 1341-1350 - Max Plauth, Wieland Hagen, Frank Feinbube, Felix Eberhardt, Lena Feinbube, Andreas Polze:
Parallel Implementation Strategies for Hierarchical Non-uniform Memory Access Systems by Example of the Scale-Invariant Feature Transform Algorithm. 1351-1359 - Ryan D. Friese:
Efficient Genetic Algorithm Encoding for Large-Scale Multi-objective Resource Allocation. 1360-1369
Session 2: Workflow Modeling and Optimization and Modeling at Scale
- Anirban Mandal, Paul Ruth, Ilya Baldin, Dariusz Król, Gideon Juve, Rajiv Mayani, Rafael Ferreira da Silva, Ewa Deelman, Jeremy S. Meredith, Jeffrey S. Vetter, Vickie E. Lynch, Benjamin Mayer, James Wynne, Mark P. Blanco, Christopher D. Carothers, Justin M. LaPre, Brian Tierney:
Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows. 1370-1379 - Kevin J. Barker, Darren J. Kerbyson:
Modeling the Performance and Energy Impact of Dynamic Power Steering. 1380-1389
Workshop 16-ParLearning - Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Charalampos Chelmis, Sutanay Choudhury, Arindam Pal, Anand V. Panangadan, Weiqin Tong, Yinglong Xia:
ParLearning Introduction and Committees. 1390-1391 - Peter M. Kogge:
ParLearning 2016 Keynote. 1392
Session I
- Dianwei Han, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary:
A Novel Scalable DBSCAN Algorithm with Spark. 1393-1402 - Alex Gittens, Jey Kottalam, Jiyan Yang, Michael F. Ringenburg, Jatin Chhugani, Evan Racah, Mohitdeep Singh, Yushu Yao, Curt Fischer, Oliver Rübel, Benjamin P. Bowen, Norman G. Lewis, Michael W. Mahoney, Venkat Krishnamurthy, Prabhat:
A Multi-Platform Evaluation of the Randomized CX Low-Rank Matrix Factorization in Spark. 1403-1412 - Orhan Kislal, Mahmut T. Kandemir, Jagadish Kotra:
Cache-Aware Approximate Computing for Decision Tree Learning. 1413-1422 - Vasileios Zois, Anand V. Panangadan, Viktor K. Prasanna:
Accelerating Support Count for Association Rule Mining on GPUs. 1423-1432
Session II
- Andrew Wylie, Wei Shi, Jean-Pierre Corriveau, Yang Wang:
A Scheduling Algorithm for Hadoop MapReduce Workflows with Budget Constraints in the Heterogeneous Cloud. 1433-1442 - Yanik Ngoko, Denis Trystram, Valentin Reis, Christophe Cérin:
An Automatic Tuning System for Solving NP-Hard Problems in Clouds. 1443-1452 - Daniel G. Chavarría-Miranda, Vito Giovanni Castellana, Alessandro Morari, David Haglin, John Feo:
GraQL: A Query Language for High-Performance Attributed Graph Databases. 1453-1462 - Ismail El-Helw, Rutger F. H. Hofman, Wenzhe Li, Sungjin Ahn, Max Welling, Henri E. Bal:
Scalable Overlapping Community Detection. 1463-1472
Session III
- Xiang-You Peng, Yu-Bo Yang, Chang-Dong Wang, Dong Huang, Jian-Huang Lai:
An Efficient Parallel Nonlinear Clustering Algorithm Using MapReduce. 1473-1476 - Wenhua Yu, Lei Zhao, Xiangyu He, Jiacheng Zhou, Tong Cheng, Chengzhao Xue, Fan Yang:
A New Evaluation System for Scholars and Majors Based on Big-Data Techniques. 1477-1480 - Sarwar Morshed, Juwel Rana, Marcelo Milrad:
Open Source Initiatives and Frameworks Addressing Distributed Real-Time Data Analytics. 1481-1484
Workshop 17-JSSPP - Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai:
JSSPP Introduction and Committees. 1485
Workshop 18-iWAPT - International Workshop on Automatic Performance Tuning
- Weichung Wang:
iWAPT Introduction and Committees. 1486-1487
Session 1
- Takahiro Katagiri, Masaharu Matsumoto, Satoshi Ohshima:
Auto-Tuning of Hybrid MPI/OpenMP Execution with Code Selection by ppOpen-AT. 1488-1495 - Satoshi Ohshima, Takahiro Katagiri, Masaharu Matsumoto:
Utilization and Expansion of ppOpen-AT for OpenACC. 1496-1505
Session 2
- Lars Kirkholt Melhus, Rune Erlend Jensen:
Measurement Bias from Address Aliasing. 1506-1515 - Hiroko Midorikawa:
Blk-Tune: Blocking Parameter Auto-Tuning to Minimize Input-Output Traffic for Flash-Based Out-of-Core Stencil Computations. 1516-1526 - Rong Gu, Zhiqiang Liu, Chunfeng Yuan, Yihua Huang:
A Time-Cost Based Automatic Scheduling Framework for Matrix Computation on Various Distributed Computing Platforms. 1527-1534
Session 3
- Amit Roy, Prasanna Balaprakash, Paul D. Hovland, Stefan M. Wild:
Exploiting Performance Portability in Search Algorithms for Autotuning. 1535-1544 - Piotr Luszczek, Mark Gates, Jakub Kurzak, Anthony Danalis, Jack J. Dongarra:
Search Space Generation and Pruning System for Autotuners. 1545-1554
Workshop 19-CHIUW - Chapel Implementers and Users Workshop
- Tom MacDonald, Greg Titus:
CHIUW Introduction and Committees. 1555-1556 - Nikhil Padmanabhan:
CHIUW 2016 Keynote. 1557
Session 1: Benchmarking and Optimization
- Richard B. Johnson, Jeffrey K. Hollingsworth:
Optimizing Chapel for Single-Node Environments. 1558-1567 - Engin Kayraklioglu, Olivier Serres, Ahmad Anbar, Hashem Elezabi, Tarek A. El-Ghazawi:
PGAS Access Overhead Characterization in Chapel. 1568-1577
Session 2: Chapel Improvement
- Philip A. Nelson, Greg Titus:
Chplvis: A Communication and Task Visualization Tool for Chapel. 1578-1585 - Konstantina Panagiotopoulou, Hans-Wolfgang Loidl:
Transparently Resilient Task Parallelism for Chapel. 1586-1595
Workshop 20-HPBDC - High-Performance Big Data Computing
- Dhabaleswar K. Panda, Jianfeng Zhan, Xiaoyi Lu:
HPBDC Introduction and Committees. 1596
Session I: High-Performance Big Data Applications and Systems
- Andrew J. Younge, Christopher Reidy, Robert Henschel, Geoffrey C. Fox:
Evaluation of SMP Shared Memory Machines for Use with In-Memory and OpenMP Big Data Applications. 1597-1606 - André Luckow, Ioannis Paraskevakos, George Chantzialexiou, Shantenu Jha:
Hadoop on HPC: Integrating Hadoop and Pilot-Based Dynamic Resource Management. 1607-1616 - Ruijian Wang, Chao Wang, Li Zha:
PACM: A Prediction-Based Auto-Adaptive Compression Model for HDFS. 1617-1626
Session II: High-Performance Streaming Systems
- Milinda Pathirage, Julian Hyde, Yi Pan, Beth Plale:
SamzaSQL: Scalable Fast Data Management with Streaming SQL. 1627-1636 - Supun Kamburugamuve, Saliya Ekanayake, Milinda Pathirage, Geoffrey C. Fox:
Towards High Performance Processing of Streaming Data in Large Data Centers. 1637-1644 - Yining Zhao, Haili Xiao:
Extracting Log Patterns from System Logs in LARGE. 1645-1652
Session III (Short Papers): Performance Studies of Big Data Systems and Applications
- Saba Sehrish, Jim Kowalkowski, Marc F. Paterno:
Exploring the Performance of Spark for a Scientific Use Case. 1653-1659 - Rui Zhang, Hongzhi Wang, Renu Tewari, Gero Schmidt, Deepika Kakrania:
Big Data for Medical Image Analysis: A Performance Study. 1660-1664
Workshop 21-HPCMASPA - Monitoring and Analysis for High Performance Computing Systems Plus Applications
- Benjamin A. Allan, Jim M. Brandt, Ann C. Gentile, Cory Lueninghoener, Nichamon Naksinehaboon, Boyana Norris, Narate Taerat:
HPCMASPA Introduction and Committees. 1665-1666 - William T. C. Kramer:
HPCMASPA 2016 Keynote. 1667
Session 1: Instrumentation and Metrics
- Christian Iwainsky, Christian H. Bischof:
Calltree-Controlled Instrumentation for Low-Overhead Survey Measurements. 1668-1677 - Mohammed Tanash, Nasim Ghazanfari, Omar Aaziz, Jonathan Cook:
Automatically Instrumenting Scientific Applications to Produce Heartbeat Events. 1678-1686 - Anthony M. Agelastos:
Defining Metrics to Distill Large-Scale HPC Platform and Application Performance Data into Actionable Quantities. 1687-1691
Session 2: Monitoring Systems
- Patricia Grubel, Hartmut Kaiser, Kevin A. Huck, Jeanine E. Cook:
Using Intrinsic Performance Counters to Assess Efficiency in Task-Based Parallel Applications. 1692-1701 - R. Todd Evans, James C. Browne, William L. Barth:
Understanding Application and System Performance Through System-Wide Monitoring. 1702-1710 - Jim M. Brandt, Ann C. Gentile, Michael T. Showerman, Jeremy Enos, Joshi Fullop, Gregory H. Bauer:
Large-Scale Persistent Numerical Data Source Monitoring System Experiences. 1711-1720 - Sam Sanchez, Amanda Bonnie, Graham van Heule, Conor Robinson, Adam DeConinck, Kathleen Kelly, Quellyn Snead, Jim M. Brandt:
Design and Implementation of a Scalable HPC Monitoring System. 1721-1725
Workshop 22-IPDRM - Emerging Parallel and Distributed Runtime Systems and Middleware
- Shuaiwen Leon Song, Todd Gamblin:
IPDRM Introduction and Committees. 1726 - Henry Hoffmann:
IPDRM 2016 Keynote. 1727
Session 1
- Simon Pickartz, Carsten Clauss, Stefan Lankes, Stephan Krempel, Thomas Moschny, Antonello Monti:
Non-intrusive Migration of MPI Processes in OS-Bypass Networks. 1728-1735 - Ezra Kissel, Martin Swany:
Photon: Remote Memory Access Middleware for High-Performance Runtime Systems. 1736-1743 - Joshua Suetterlein, Joshua Landwehr, Andrès Márquez, Joseph B. Manzano, Guang R. Gao:
Asynchronous Runtimes in Action: An Introspective Framework for a Next Gen Runtime. 1744-1751
Session 2
- Alireza Haghdoost, David H. C. Du:
OWBP: Flash-Aware Offline Write Buffer Policy. 1752-1758 - Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Topology-Aware Rank Reordering for MPI Collectives. 1759-1768 - Anshuman Goswami, Jeffrey S. Young, Karsten Schwan, Naila Farooqui, Ada Gavrilovska, Matthew Wolf, Greg Eisenhauer:
GPUShare: Fair-Sharing Middleware for GPU Clouds. 1769-1776
Session 3
- Jie Zhang, Xiaoyi Lu, Dhabaleswar K. Panda:
Performance Characterization of Hypervisor-and Container-Based Virtualization for HPC on SR-IOV Enabled InfiniBand Clusters. 1777-1784 - Heng Zhang, Chunliang Hao, Yanjun Wu, Mingshu Li:
Macaca: A Scalable and Energy-Efficient Platform for Coupling Cloud Computing with Distributed Embedded Computing. 1785-1788 - Sanket Chintapalli, Derek Dagit, Bobby Evans, Reza Farivar, Thomas Graves, Mark Holderbaugh, Zhuo Liu, Kyle Nusbaum, Kishorkumar Patil, Boyang Peng, Paul Poulosky:
Benchmarking Streaming Computation Engines: Storm, Flink and Spark Streaming. 1789-1792
Workshop 23-ParSocial - Parallel and Distributed Processing for Computational Social Systems
- Eunice E. Santos, John Korah:
ParSocial Introduction and Committees. 1793-1794 - George Cybenko:
ParSocial 2016 Keynote. 1795
Paper Session 1
- Chao Huang, Jermaine Marshall, Dong Wang, Mianxiong Dong:
Towards Reliable Social Sensing in Cyber-Physical-Social Systems. 1796-1802 - Gennaro Cordasco, Carmine Spagnuolo, Vittorio Scarano:
Toward the New Version of D-MASON: Efficiency, Effectiveness and Correctness in Parallel and Distributed Agent-Based Simulations. 1803-1812 - Bhavani Thuraisingham, Murat Kantarcioglu, Latifur Khan, Barbara Carminati, Elena Ferrari, Leila Bahri:
Emergency-Driven Assured Information Sharing in Secure Online Social Networks: A Position Paper. 1813-1820
Paper Session 2
- Eunice E. Santos, John Korah, Vairavan Murugappan, Suresh Subramanian:
Efficient Anytime Anywhere Algorithms for Closeness Centrality in Large and Dynamic Graphs. 1821-1830 - Thanh Hong Nguyen, Arunesh Sinha, Milind Tambe:
Addressing Behavioral Uncertainty in Security Games: An Efficient Robust Strategic Solution for Defender Patrols. 1831-1838
Workshop 24-Roundtable I - PDC in Core Undergraduate Education
- Dick Brown, Suzanne J. Matthews:
Workshop 24-Roundtable I Introduction. 1839
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.