default search action
30th SBAC-PAD 2018: Lyon, France
- 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018, Lyon, France, September 24-27, 2018. IEEE 2018, ISBN 978-1-5386-7769-8
Computer Architecture and Compilers
- Nishant Rao, Akshay Ramachandran, Amish Shah:
MLNoC: A Machine Learning Based Approach to NoC Design. 1-8 - Isaías B. Felzmann, Matheus Martins Susin, Liana Dessandre Duenha, Rodolfo Azevedo, Lucas Francisco Wanner:
ADeLe: Rapid Architectural Simulation for Approximate Hardware. 9-16 - Pedro Caldeira, Jeronimo Costa Penha, Lucas Bragança, Ricardo Ferreira, José Augusto Miranda Nacif, Renato Ferreira, Fernando Magno Quintão Pereira:
From Java to FPGA: An Experience with the Intel HARP System. 17-24 - Congmiao Li, Jean-Luc Gaudiot:
Online Detection of Spectre Attacks Using Microarchitectural Traces from Performance Counters. 25-28 - Luis Mattos, Divino Cesar S. Lucas, Juan Salamanca, Joao P. L. de Carvalho, Márcio Machado Pereira, Guido Araujo:
DOACROSS Parallelization Based on Component Annotation and Loop-Carried Probability. 29-32
Scheduling
- Louis-Claude Canon, Aurélie Kong Win Chang, Yves Robert, Frédéric Vivien:
Scheduling Independent Stochastic Tasks Under Deadline and Budget Constraints. 33-40 - Jirí Dokulil, Siegfried Benkner:
Adaptive Scheduling of Collocated Applications Using a Task-Based Runtime System. 41-48 - Vinicius Freitas, Alexandre de Limas Santana, Márcio Castro, Laércio Lima Pilla:
A Batch Task Migration Approach for Decentralized Global Rescheduling. 49-56 - Yubo Qin, Ivan Rodero, Pradeep Subedi, Manish Parashar, Sandro Rigo:
Exploring Power Budget Scheduling Opportunities and Tradeoffs for AMR-Based Applications. 57-64 - Congfeng Jiang, Yumei Wang, Dongyang Ou, Yeliang Qiu, Youhuizi Li, Jian Wan, Bing Luo, Weisong Shi, Christophe Cérin:
EASE: Energy Efficiency and Proportionality Aware Virtual Machine Scheduling. 65-68
Energy in the Cloud, Network
- David Guyon, Anne-Cécile Orgerie, Christine Morin:
Energy - Efficient IaaS-PaaS Co-Design for Flexible Cloud Deployment of Scientific Applications. 69-76 - Chaopeng Guo, Jean-Marc Pierson:
Frequency Selection Approach for Energy Aware Cloud Database. 77-84 - Benjamin Camus, Fanny Dufossé, Anne Blavette, Martin Quinson, Anne-Cécile Orgerie:
Network-Aware Energy-Efficient Virtual Machine Management in Distributed Cloud Infrastructures with On-Site Photovoltaic Production. 86-92 - Su-Hwan Jang, Jongpil Jeong, Byungjun Park:
A Novel Broker-Based Hierarchical Authentication Scheme in Proxy Mobile IPv6 Networks. 93-96
Applications
- Yuankun Fu, Feng Li, Fengguang Song, Luoding Zhu:
Designing a Parallel Memory-Aware Lattice Boltzmann Algorithm on Manycore Systems. 97-106 - Jucele Franca de Alencar Vasconcellos, Edson Norberto Cáceres, Henrique Mongelli, Siang Wun Song:
A New Efficient Parallel Algorithm for Minimum Spanning Tree. 107-114 - Natalia Kalinnik, Robert Kiesel, Thomas Rauber, Marcel Richter, Gudula Rünger:
Exploring Self-Adaptivity Towards Performance and Energy for Time-Stepping Methods. 115-123 - Daniel Oliveira, Francis Birck Moreira, Paolo Rech, Philippe Olivier Alexandre Navaux:
Predicting the Reliability Behavior of HPC Applications. 124-131
GPU Based Computing
- Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Thomas Grützmacher:
Variable-Size Batched Condition Number Calculation on GPUs. 132-139 - Ming-Hung Chen, I-Hsin Chung, Bülent Abali, Paul Crumley:
Towards a Single-Host Many-GPU System. 140-147 - Matthias Korch, Tim Werner:
Exploiting Limited Access Distance for Kernel Fusion Across the Stages of Explicit One-Step Methods on GPUs. 148-157 - Suren Chilingaryan, Evelina Ametova, Andreas Kopmann, Alessandro Mirone:
Balancing Load of GPU Subsystems to Accelerate Image Reconstruction in Parallel Beam Tomography. 158-166 - Eugenio Gianniti, Li Zhang, Danilo Ardagna:
Performance Prediction of GPU-Based Deep Learning Applications. 167-170
Programming Paradigms and Memory
- Romain Fontaine, Laure Gonnord, Lionel Morel:
Polyhedral Dataflow Programming: A Case Study. 171-179 - George Kornaros, Marcello Coppola:
Enabling Efficient Job Dispatching in Accelerator-Extended Heterogeneous Systems with Unified Address Space. 180-188 - Mohammad Shakeel Laghari, Najeeb Ahmad, Didem Unat:
Phase-Based Data Placement Scheme for Heterogeneous Memory Systems. 189-196 - João Vieira, Nuno Roma, Pedro Tomás, Paolo Ienne, Gabriel Falcão Paiva Fernandes:
Exploiting Compute Caches for Memory Bound Vector Operations. 197-200
Data Analytics, Locality and I/O
- Shouwei Chen, Ivan Rodero:
Exploring the Potential of Next Generation Software-Defined in Memory Frameworks. 201-208 - Yevhen Alforov, Thomas Ludwig, Anastasiia Novikova, Michael Kuhn, Julian M. Kunkel:
Towards Green Scientific Data Compression Through High-Level I/O Interfaces. 209-216 - Luiz Angelo Steffenel:
Improving the Performance of Fog Computing Through the Use of Data Locality. 217-224 - Alberto Miranda, Ramon Nou, Toni Cortes:
ECHOFS: A Scheduler-Guided Temporary Filesystem to Leverage Node-Local NVMS. 225-228 - Hartwig Anzt, Jack J. Dongarra:
A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs. 229-232
Performance Prediction and Evaluation
- Markus Wittmann, Georg Hager, Radim Janalík, Martin Lanser, Axel Klawonn, Oliver Rheinbach, Olaf Schenk, Gerhard Wellein:
Multicore Performance Engineering of Sparse Triangular Solves Using a Modified Roofline Model. 233-241 - Nelson Mimura Gonzalez, José R. Brunheroto, Fausto Artico, Yoonho Park, Tereza Cristina M. B. Carvalho, Charles Christian Miers, Maurício Aronne Pillon, Guilherme Piêgas Koslovski:
Predicting the Performance Impact of Increasing Memory Bandwidth for Scientific Workflows. 242-249 - Milan Radulovic, Kazi Asifuzzaman, Darko Zivanovic, Nikola Rajovic, Guillaume Colin de Verdière, Dirk Pleiter, Manolis Marazakis, Nikolaos D. Kallimanis, Paul M. Carpenter, Petar Radojkovic, Eduard Ayguadé:
Mainstream vs. Emerging HPC: Metrics, Trade-Offs and Lessons Learned. 250-257 - Gabriel Fernandez, Francisco J. Cazorla, Jaume Abella, Sylvain Girbal:
Assessing Time Predictability Features of ARM Big. LITTLE Multicores. 258-261 - Pierre Huchant, Denis Barthou, Marie Christine Counilh:
Adaptive Partitioning for Iterated Sequences of Irregular OpenCL Kernels. 262-265
IoT, Fog, Edge, and Cloud Computing
- Fabíola Martins Campos de Oliveira, Edson Borin:
Partitioning Convolutional Neural Networks for Inference on Constrained Internet-of-Things Devices. 266-273 - Ali Reza Zamani, Daniel Balouek-Thomert, Juan J. Villalobos, Ivan Rodero, Manish Parashar:
Runtime Management of Data Quality for Scientific Observatories Using Edge and In-Transit Resources. 274-281 - Jose Pergentino Araujo Neto, Donald M. Pianto, Célia Ghedini Ralha:
A Fault-Tolerant Agent-Based Architecture for Transient Servers in Fog Computing. 282-289
HPML 2018 Workshop: Section I
- Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro:
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours. 290-297 - Guojing Cong, Giacomo Domeniconi, Joshua Shapiro, Fan Zhou, Barry Chen:
Accelerating Deep Neural Network Training for Action Recognition on a Cluster of GPUs. 298-305 - Renato Luiz de Freitas Cunha, Eduardo R. Rodrigues, Matheus Palhares Viana, Dário Augusto Borges Oliveira:
An Argument in Favor of Strong Scaling for Deep Neural Networks with Small Datasets. 306-313 - Kazumasa Sakivama, Shinpei Kato, Yutaka Ishikawa, Atsushi Hori, Abraham Monrroy:
Deep Learning on Large-Scale Muticore Clusters. 314-321 - Behzad Salami, Osman S. Unsal, Adrián Cristal Kestelman:
On the Resilience of RTL NN Accelerators: Fault Characterization and Mitigation. 322-329 - David M. Chan, Roshan Rao, Forrest Huang, John F. Canny:
T-SNE-CUDA: GPU-Accelerated T-SNE and its Applications to Modern Data. 330-338
HPML 2018 Workshop: Section II
- M. Todd Young, Jacob D. Hinkle, Arvind Ramanathan, Ramakrishnan Kannan:
HyperSpace: Distributed Bayesian Hyperparameter Optimization. 339-347 - Marisol Monterrubio Velasco, José Carlos Carrasco-Jiménez, Octavio Castillo Reyes, Fernando M. Cucchietti, Josep de la Puente:
A Machine Learning Approach for Parameter Screening in Earthquake Simulation. 348-355 - Kenny Peou, Alan Kelly, Joel Falcou, Cécile Germain:
A Case Study on Optimizing Accurate Half Precision Average. 356-363 - Paul-Cristian Sarbu, Hans-Joachim Bungartz:
Optimization of a Sparse Grid-Based Data Mining Kernel for Architectures Using AVX-512. 364-371 - Matheus Alcântara Souza, Lucas Andrade Maciel, Pedro Henrique Penna, Henrique C. Freitas:
Energy Efficient Parallel K-Means Clustering for an Intel® Hybrid Multi-Chip Package. 372-379
HPML 2018 Workshop: Section III
- Christina Diedhiou, Bryan Carpenter, Aamir Shafi, Soumabha Sarkar, Ramazan Esmeli, Ryan Gadsdon:
Performance Comparison of a Parallel Recommender Algorithm Across Three Hadoop-Based Frameworks. 380-387 - Shirin Tavara, Alexander Schliep:
Effect of Network Topology on the Performance of ADMM-Based SVMs. 388-393 - Luis Fernando L. Grim, André Leon S. Gradvohl:
High-Performance Ensembles of Online Sequential Extreme Learning Machine for Regression and Time Series Forecasting. 394-401
WAMCA 2018 Workshop: Architecture and Performance Analysis
- Matheus Alcântara Souza, Henrique C. Freitas, Jean-François Méhaut:
Design Space Exploration of Energy Efficient NoC-and Cache-Based Many-Core Architecture. 402-409 - Fabio Verbosio, Jurai Kardos, Mauro Bianco, Olaf Schenk:
Highly Scalable Stencil-Based Matrix-Free Stochastic Estimator for the Diagonal of the Inverse. 410-419 - Shad Kirmani, Hongyang Sun, Padma Raghavan:
A Scalability and Sensitivity Study of Parallel Geometric Algorithms for Graph Partitioning. 420-427
WAMCA 2018 Workshop: OpenMP Parallelization
- Matheus Mortatti, Hervé Yviquel, Guido Araujo:
Automatic Ray-Tracer Cloud Offloading in OPENMP. 428-435 - Olfa Haggui, Claude Tadonki, Fatma Sayadi, Bouraoui Ouni:
Evaluation of an OPENMP Parallelization of Lucas-Kanade on a NUMA-Manycore. 436-441 - Taylor Lloyd, Artem Chikin, Sanket Kedia, Dhruv Jain, José Nelson Amaral:
Automated GPU Grid Geometry Selection for OPENMP Kernels. 442-449 - Abdoul Wahid Mainassara Checkaraou, Alban Rousset, Xavier Besseron, Sébastien Varrette, Bernhard Peters:
Hybrid MPI+openMP Implementation of eXtended Discrete Element Method. 450-457
WAMCA 2018 Workshop: Hybrid Parallelization
- Evan Coleman, Erik J. Jensen, Masha Sosonkina:
Impacts of Three Soft-Fault Models on Hybrid Parallel Asynchronous Iterative Methods. 458-465 - Guillaume Latu, Yuuichi Asahi, Julien Bigot, Tamas B. Fehér, Virginie Grandgirard:
Scaling and Optimizing the Gysela Code on a Cluster of Many-Core Processors. 466-473
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.