SC14 New Orleans, LA

Technical Program Posters

  1. Advanced Computation for High Intensity Accelerators. James Amundson, Qiming Lu, Paul Lebrun, Alexandru Macridin, Leo Michelotti, Chong Shik Park, Saba Sehrish, Panagiotis Spentzouris and Eric Stern (Fermi National Accelerator Laboratory)
  2. Association Rule Mining with the Micron Automata Processor. Ke Wang, Mircea Stan and Kevin Skadron (University of Virginia)
  3. Bandwidth-Aware Resource Management for Extreme Scale Systems. Zhou Zhou, Xu Yang and Zhiling Lan (Illinois Institute of Technology), Paul Rich, Wei Tang and Vitali Morozov (Argonne National Laboratory) and Narayan Desai (Ericsson)
  4. bgclang: Creating an Alternative, Customizable, Toolchain for the Blue Gene/Q. Hal Finkel (Argonne National Laboratory)
  5. Big Data Analytics on Object Stores: A Performance Study. Lukas Rupprecht (Imperial College London) and Rui Zhang and Dean Hildebrand (IBM Corporation)
  6. Characterizing Application Sensitivity to Network Performance. Eli Rosenthal (Brown University) and Edgar A. Leon (Lawrence Livermore National Laboratory)
  7. A Cloud-Based Interactive Data Infrastructure for Sensor Networks. Tonglin Li (Illinois Institute of Technology), Kate Keahey, Rajesh Sankaran and Pete Beckman (Argonne National Laboratory) and Ioan Raicu (Illinois Institute of Technology)
  8. Comparing Algorithms for Detecting Abrupt Change Points in Data. Cody L. Buntain (University of Maryland), Christopher Natoli (University of Chicago) and Miroslav Živković (University of Amsterdam)
  9. Cosmography with SDvision. Daniel Pomarède (CEA Saclay and Institute of Research into the Fundamental Laws of the Universe), Helene Courtois (University Claude Bernard I, CNRS National Institute of Nuclear and Particle Physics and Lyon Institute of Nuclear Physics), Yehuda Hoffman (Hebrew University of Jerusalem) and R. Brent Tully (University of Hawaii, Honolulu)
  10. Creating High-Performance Linear Algebra Using Lighthouse and Build-to-Order BLAS. Jeffrey J. Cook and Elizabeth R. Jessup (University of Colorado), Sa-Lin C. Bernstein (Argonne National Laboratory) and Boyana Norris (University of Oregon)
  11. Development of Distributed Parallel Explicit Moving Particle Simulation (MPS) Method and Zoom Up Tsunami Analysis on Urban Areas. Kohei Murotani and Seiichi Koshizuka (University of Tokyo), Masao Ogino (Nagoya University) and Ryuji Shioya and Yasushi Nakabayashi (Toyo University)
  12. Diagnosing Network Bottlenecks: One-sided Message Contention. Nathan R. Tallent, Abhinav Vishnu, Hubertus Van Dam, Jeff Daily, Darren Kerbyson and Adolfy Hoisie (Pacific Northwest National Laboratory)
  13. DySectAPI: Scalable Prescriptive Debugging. Nicklas Bo Jensen and Niklas Quarfot Nielsen (Technical University of Denmark), Gregory L. Lee (Lawrence Livermore National Laboratory), Sven Karlsson (Technical University of Denmark) and Dong H. Ahn, Matthew Legendre and Martin Schulz (Lawrence Livermore National Laboratory)
  14. Early Evaluation of the SX-ACE Processor. Ryusuke Egawa (Tohoku University), Shintaro Momose (NEC Corporation), Kazuhiko Komatsu (Tohoku University), Yoko Isobe (NEC Corporation), Hiroyuki Takizawa (Tohoku University), Akihiro Musa (NEC Corporation) and Hiroaki Kobayashi (Tohoku University)
  15. Efficient Data Compression by Efficient Use of HDF5 Format. Katsumi Hagita (National Defense Academy), Manabu Omiya (Hokkaido University), Takashi Honda (ZEON Corporation) and Masao Ogino (Nagoya University)
  16. Employing Machine Learning for the Selection of Robust Algorithms for the Dynamic Scheduling of Scientific Applications. Nitin Sukhija and Srishti Srivastava (Mississippi State University), Florina M. Ciorba (Technical University Dresden), Ioana Banicescu (Mississippi State University) and Brandon Malone (Helsinki Institute for Information Technology)
  17. Exploring Hybrid Hardware and Data Placement Strategies for the Graph 500 Challenge. Scott Sallinen, Daniel Borges, Abdullah Gharaibeh and Matei Ripeanu (University of British Columbia)
  18. Exposing MPI Objects for Debugging. Laust Brock-Nannestad (Technical University of Denmark), John DelSignore (Rogue Wave Software, Inc.), Jeffrey M. Squyres (Cisco Systems, Inc.), Sven Karlsson (Technical University of Denmark) and Kathryn Mohror (Lawrence Livermore National Laboratory)
  19. Fault Injection, Detection, and Correction in CLAMR Using F-SEFI. Brian Atkinson (Clemson University), Nathan DeBardeleben and Qiang Guan (Los Alamos National Laboratory) and William M. Jones (Coastal Carolina University)
  20. A Framework for Analyzing the Community Land Model within the Community Earth System Models. Dali Wang (Oak Ridge National Laboratory), Wei Wu and Yang Xu (University of Tennessee) and Tomislav Janjusic, Wei Ding, Frank Winkler, Nick Forrington and Oscar Hernandez (Oak Ridge National Laboratory)
  21. A Framework for Resource Aware Multithreading. Sunil Shrestha (University of Delaware), Joseph Manzano, Andres Marquez and John Feo (Pacific Northwest National Laboratory) and Guang Gao (University of Delaware)
  22. GPGPU enabled HPC Cloud Platform Based on OpenStack. Tae Joon Jun, Van Quoc Dung, Myong Hwan Yoo and Daeyoung Kim (Korea Advanced Institute of Science and Technology) and HyeYoung Cho and Jaegyoon Hahm (Korea Institute of Science and Technology Information)
  23. GPU Acceleration of Small Dense Matrix Computation of the One-Sided Factorizations. Tingxing Dong, Mark Gates, Azzam Haidar, Piotr Luszczek and Stanimire Tomov (University of Tennessee, Knoxville)
  24. Greater than 10x Acceleration of Fusion Plasma Edge Simulations Using the Parareal Algorithm. Debasmita Samaddar (Culham Centre for Fusion Energy/UK Atomic Energy Authority), David P. Coster (Max Planck Institute of Plasma Physics), Xavier Bonnin (National Center for Scientific Research), Christoph Bergmeister (University of Innsbruck), Eva Havlickova (Culham Centre for Fusion Energy/UK Atomic Energy Authority) and Wael R. Elwasif, Lee A. Berry and Donald B. Batchelor (Oak Ridge National Laboratory)
  25. Hardware Accelerated Linear Programming: Parallelizing the Simplex Method with OpenCL. Bradley de Vlugt (Western University), Maysam Mirahmadi (IBM Corporation) and Serguei L. Primak and Abdallah Shami (Western University)
  26. A High Performance C++ Generic Benchmark for Computational Epidemiology. Aniket Pugaonkar, Sandeep Gupta, Keith R. Bisset and Madhav V. Marathe (Virginia Polytechnic Institute and State University)
  27. HPC and Distributed Computing for Students in Science and Non-Science Programs. Suzanne McIntosh (Cloudera, Inc. and New York University)
  28. Hydra: An HTML5-Based Application for High-Throughput Visualization of Ligand Docking. Yuan Zhao (University of California, San Diego) and Jason Haga (National Institute of Advanced Industrial Science and Technology)
  29. I/O Monitoring in a Hadoop Cluster. Carson L. Wiens, Joshua M. C. Long and Joel R. Ornstein (Los Alamos National Laboratory)
  30. Interoperating MPI and Charm++ for Productivity and Performance. Nikhil Jain (University of Illinois at Urbana-Champaign), Abhinav Bhatele and Jae-Seung Yeom (Lawrence Livermore National Laboratory), Mark F. Adams (Lawrence Berkeley National Laboratory), Francesco Miniati (ETH Zurich), Chao Mei (Google) and Laxmikant Kale (University of Illinois at Urbana-Champaign)
  31. Kokkos Implementation of Albany: a Performance-Portable Finite Element Application. Irina Demeshko, H. Carter Edwards, Michael A. Heroux, Roger P. Pawlowski, Eric T. Phipps and Andrew G. Salinger (Sandia National Laboratories)
  32. Large-Scale Granular Simulations Using Dynamic Load Balance on a GPU Supercomputer. Satori Tsuzuki and Takayuki Aoki (Tokyo Institute of Technology)
  33. Large-Scale Parallel Visualization of Particle Datasets Using Point Sprites. Silvio Rizzi, Mark Hereld, Joseph Insley, Michael E. Papka, Thomas Uram and Venkatram Vishwanath (Argonne National Laboratory)
  34. Lessons From Analyzing Fan-In Communications. Terry Jones and Bradley Settlemyer (Oak Ridge National Laboratory)
  35. Leveraging Naturally Distributed Data Redundancy to Optimize Collective Replication. Bogdan Nicolae, Massimiliano Meneghin and Pierre Lemarinier (IBM Corporation)
  36. Lightweight Scheduling for Improving Load Balance Without Losing Locality. Vivek Kale (University of Illinois), William Gropp (University of Illinois at Urbana-Champaign) and Simplice Donfack (French Institute for Research in Computer Science and Automation)
  37. Lossy Compression for Checkpointing: Fallible or Feasible? Xiang Ni (University of Illinois at Urbana-Champaign), Tanzima Islam, Kathryn Mohror and Adam Moody (Lawrence Livermore National Laboratory) and Laxmikant Kale (University of Illinois at Urbana-Champaign)
  38. Machine Learning Algorithms for the Performance and Energy-Aware Characterization of Linear Algebra Kernels on Multithreaded Architectures. A. Cristiano I. Malossi, Yves Ineichen, Costas Bekas and Alessandro Curioni (IBM Corporation) and Enrique S. Quintana-Ortí (James I University)
  39. Massively Parallel and Near Linear Time Graph Analytics. Fazle Elahi Faisal, Yves Ineichen, A. Cristiano I. Malossi, Peter Staar, Costas Bekas and Alessandro Curioni (IBM Corporation)
  40. The MDTM Project. Liang Zhang (Fermi National Accelerator Laboratory), Tan Li and Yufei Ren (Stony Brook University), Phil DeMar (Fermi National Accelerator Laboratory), Shudong Jin (Stony Brook University), Dantong Yu (Brookhaven National Laboratory) and Wenji Wu (Fermi National Accelerator Laboratory)
  41. MetaMorph: A Modular Library for Democratizing the Acceleration of Parallel Computing Across Heterogeneous Devices. Paul D. Sathre and Wu-chun Feng (Virginia Polytechnic Institute and State University)
  42. Monetary Cost Optimizations for HPC Applications on Amazon Clouds: Checkpoints and Replicated Execution. Yifan Gong, Bingsheng He and Amelie Chi Zhou (Nanyang Technological University)
  43. A Multiple Time Stepping Algorithm for Efficient Multiscale Modeling of Platelets Flowing in Blood Plasma. Na Zhang, Peng Zhang, Li Zhang, Danny Bluestein and Yuefan Deng (Stony Brook University)
  44. New Parallelization Model of Sequential Monte Carlo Analysis with Prediction-Correction Computing. Eiji Tomiyama and Hiroshi Koyama (Research Organization for Information Science and Technology) and Katsumi Hagita (National Defense Academy)
  45. Optimization and Highly Parallel Implementation of Domain Decomposition Based Algorithms. Lubomir Riha, Tomas Brzobohaty, Alexandros Markopoulos, Marta Jarosova and Tomas Kozubek (IT4Innovations)
  46. Optimizing CAD and Mesh Generation Workflow for SeisSol. Sebastian Rettenberger (Technical University Munich), Cameron Smith (Rensselaer Polytechnic Institute) and Christian Pelties (Ludwig Maximilian University of Munich)
  47. Optimizing Stencil Computations: Multicore-Optimized Wavefront Diamond Blocking on Shared and Distributed Memory Systems. Tareq Malas (King Abdullah University of Science and Technology), Georg Hager (Erlangen Regional Computing Center), Hatem Ltaief (King Abdullah University of Science and Technology), Holger Stengel and Gerhard Wellein (University of Erlangen-Nuremberg) and David Keyes (King Abdullah University of Science and Technology)
  48. PACC: An Extension of OpenACC for Pipelined Processing of Large Data on a GPU. Tomochika Kato, Fumihiko Ino and Kenichi Hagihara (Osaka University)
  49. Parallel Clustering Coefficient Computation Using GPUs. Tahsin Reza (University of British Columbia), Tanuj Kr Aasawat (Jadavpur University) and Matei Ripeanu (University of British Columbia)
  50. Parallel High-Order Geometric Multigrid Methods on Adaptive Meshes for Highly Heterogeneous Nonlinear Stokes Flow Simulations of Earth's Mantle. Johann Rudi (University of Texas at Austin), Hari Sundar (University of Utah), Tobin Isaac and Georg Stadler (University of Texas at Austin), Michael Gurnis (California Institute of Technology) and Omar Ghattas (University of Texas at Austin)
  51. The Parallel Java 2 Library: Parallel Programming in 100% Java. Alan R. Kaminsky (Rochester Institute of Technology)
  52. Performance Grading of GPU-Based Implementation of Space Computing Systems Image Compression. Olympia Kremmyda, Vasilis Dimitsas and Dimitris Gizopoulos (National and Kapodistrian University of Athens)
  53. Performance Model for Large-Scale Neural Simulations with NEST. Wolfram Schenck (Juelich Research Center), Andrew V. Adinetz (Juelich Supercomputing Center), Yury V. Zaytsev (Juelich Research Center), Dirk Pleiter (Juelich Supercomputing Center) and Abigail Morrison (Juelich Research Center)
  54. Performance of Block Jacobi-Davidson Eigensolvers. Melven Roehrig-Zoellner and Jonas Thies (German Aerospace Center), Moritz Kreutzer (Erlangen Regional Computing Center), Andreas Alvermann and Andreas Pieper (University of Greifswald), Achim Basermann (German Aerospace Center), Georg Hager and Gerhard Wellein (Erlangen Regional Computing Center) and Holger Fehske (University of Greifswald)
  55. Performance of Sparse Matrix-Multiple Vectors Multiplication on Multicore and GPUs. Walid Abu-Sufah (University of Illinois at Urbana-Champaign and University of Jordan) and Khalid Ahmad (University of Jordan)
  56. Performance Optimization and Evaluation of a Global Climate Application Using a 440m Horizontal Mesh on the K Computer. Masaaki Terai and Hisashi Yashiro (RIKEN), Kiyotaka Sakamoto (Fujitsu), Shin-ichi Iga and Hirofumi Tomita (RIKEN), Masaki Satoh (University of Tokyo) and Kazuo Minami (RIKEN)
  57. Performance Portable Parallel Programming - Compile-Time Defined Parallelization and Storage Order for Accelerators and CPUs. Michel Müller (Tokyo Institute of Technology)
  58. A Power API for the HPC Community. David DeBonis, Ryan E. Grant, Stephen L. Olivier, Michael Levenhagen, Suzanne M. Kelly, Kevin T. Pedretti and James H. Laros (Sandia National Laboratories)
  59. Power Shifting Opportunities on BG/Q Using Memory Throttling. Bo Li (Virginia Polytechnic Institute and State University) and Edgar A. Leon (Lawrence Livermore National Laboratory)
  60. PyFR: An Open Source Python Framework for High-Order CFD on Heterogeneous Platforms. Freddie D. Witherden, Brian C. Vermeire and Peter E. Vincent (Imperial College London)
  61. Raexplore: Enabling Rapid, Automated Architecture Exploration for Full Applications. Yao Zhang, Prasanna Balaprakash, Jiayuan Meng, Vitali Morozov, Scott Parker and Kalyan Kumaran (Argonne National Laboratory)
  62. RHF SCF Parallelization in GAMESS by Using MPI and OpenMP. Yuri Alexeev, Graham Fletcher and Vitali Morozov (Argonne National Laboratory)
  63. Rolls-Royce Hydra CFD Code on GPUs Using OP2 Abstraction. Istvan Z. Reguly and Gihan R. Mudalige (University of Oxford), Carlo Bertolli (IBM Corporation), Michael B. Giles (University of Oxford), Adam Betts and Paul H. J. Kelly (Imperial College London) and David Radford (Rolls-Royce plc)
  64. A Roofline Performance Analysis of an Algebraic Multigrid Solver. Alex Druinsky, Brian Austin, Xiaoye S. Li, Osni Marques, Eric Roman and Samuel Williams (Lawrence Berkeley National Laboratory)
  65. Scalable and Highly Available Fault Resilient Programming Middleware for Exascale Computing. Atsuko Takefusa, Tsutomu Ikegami, Hidemoto Nakada, Ryousei Takano, Takayuki Tozawa and Yoshio Tanaka (National Institute of Advanced Industrial Science and Technology)
  66. Scalable Arbitrary-Order Pseudo-Spectral Electromagnetic Solver. Jean-Luc Vay, Leroy Anthony Drummond and Alice Koniges (Lawrence Berkeley National Laboratory), Brendan Godfrey (University of Maryland and Lawrence Berkeley National Laboratory) and Irving Haber (University of Maryland)
  67. Scaling OpenMP Programs to Thousand Cores on the Numascale Architecture. Dirk Schmidl (RWTH Aachen University) and Atle Vesterkjær (Numascale AS)
  68. Skeptical Programming and Selective Reliability. James Elliott (North Carolina State University), Mark Hoemmen (Sandia National Laboratories) and Frank Mueller (North Carolina State University)
  69. Space-Filling Curves for Domain Decomposition in Scientific Simulations. Aparna Sasidharan and Marc Snir (University of Illinois at Urbana-Champaign)
  70. Tightly Coupled Accelerators Architecture for Low-latency Inter-Node Communication Between Accelerators. Toshihiro Hanawa (University of Tokyo) and Yuetsu Kodama, Taisuke Boku and Mitsuhisa Sato (University of Tsukuba)
  71. Toward Effective Detection of Silent Data Corruptions for HPC Applications. Sheng Di (Argonne National Laboratory), Eduardo Berrocal (Illinois Institute of Technology) and Leonardo Bautista-Gomez, Katherine Heisey, Rinku Gupta and Franck Cappello (Argonne National Laboratory)
  72. Using Global View Resilience (GVR) to add Resilience to Exascale Applications. Hajime Fujita and Nan Dun (University of Chicago and Argonne National Laboratory), Aiman Fang and Zachary A. Rubenstein (University of Chicago), Ziming Zheng (HP Vertica), Kamil Iskra (Argonne National Laboratory), Jeff Hammond (Intel Corporation), Anshu Dubey (Lawrence Berkeley National Laboratory), Pavan Balaji (Argonne National Laboratory) and Andrew A. Chien (University of Chicago and Argonne National Laboratory)
  73. Using IKAROS to Form Scalable Storage Platforms. Christos Filippidis (National Center of Scientific Research Demokritos), Yiannis Cotronis (University of Athens) and Christos Markou (National Center of Scientific Research Demokritos)
  74. Visualization of Particle Beam Simulations in the IFMIF Accelerator. Bruno Thooris, Phu-Anh-Phi Nghiem and Daniel Pomarède (CEA Saclay and Institute of Research into the Fundamental Laws of the Universe)
  75. Visualizing the Behavior of Large Programs. Hoa Nguyen (University of Utah) and Greg Bronevetsky (Lawrence Livermore National Laboratory)
  76. XcalableACC – a Directive-Based Language Extension for Accelerated Parallel Computing. Hitoshi Murai, Masahiro Nakao and Takenori Shimosaka (RIKEN) and Akihiro Tabuchi, Taisuke Boku and Mitsuhisa Sato (University of Tsukuba)
