| 2012 |
| 53 | Performance characterization of global address space applications: a case study with NWChem. Jeff R. Hammond, Sriram Krishnamoorthy, Sameer Shende, Nichols A. Romero, Allen D. Malony. Concurrency and Computation: Practice and Experience (24): 135-154 (2012). Web SearchBibTeXDownload |
| 2011 |
| 52 | An Approach to Creating Performance Visualizations in a Parallel Profile Analysis Tool. Wyatt Spear, Allen D. Malony, Chee Wai Lee, Scott Biersdorff, Sameer Shende. Euro-Par Workshops (2) 2011, 156-165. Web SearchBibTeXDownload |
| 51 | Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs. Allen D. Malony, Scott Biersdorff, Sameer Shende, Heike Jagode, Stanimire Tomov, Guido Juckeland, Robert Dietrich, Duncan Poole, Christopher Lamb. ICPP 2011, 176-185. Web SearchBibTeXDownload |
| 2010 |
| 50 | Design and Implementation of a Hybrid Parallel Performance Measurement System. Alan Morris, Allen D. Malony, Sameer Shende, Kevin A. Huck. ICPP 2010, 492-501. Web SearchBibTeXDownload |
| 49 | Improving the Scalability of Performance Evaluation Tools. Sameer Suresh Shende, Allen D. Malony, Alan Morris. PARA (2) 2010, 441-451. Web SearchBibTeXDownload |
| 2009 |
| 48 | A Generic and Configurable Source-Code Instrumentation Component. Markus Geimer, Sameer Shende, Allen D. Malony, Felix Wolf. ICCS (2) 2009, 696-705. Web SearchBibTeXDownload |
| 2008 |
| 47 | Integrated parallel performance views. Aroon Nataraj, Allen D. Malony, Sameer Shende, Alan Morris. Cluster Computing (11): 57-73 (2008). Web SearchBibTeXDownload |
| 46 | Observing Performance Dynamics Using Parallel Profile Snapshots. Alan Morris, Wyatt Spear, Allen D. Malony, Sameer Shende. Euro-Par 2008, 162-171. Web SearchBibTeXDownload |
| 45 | Parametric Studies in Eclipse with TAU and PerfExplorer. Kevin A. Huck, Wyatt Spear, Allen D. Malony, Sameer Shende, Alan Morris. Euro-Par Workshops 2008, 283-294. Web SearchBibTeXDownload |
| 44 | Performance Tool Workflows. Wyatt Spear, Allen D. Malony, Alan Morris, Sameer Shende. ICCS (3) 2008, 276-285. Web SearchBibTeXDownload |
| 43 | Evolution of a Parallel Performance System. Allen D. Malony, Sameer Shende, Alan Morris, Scott Biersdorff, Wyatt Spear, Kevin A. Huck, Aroon Nataraj. Parallel Tools Workshop 2008, 169-190. Web SearchBibTeXDownload |
| 42 | Knowledge support and automation for performance analysis with PerfExplorer 2.0. Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris. Scientific Programming (16): 123-134 (2008). Web SearchBibTeXDownload |
| 2007 |
| 41 | Performance modeling of component assemblies. Nick Trebon, Allen Morris, Jaideep Ray, Sameer Shende, Allen D. Malony. Concurrency and Computation: Practice and Experience (19): 685-696 (2007). Web SearchBibTeXDownload |
| 40 | TAUoverSupermon : Low-Overhead Online Parallel Performance Monitoring. Aroon Nataraj, Matthew J. Sottile, Alan Morris, Allen D. Malony, Sameer Shende. Euro-Par 2007, 85-96. Web SearchBibTeXDownload |
| 39 | Compensation of Measurement Overhead in Parallel Performance Profiling. Allen D. Malony, Sameer Shende, Alan Morris, Felix Wolf. IJHPCA (21): 174-194 (2007). Web SearchBibTeXDownload |
| 38 | Supporting Nested OpenMP Parallelism in the TAU Performance System. Alan Morris, Allen D. Malony, Sameer Shende. International Journal of Parallel Programming (35): 417-436 (2007). Web SearchBibTeXDownload |
| 37 | Scalable, Automated Performance Analysis with TAU and PerfExplorer. Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris. PARCO 2007, 629-636. Web SearchBibTeX |
| 2006 |
| 36 | Kernel-Level Measurement for Integrated Parallel Performance Views: the KTAU Project. Aroon Nataraj, Allen D. Malony, Sameer Shende, Alan Morris. CLUSTER 2006. Web SearchBibTeXDownload |
| 35 | Bridging the language gap in scientific computing: the Chasm approach. Craig Edward Rasmussen, Matthew J. Sottile, Sameer Shende, Allen D. Malony. Concurrency and Computation: Practice and Experience (18): 151-162 (2006). Web SearchBibTeXDownload |
| 34 | Early Experiences with KTAU on the IBM BG/L. Aroon Nataraj, Allen D. Malony, Alan Morris, Sameer Shende. Euro-Par 2006, 99-110. Web SearchBibTeXDownload |
| 33 | Integrating TAU with Eclipse: A Performance Analysis System in an Integrated Development Environment. Wyatt Spear, Allen D. Malony, Alan Morris, Sameer Shende. HPCC 2006, 230-239. Web SearchBibTeXDownload |
| 32 | The Tau Parallel Performance System. Sameer Shende, Allen D. Malony. IJHPCA (20): 287-311 (2006). Web SearchBibTeXDownload |
| 31 | A Component Architecture for High-Performance Scientific Computing. David E. Bernholdt, Benjamin A. Allan, Robert C. Armstrong, Felipe Bertrand, Kenneth Chiu, Tamara Dahlgren, Kostadin Damevski, Wael R. Elwasif, Thomas Epperly, Madhusudhan Govindaraju, Daniel S. Katz, James Arthur Kohl, Manojkumar Krishnan, Gary Kumfert, Jay Walter Larson, Sophia Lefantzi, Michael J. Lewis, Allen D. Malony, Lois C. McInnes, Jarek Nieplocha, Boyana Norris, Steven G. Parker, Jaideep Ray, Sameer Shende, Theresa L. Windus, Shujia Zhou. IJHPCA (20): 163-202 (2006). Web SearchBibTeXDownload |
| 30 | Supporting Nested OpenMP Parallelism in the TAU Performance System. Alan Morris, Allen D. Malony, Sameer Shende. IWOMP 2006, 279-288. Web SearchBibTeXDownload |
| 29 | Workload Characterization Using the TAU Performance System. Sameer Shende, Allen D. Malony, Alan Morris. PARA 2006, 289-296. Web SearchBibTeXDownload |
| 28 | Optimization of Instrumentation in Parallel Performance Evaluation Tools. Sameer Shende, Allen D. Malony, Alan Morris. PARA 2006, 440-449. Web SearchBibTeXDownload |
| 27 | TAUg: Runtime Global Performance Data Access Using MPI. Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris. PVM/MPI 2006, 313-321. Web SearchBibTeXDownload |
| 2005 |
| 26 | Performance technology for parallel and distributed component software. Allen D. Malony, Sameer Shende, Nick Trebon, Jaideep Ray, Robert C. Armstrong, Craig Edward Rasmussen, Matthew J. Sottile. Concurrency - Practice and Experience (17): 117-141 (2005). Web SearchBibTeXDownload |
| 25 | Models for On-the-Fly Compensation of Measurement Overhead in Parallel Performance Profiling. Allen D. Malony, Sameer Shende. Euro-Par 2005, 72-82. Web SearchBibTeXDownload |
| 24 | Trace-Based Parallel Performance Overhead Compensation. Felix Wolf, Allen D. Malony, Sameer Shende, Alan Morris. HPCC 2005, 617-628. Web SearchBibTeXDownload |
| 23 | Overhead Compensation in Performance Profiling. Allen D. Malony, Sameer Shende. Parallel Processing Letters (15): 19-36 (2005). Web SearchBibTeXDownload |
| 22 | Phase-Based Parallel Performance Profiling. Allen D. Malony, Sameer Shende, Alan Morris. PARCO 2005, 203-210. Web SearchBibTeX |
| 21 | Performance Profiling Overhead Compensation for MPI Programs. Sameer Shende, Allen D. Malony, Alan Morris, Felix Wolf. PVM/MPI 2005, 359-367. Web SearchBibTeXDownload |
| 20 | A Scalable Approach to MPI Application Performance Analysis. Shirley Moore, Felix Wolf, Jack Dongarra, Sameer Shende, Allen D. Malony, Bernd Mohr. PVM/MPI 2005, 309-316. Web SearchBibTeXDownload |
| 2004 |
| 19 | Computational Quality of Service for Scientific Components. Boyana Norris, Jaideep Ray, Robert C. Armstrong, Lois C. McInnes, David E. Bernholdt, Wael R. Elwasif, Allen D. Malony, Sameer Shende. CBSE 2004, 264-271. Web SearchBibTeXDownload |
| 18 | Overhead Compensation in Performance Profiling. Allen D. Malony, Sameer Shende. Euro-Par 2004, 119-132. Web SearchBibTeXDownload |
| 17 | Performance Measurement and Modeling of Component Applications in a High Performance Computing Environment: A Case Study. Jaideep Ray, Nick Trebon, Robert C. Armstrong, Sameer Shende, Allen D. Malony. IPDPS 2004. Web SearchBibTeXDownload |
| 2003 |
| 16 | Integration and application of TAU in parallel Java environments. Sameer Shende, Allen D. Malony. Concurrency and Computation: Practice and Experience (15): 501-519 (2003). Web SearchBibTeXDownload |
| 15 | ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis. Robert Bell, Allen D. Malony, Sameer Shende. Euro-Par 2003, 17-26. Web SearchBibTeXDownload |
| 14 | Performance Instrumentation and Measurement for Terascale Systems. Jack Dongarra, Allen D. Malony, Shirley Moore, Philip Mucci, Sameer Shende. International Conference on Computational Science 2003, 53-62. Web SearchBibTeXDownload |
| 13 | Performance Analysis Integration in the Uintah Software Development Cycle. J. Davison de St. Germain, Alan Morris, Steven G. Parker, Allen D. Malony, Sameer Shende. International Journal of Parallel Programming (31): 35-53 (2003). Web SearchBibTeXDownload |
| 12 | A Performance Interface for Component-Based Applications. Sameer Shende, Allen D. Malony, Craig Edward Rasmussen, Matthew J. Sottile. IPDPS 2003, 278. Web SearchBibTeXDownload |
| 11 | Online Remote Trace Analysis of Parallel Applications on High-Performance Clusters. Holger Brunst, Allen D. Malony, Sameer Shende, Robert Bell. ISHPC 2003, 440-449. Web SearchBibTeXDownload |
| 10 | Online Performance Observation of Large-Scale Parallel Applications. Allen D. Malony, Sameer Shende, Robert Bell. PARCO 2003, 761-768. Web SearchBibTeX |
| 2002 |
| 9 | Integrating Performance Analysis in the Uintah Software Development Cycle. J. Davison de St. Germain, Alan Morris, Steven G. Parker, Allen D. Malony, Sameer Shende. ISHPC 2002, 190-206. Web SearchBibTeXDownload |
| 8 | Design and Prototype of a Performance Tool Interface for OpenMP. Bernd Mohr, Allen D. Malony, Sameer Shende, Felix Wolf. The Journal of Supercomputing (23): 105-128 (2002). Web SearchBibTeXDownload |
| 2001 |
| 7 | Integration and applications of the TAU performance system in parallel Java environments. Sameer Shende, Allen D. Malony. Java Grande 2001, 87-96. Web SearchBibTeXDownload |
| 6 | On using SCALEA for performance analysis of distributed and parallel programs. Hong Linh Truong, Thomas Fahringer, Georg Madsen, Allen D. Malony, Hans Moritsch, Sameer Shende. SC 2001, 34. Web SearchBibTeXDownload |
| 5 | Performance Technology for Complex Parallel and Distributed Systems. Allen D. Malony, Sameer Shende. Scalable Computing: Practice and Experience (4) (2001). Web SearchBibTeXDownload |
| 2000 |
| 4 | A Tool Framework for Static and Dynamic Analysis of Object-Oriented Software with Templates. Kathleen A. Lindlan, Janice E. Cuny, Allen D. Malony, Sameer Shende, Bernd Mohr, Reid Rivenburgh, Craig Edward Rasmussen. SC 2000, 49. Web SearchBibTeXDownload |
| 1999 |
| 3 | SMARTS: exploiting temporal locality and parallelism through vertical execution. Suvas Vajracharya, Steve Karmesin, Peter H. Beckman, James Crotinger, Allen D. Malony, Sameer Shende, R. R. Oldehoeft, Stephen Smith. International Conference on Supercomputing 1999, 302-310. Web SearchBibTeXDownload |
| 2 | A Runtime Monitoring Framework for the TAU Profiling System. Timothy J. Sheehan, Allen D. Malony, Sameer Shende. ISCOPE 1999, 170-181. Web SearchBibTeXDownload |
| 1998 |
| 1 | Dynamic Performance Callstack Sampling: Merging TAU and DAQV. Sameer Shende, Allen D. Malony, Steven T. Hackstadt. PARA 1998, 515-520. Web SearchBibTeXDownload |