2013
55. Inspector/executor load balancing algorithms for block-sparse tensor contractions. David Ozog, Sameer Shende, Allen D. Malony, Jeff R. Hammond, James Dinan, Pavan Balaji. ICS 2013, 483-484.
2012
54. Performance characterization of global address space applications: a case study with NWChem. Jeff R. Hammond, Sriram Krishnamoorthy, Sameer Shende, Nichols A. Romero, Allen D. Malony. Concurrency and Computation: Practice and Experience (24): 135-154 (2012).
2011
53. An Approach to Creating Performance Visualizations in a Parallel Profile Analysis Tool. Wyatt Spear, Allen D. Malony, Chee Wai Lee, Scott Biersdorff, Sameer Shende. Euro-Par Workshops (2) 2011, 156-165.
52. Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs. Allen D. Malony, Scott Biersdorff, Sameer Shende, Heike Jagode, Stanimire Tomov, Guido Juckeland, Robert Dietrich, Duncan Poole, Christopher Lamb. ICPP 2011, 176-185.
51. Characterizing I/O Performance Using the TAU Performance System. Sameer Shende, Allen D. Malony, Wyatt Spear, Karen Schuchardt. PARCO 2011, 647-655.
2010
50. Design and Implementation of a Hybrid Parallel Performance Measurement System. Alan Morris, Allen D. Malony, Sameer Shende, Kevin A. Huck. ICPP 2010, 492-501.
2009
49. A Generic and Configurable Source-Code Instrumentation Component. Markus Geimer, Sameer Shende, Allen D. Malony, Felix Wolf. ICCS (2) 2009, 696-705.
48. Performance Tool Integration in a GPU Programming Environment: Experiences with TAU and HMPP. Allen D. Malony, Shangkar Mayanglambam, Laurent Morin, Matthew J. Sottile, Stéphane Bihan, Sameer Shende, François Bodin. PARCO 2009, 685-692.
2008
47. Integrated parallel performance views. Aroon Nataraj, Allen D. Malony, Sameer Shende, Alan Morris. Cluster Computing (11): 57-73 (2008).
46. Observing Performance Dynamics Using Parallel Profile Snapshots. Alan Morris, Wyatt Spear, Allen D. Malony, Sameer Shende. Euro-Par 2008, 162-171.
45. Parametric Studies in Eclipse with TAU and PerfExplorer. Kevin A. Huck, Wyatt Spear, Allen D. Malony, Sameer Shende, Alan Morris. Euro-Par Workshops 2008, 283-294.
44. Performance Tool Workflows. Wyatt Spear, Allen D. Malony, Alan Morris, Sameer Shende. ICCS (3) 2008, 276-285.
43. Evolution of a Parallel Performance System. Allen D. Malony, Sameer Shende, Alan Morris, Scott Biersdorff, Wyatt Spear, Kevin A. Huck, Aroon Nataraj. Parallel Tools Workshop 2008, 169-190.
42. Knowledge support and automation for performance analysis with PerfExplorer 2.0. Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris. Scientific Programming (16): 123-134 (2008).
2007
41. Performance modeling of component assemblies. Nick Trebon, Alan Morris, Jaideep Ray, Sameer Shende, Allen D. Malony. Concurrency and Computation: Practice and Experience (19): 685-696 (2007).
40. TAUoverSupermon: Low-Overhead Online Parallel Performance Monitoring. Aroon Nataraj, Matthew J. Sottile, Alan Morris, Allen D. Malony, Sameer Shende. Euro-Par 2007, 85-96.
39. Compensation of Measurement Overhead in Parallel Performance Profiling. Allen D. Malony, Sameer Shende, Alan Morris, Felix Wolf. IJHPCA (21): 174-194 (2007).
38. Supporting Nested OpenMP Parallelism in the TAU Performance System. Alan Morris, Allen D. Malony, Sameer Shende. International Journal of Parallel Programming (35): 417-436 (2007).
37. Scalable, Automated Performance Analysis with TAU and PerfExplorer. Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris. PARCO 2007, 629-636.
2006
36. Kernel-Level Measurement for Integrated Parallel Performance Views: the KTAU Project. Aroon Nataraj, Allen D. Malony, Sameer Shende, Alan Morris. CLUSTER 2006.
35. Bridging the language gap in scientific computing: the Chasm approach. Craig Edward Rasmussen, Matthew J. Sottile, Sameer Shende, Allen D. Malony. Concurrency and Computation: Practice and Experience (18): 151-162 (2006).
34. Early Experiences with KTAU on the IBM BG/L. Aroon Nataraj, Allen D. Malony, Alan Morris, Sameer Shende. Euro-Par 2006, 99-110.
33. Integrating TAU with Eclipse: A Performance Analysis System in an Integrated Development Environment. Wyatt Spear, Allen D. Malony, Alan Morris, Sameer Shende. HPCC 2006, 230-239.
32. A Component Architecture for High-Performance Scientific Computing. David E. Bernholdt, Benjamin A. Allan, Robert C. Armstrong, Felipe Bertrand, Kenneth Chiu, Tamara Dahlgren, Kostadin Damevski, Wael R. Elwasif, Thomas Epperly, Madhusudhan Govindaraju, Daniel S. Katz, James Arthur Kohl, Manojkumar Krishnan, Gary Kumfert, Jay Walter Larson, Sophia Lefantzi, Michael J. Lewis, Allen D. Malony, Lois C. McInnes, Jarek Nieplocha, Boyana Norris, Steven G. Parker, Jaideep Ray, Sameer Shende, Theresa L. Windus, Shujia Zhou. IJHPCA (20): 163-202 (2006).
31. The Tau Parallel Performance System. Sameer Shende, Allen D. Malony. IJHPCA (20): 287-311 (2006).
30. Supporting Nested OpenMP Parallelism in the TAU Performance System. Alan Morris, Allen D. Malony, Sameer Shende. IWOMP 2006, 279-288.
29. Optimization of Instrumentation in Parallel Performance Evaluation Tools. Sameer Shende, Allen D. Malony, Alan Morris. PARA 2006, 440-449.
28. Workload Characterization Using the TAU Performance System. Sameer Shende, Allen D. Malony, Alan Morris. PARA 2006, 289-296.
27. TAUg: Runtime Global Performance Data Access Using MPI. Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris. PVM/MPI 2006, 313-321.
2005
26. Performance technology for parallel and distributed component software. Allen D. Malony, Sameer Shende, Nick Trebon, Jaideep Ray, Robert C. Armstrong, Craig Edward Rasmussen, Matthew J. Sottile. Concurrency and Computation: Practice and Experience (17): 117-141 (2005).
25. Models for On-the-Fly Compensation of Measurement Overhead in Parallel Performance Profiling. Allen D. Malony, Sameer Shende. Euro-Par 2005, 72-82.
24. Trace-Based Parallel Performance Overhead Compensation. Felix Wolf, Allen D. Malony, Sameer Shende, Alan Morris. HPCC 2005, 617-628.
23. Overhead Compensation in Performance Profiling. Allen D. Malony, Sameer Shende. Parallel Processing Letters (15): 19-36 (2005).
22. Phase-Based Parallel Performance Profiling. Allen D. Malony, Sameer Shende, Alan Morris. PARCO 2005, 203-210.
21. A Scalable Approach to MPI Application Performance Analysis. Shirley Moore, Felix Wolf, Jack Dongarra, Sameer Shende, Allen D. Malony, Bernd Mohr. PVM/MPI 2005, 309-316.
20. Performance Profiling Overhead Compensation for MPI Programs. Sameer Shende, Allen D. Malony, Alan Morris, Felix Wolf. PVM/MPI 2005, 359-367.
2004
19. Computational Quality of Service for Scientific Components. Boyana Norris, Jaideep Ray, Robert C. Armstrong, Lois C. McInnes, David E. Bernholdt, Wael R. Elwasif, Allen D. Malony, Sameer Shende. CBSE 2004, 264-271.
18. Overhead Compensation in Performance Profiling. Allen D. Malony, Sameer Shende. Euro-Par 2004, 119-132.
17. Performance Measurement and Modeling of Component Applications in a High Performance Computing Environment: A Case Study. Jaideep Ray, Nick Trebon, Robert C. Armstrong, Sameer Shende, Allen D. Malony. IPDPS 2004.
2003
16. Integration and application of TAU in parallel Java environments. Sameer Shende, Allen D. Malony. Concurrency and Computation: Practice and Experience (15): 501-519 (2003).
15. ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis. Robert Bell, Allen D. Malony, Sameer Shende. Euro-Par 2003, 17-26.
14. Performance Instrumentation and Measurement for Terascale Systems. Jack Dongarra, Allen D. Malony, Shirley Moore, Philip Mucci, Sameer Shende. International Conference on Computational Science 2003, 53-62.
13. Performance Analysis Integration in the Uintah Software Development Cycle. J. Davison de St. Germain, Alan Morris, Steven G. Parker, Allen D. Malony, Sameer Shende. International Journal of Parallel Programming (31): 35-53 (2003).
12. A Performance Interface for Component-Based Applications. Sameer Shende, Allen D. Malony, Craig Edward Rasmussen, Matthew J. Sottile. IPDPS 2003, 278.
11. Online Remote Trace Analysis of Parallel Applications on High-Performance Clusters. Holger Brunst, Allen D. Malony, Sameer Shende, Robert Bell. ISHPC 2003, 440-449.
10. Online Performance Observation of Large-Scale Parallel Applications. Allen D. Malony, Sameer Shende, Robert Bell. PARCO 2003, 761-768.
2002
9. Integrating Performance Analysis in the Uintah Software Development Cycle. J. Davison de St. Germain, Alan Morris, Steven G. Parker, Allen D. Malony, Sameer Shende. ISHPC 2002, 190-206.
8. Design and Prototype of a Performance Tool Interface for OpenMP. Bernd Mohr, Allen D. Malony, Sameer Shende, Felix Wolf. The Journal of Supercomputing (23): 105-128 (2002).
2001
7. Integration and applications of the TAU performance system in parallel Java environments. Sameer Shende, Allen D. Malony. Java Grande 2001, 87-96.
6. On using SCALEA for performance analysis of distributed and parallel programs. Hong Linh Truong, Thomas Fahringer, Georg Madsen, Allen D. Malony, Hans Moritsch, Sameer Shende. SC 2001, 34.
5. Performance Technology for Complex Parallel and Distributed Systems. Allen D. Malony, Sameer Shende. Scalable Computing: Practice and Experience (4) (2001).
2000
4. A Tool Framework for Static and Dynamic Analysis of Object-Oriented Software with Templates. Kathleen A. Lindlan, Janice E. Cuny, Allen D. Malony, Sameer Shende, Bernd Mohr, Reid Rivenburgh, Craig Edward Rasmussen. SC 2000, 49.
1999
3. SMARTS: exploiting temporal locality and parallelism through vertical execution. Suvas Vajracharya, Steve Karmesin, Peter H. Beckman, James Crotinger, Allen D. Malony, Sameer Shende, R. R. Oldehoeft, Stephen Smith. International Conference on Supercomputing 1999, 302-310.
2. A Runtime Monitoring Framework for the TAU Profiling System. Timothy J. Sheehan, Allen D. Malony, Sameer Shende. ISCOPE 1999, 170-181.
1998
1. Dynamic Performance Callstack Sampling: Merging TAU and DAQV. Sameer Shende, Allen D. Malony, Steven T. Hackstadt. PARA 1998, 515-520.
from DBLP and Google Scholar
  • Also known as: Sameer Suresh Shende