| 2011 |
| 268 | High Performance Pipelined Process Migration with RDMA. Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, Dhabaleswar K. Panda. CCGRID 2011, 314-323. |
| 267 | Optimized Non-contiguous MPI Datatype Communication for GPU Clusters: Design, Implementation and Evaluation with MVAPICH2. Hao Wang, Sreeram Potluri, Miao Luo, Ashish Kumar Singh, Xiangyong Ouyang, Sayantan Sur, Dhabaleswar K. Panda. CLUSTER 2011, 308-316. |
| 266 | Can a Decentralized Metadata Service Layer Benefit Parallel Filesystems?. Vilobh Meshram, Xavier Besseron, Xiangyong Ouyang, Raghunath Rajachandrasekar, Ravi Prakash, Dhabaleswar K. Panda. CLUSTER 2011, 484-493. |
| 265 | Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Clusters. Hari Subramoni, Krishna Chaitanya Kandalla, Jérôme Vienne, Sayantan Sur, B. Barth, Karen A. Tomko, R. McLay, Karl W. Schulz, Dhabaleswar K. Panda. CLUSTER 2011, 317-325. |
| 264 | MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefit. Ashish Kumar Singh, Sreeram Potluri, Hao Wang, Krishna Chaitanya Kandalla, Sayantan Sur, Dhabaleswar K. Panda. CLUSTER 2011, 420-427. |
| 263 | High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT. Krishna Chaitanya Kandalla, Hari Subramoni, Karen A. Tomko, Dmitry Pekurovsky, Sayantan Sur, Dhabaleswar K. Panda. Computer Science - R&D (26): 237-246 (2011). |
| 262 | MVAPICH2-GPU: optimized GPU to GPU communication for InfiniBand clusters. Hao Wang, Sreeram Potluri, Miao Luo, Ashish Kumar Singh, Sayantan Sur, Dhabaleswar K. Panda. Computer Science - R&D (26): 257-266 (2011). |
| 261 | Collective Communication, Network Support For. Dhabaleswar K. Panda, Sayantan Sur, Hari Subramoni, Krishna Chaitanya Kandalla. Encyclopedia of Parallel Computing 2011, 327-334. |
| 260 | Design and Implementation of Key Proposed MPI-3 One-Sided Communication Semantics on InfiniBand. Sreeram Potluri, Sayantan Sur, Devendar Bureddy, Dhabaleswar K. Panda. EuroMPI 2011, 321-324. |
| 259 | Optimizing MPI One Sided Communication on Multi-core InfiniBand Clusters Using Shared Memory Backed Windows. Sreeram Potluri, Hao Wang, Vijay Dhanraj, Sayantan Sur, Dhabaleswar K. Panda. EuroMPI 2011, 99-109. |
| 258 | Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL. Krishna Chaitanya Kandalla, Hari Subramoni, Jérôme Vienne, S. Pai Raikar, Karen Tomko, Sayantan Sur, Dhabaleswar K. Panda. Hot Interconnects 2011, 27-34. |
| 257 | Beyond block I/O: Rethinking traditional storage primitives. Xiangyong Ouyang, David W. Nellans, Robert Wipfel, David Flynn, Dhabaleswar K. Panda. HPCA 2011, 301-311. |
| 256 | Memcached Design on High Performance RDMA Capable Interconnects. Jithin Jose, Hari Subramoni, Miao Luo, Minjia Zhang, Jian Huang, Md. Wasi-ur-Rahman, Nusrat S. Islam, Xiangyong Ouyang, Hao Wang, Sayantan Sur, Dhabaleswar K. Panda. ICPP 2011, 743-752. |
| 255 | CRFS: A Lightweight User-Level Filesystem for Generic Checkpoint/Restart. Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, Hao Wang, Jian Huang, Dhabaleswar K. Panda. ICPP 2011, 375-384. |
| 254 | Codesign for InfiniBand Clusters. Sayantan Sur, Sreeram Potluri, Krishna Chaitanya Kandalla, Hari Subramoni, Dhabaleswar K. Panda, Karen Tomko. IEEE Computer (44): 31-36 (2011). |
| 2010 |
| 253 | High Performance Data Transfer in Grid Environment Using GridFTP over InfiniBand. Hari Subramoni, Ping Lai, Rajkumar Kettimuthu, Dhabaleswar K. Panda. CCGRID 2010, 557-564. |
| 252 | An MPI-Stream Hybrid Programming Model for Computational Clusters. Emilio Pasquale Mancini, Gregory Marsh, Dhabaleswar K. Panda. CCGRID 2010, 323-330. |
| 251 | RDMA-Based Job Migration Framework for MPI over InfiniBand. Xiangyong Ouyang, Sonya Marcarelli, Raghunath Rajachandrasekar, Dhabaleswar K. Panda. CLUSTER 2010, 116-125. |
| 250 | Designing truly one-sided MPI-2 RMA intra-node communication on multi-core systems. Ping Lai, Sayantan Sur, Dhabaleswar K. Panda. Computer Science - R&D (25): 3-14 (2010). |
| 249 | Improving Application Performance and Predictability Using Multiple Virtual Lanes in Modern Multi-core InfiniBand Clusters. Hari Subramoni, Ping Lai, Sayantan Sur, Dhabaleswar K. Panda. ICPP 2010, 462-471. |
| 248 | Designing Power-Aware Collective Communication Algorithms for InfiniBand Clusters. Krishna Chaitanya Kandalla, Emilio Pasquale Mancini, Sayantan Sur, Dhabaleswar K. Panda. ICPP 2010, 218-227. |
| 247 | High Performance Design and Implementation of Nemesis Communication Layer for Two-Sided and One-Sided MPI Semantics in MVAPICH2. Miao Luo, Sreeram Potluri, Ping Lai, Emilio Pasquale Mancini, Hari Subramoni, Krishna Chaitanya Kandalla, Sayantan Sur, Dhabaleswar K. Panda. ICPP Workshops 2010, 377-386. |
| 246 | Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application. Sreeram Potluri, Ping Lai, Karen A. Tomko, Sayantan Sur, Yifeng Cui, Mahidhar Tatineni, Karl W. Schulz, William L. Barth, Amitava Majumdar, Dhabaleswar K. Panda. ICS 2010, 17-25. |
| 245 | Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with Scatter and Gather. Krishna Chaitanya Kandalla, Hari Subramoni, Abhinav Vishnu, Dhabaleswar K. Panda. IPDPS Workshops 2010, 1-8. |
| 244 | Designing high-performance and resilient message passing on InfiniBand. Matthew J. Koop, Pavel Shamis, Ishai Rabinovitz, Dhabaleswar K. Panda. IPDPS Workshops 2010, 1-7. |
| 243 | Scalable Earthquake Simulation on Petascale Supercomputers. Yifeng Cui, Kim B. Olsen, Thomas Jordan, Kwangyoon Lee, Jun Zhou, Patrick Small, Daniel Roten, Geoffrey Ely, Dhabaleswar K. Panda, Amit Chourasia, John Levesque, Steven M. Day, Philip Maechling. SC 2010, 1-20. |
| 2009 |
| 242 | Natively Supporting True One-Sided Communication in. Gopalakrishnan Santhanaraman, Pavan Balaji, K. Gopalakrishnan, Rajeev Thakur, William Gropp, Dhabaleswar K. Panda. CCGRID 2009, 380-387. |
| 241 | RDMA over Ethernet - A preliminary study. Hari Subramoni, Ping Lai, Miao Luo, Dhabaleswar K. Panda. CLUSTER 2009, 1-9. |
| 240 | Design alternatives for implementing fence synchronization in MPI-2 one-sided communication for InfiniBand clusters. Gopalakrishnan Santhanaraman, Tejus Gangadharappa, Sundeep Narravula, Amith R. Mamidala, Dhabaleswar K. Panda. CLUSTER 2009, 1-9. |
| 239 | An efficient hardware-software approach to network fault tolerance with InfiniBand. Abhinav Vishnu, Manojkumar Krishnan, Dhabaleswar K. Panda. CLUSTER 2009, 1-9. |
| 238 | Reducing network contention with mixed workloads on modern multicore clusters. Matthew J. Koop, Miao Luo, Dhabaleswar K. Panda. CLUSTER 2009, 1-10. |
| 237 | ProOnE: a general-purpose protocol onload engine for multi- and many-core architectures. Ping Lai, Pavan Balaji, Rajeev Thakur, Dhabaleswar K. Panda. Computer Science - R&D (23): 133-142 (2009). |
| 236 | Topology agnostic hot-spot avoidance with InfiniBand. Abhinav Vishnu, Matthew J. Koop, Adam Moody, Amith R. Mamidala, Sundeep Narravula, Dhabaleswar K. Panda. Concurrency and Computation: Practice and Experience (21): 301-319 (2009). |
| 235 | Fast checkpointing by Write Aggregation with Dynamic Buffer and Interleaving on multicore architecture. Xiangyong Ouyang, Karthik Gopalakrishnan, Tejus Gangadharappa, Dhabaleswar K. Panda. HiPC 2009, 99-108. |
| 234 | Accelerating Checkpoint Operation by Node-Level Write Aggregation on Multicore Systems. Xiangyong Ouyang, Karthik Gopalakrishnan, Dhabaleswar K. Panda. ICPP 2009, 34-41. |
| 233 | Designing Efficient FTP Mechanisms for High Performance Data-Transfer over InfiniBand. Ping Lai, Hari Subramoni, Sundeep Narravula, Amith R. Mamidala, Dhabaleswar K. Panda. ICPP 2009, 156-163. |
| 232 | CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems. Rinku Gupta, Pete Beckman, Byung-Hoon Park, Ewing L. Lusk, Paul Hargrove, Al Geist, Dhabaleswar K. Panda, Andrew Lumsdaine, Jack Dongarra. ICPP 2009, 237-245. |
| 231 | Designing and Evaluating MPI-2 Dynamic Process Management Support for InfiniBand. Tejus Gangadharappa, Matthew J. Koop, Dhabaleswar K. Panda. ICPP Workshops 2009, 89-96. |
| 230 | TupleQ: Fully-asynchronous and zero-copy MPI over InfiniBand. Matthew J. Koop, Jaidev K. Sridhar, Dhabaleswar K. Panda. IPDPS 2009, 1-8. |
| 229 | Designing multi-leader-based Allgather algorithms for multi-core clusters. Krishna Chaitanya Kandalla, Hari Subramoni, Gopalakrishnan Santhanaraman, Matthew J. Koop, Dhabaleswar K. Panda. IPDPS 2009, 1-8. |
| 228 | IPDPS 2007: Comments from the Guest Editor. Dhabaleswar K. Panda. J. Parallel Distrib. Comput. (69): 679 (2009). |
| 227 | Impact of Node Level Caching in MPI Job Launch Mechanisms. Jaidev K. Sridhar, Dhabaleswar K. Panda. PVM/MPI 2009, 230-239. |
| 2008 |
| 226 | Optimized Distributed Data Sharing Substrate in Multi-core Commodity Clusters: A Comprehensive Study with Applications. Karthikeyan Vaidyanathan, Ping Lai, Sundeep Narravula, Dhabaleswar K. Panda. CCGRID 2008, 138-145. |
| 225 | MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics. Amith R. Mamidala, Rahul Kumar, Debraj De, Dhabaleswar K. Panda. CCGRID 2008, 130-137. |
| 224 | Advanced RDMA-Based Admission Control for Modern Data-Centers. Ping Lai, Sundeep Narravula, Karthikeyan Vaidyanathan, Dhabaleswar K. Panda. CCGRID 2008, 384-391. |
| 223 | Scalable MPI design over InfiniBand using eXtended Reliable Connection. Matthew J. Koop, Jaidev K. Sridhar, Dhabaleswar K. Panda. CLUSTER 2008, 203-212. |
| 222 | Designing next generation clusters with InfiniBand and 10GE/iWARP: Opportunities and challenges. Dhabaleswar K. Panda. CLUSTER 2008, 202. |
| 221 | Efficient one-copy MPI shared memory communication in Virtual Machines. Wei Huang, Matthew J. Koop, Dhabaleswar K. Panda. CLUSTER 2008, 107-115. |
| 220 | Designing a High-Performance Clustered NAS: A Case Study with pNFS over RDMA on InfiniBand. Ranjit Noronha, Xiangyong Ouyang, Dhabaleswar K. Panda. HiPC 2008, 465-477. |
| 219 | Sockets Direct Protocol for Hybrid Network Stacks: A Case Study with iWARP over 10G Ethernet. Pavan Balaji, Sitha Bhagvat, Rajeev Thakur, Dhabaleswar K. Panda. HiPC 2008, 478-490. |
| 218 | ScELA: Scalable and Extensible Launching Architecture for Clusters. Jaidev K. Sridhar, Matthew J. Koop, Jonathan L. Perkins, Dhabaleswar K. Panda. HiPC 2008, 323-335. |
| 217 | Performance Analysis and Evaluation of PCIe 2.0 and Quad-Data Rate InfiniBand. Matthew J. Koop, Wei Huang, Karthik Gopalakrishnan, Dhabaleswar K. Panda. Hot Interconnects 2008, 85-92. |
| 216 | Designing an Efficient Kernel-Level and User-Level Hybrid Approach for MPI Intra-Node Communication on Multi-Core Systems. Lei Chai, Ping Lai, Hyun-Wook Jin, Dhabaleswar K. Panda. ICPP 2008, 222-229. |
| 215 | Performance of HPC Middleware over InfiniBand WAN. Sundeep Narravula, Hari Subramoni, Ping Lai, Ranjit Noronha, Dhabaleswar K. Panda. ICPP 2008, 304-311. |
| 214 | IMCa: A High Performance Caching Front-End for GlusterFS on InfiniBand. Ranjit Noronha, Dhabaleswar K. Panda. ICPP 2008, 462-469. |
| 213 | Can software reliability outperform hardware reliability on high performance interconnects?: a case study with MPI over InfiniBand. Matthew J. Koop, Rahul Kumar, Dhabaleswar K. Panda. ICS 2008, 145-154. |
| 212 | Scaling alltoall collective on multi-core systems. Rahul Kumar, Amith R. Mamidala, Dhabaleswar K. Panda. IPDPS 2008, 1-8. |
| 211 | MVAPICH-Aptus: Scalable high-performance multi-transport MPI over InfiniBand. Matthew J. Koop, Terry Jones, Dhabaleswar K. Panda. IPDPS 2008, 1-12. |
| 210 | Designing passive synchronization for MPI-2 one-sided communication to maximize overlap. Gopalakrishnan Santhanaraman, Sundeep Narravula, Dhabaleswar K. Panda. IPDPS 2008, 1-11. |
| 209 | Lock-Free Asynchronous Rendezvous Design for MPI Point-to-Point Communication. Rahul Kumar, Amith R. Mamidala, Matthew J. Koop, Gopalakrishnan Santhanaraman, Dhabaleswar K. Panda. PVM/MPI 2008, 185-193. |
| 2007 |
| 208 | Reducing Connection Memory Requirements of MPI for InfiniBand Clusters: A Message Coalescing Approach. Matthew J. Koop, Terry Jones, Dhabaleswar K. Panda. CCGRID 2007, 495-504. |
| 207 | Understanding the Impact of Multi-Core Architecture in Cluster Computing: A Case Study with Intel Dual-Core System. Lei Chai, Qi Gao, Dhabaleswar K. Panda. CCGRID 2007, 471-478. |
| 206 | High Performance Distributed Lock Management Services using Network-based Remote Atomic Operations. Sundeep Narravula, Amith R. Mamidala, Abhinav Vishnu, Karthikeyan Vaidyanathan, Dhabaleswar K. Panda. CCGRID 2007, 583-590. |
| 205 | Hot-Spot Avoidance With Multi-Pathing Over InfiniBand: An MPI Perspective. Abhinav Vishnu, Matthew J. Koop, Adam Moody, Amith R. Mamidala, Sundeep Narravula, Dhabaleswar K. Panda. CCGRID 2007, 479-486. |
| 204 | Designing high-end computing systems with InfiniBand and 10-Gigabit Ethernet iWARP. Dhabaleswar K. Panda, Pavan Balaji. CLUSTER 2007. |
| 203 | High performance virtual machine migration with RDMA over modern interconnects. Wei Huang, Qi Gao, Jiuxing Liu, Dhabaleswar K. Panda. CLUSTER 2007, 11-20. |
| 202 | Zero-copy protocol for MPI using InfiniBand unreliable datagram. Matthew J. Koop, Sayantan Sur, Dhabaleswar K. Panda. CLUSTER 2007, 179-186. |
| 201 | Lightweight kernel-level primitives for high-performance MPI intra-node communication over multi-core systems. Hyun-Wook Jin, Sayantan Sur, Lei Chai, Dhabaleswar K. Panda. CLUSTER 2007, 446-451. |
| 200 | Efficient asynchronous memory copy operations on multi-core systems and I/OAT. Karthikeyan Vaidyanathan, Lei Chai, Wei Huang, Dhabaleswar K. Panda. CLUSTER 2007, 159-168. |
| 199 | Advanced Flow-control Mechanisms for the Sockets Direct Protocol over InfiniBand. Pavan Balaji, Sitha Bhagvat, Dhabaleswar K. Panda, Rajeev Thakur, William Gropp. ICPP 2007, 73. |
| 198 | High Performance MPI over iWARP: Early Experiences. Sundeep Narravula, Amith R. Mamidala, Abhinav Vishnu, Gopalakrishnan Santhanaraman, Dhabaleswar K. Panda. ICPP 2007, 46. |
| 197 | Group-based Coordinated Checkpointing for MPI: A Case Study on InfiniBand. Qi Gao, Wei Huang, Matthew J. Koop, Dhabaleswar K. Panda. ICPP 2007, 47. |
| 196 | Designing NFS with RDMA for Security, Performance and Scalability. Ranjit Noronha, Lei Chai, Thomas Talpey, Dhabaleswar K. Panda. ICPP 2007, 49. |
| 195 | High performance MPI design using unreliable datagram for ultra-scale InfiniBand clusters. Matthew J. Koop, Sayantan Sur, Qi Gao, Dhabaleswar K. Panda. ICS 2007, 180-189. |
| 194 | Automatic Path Migration over InfiniBand: Early Experiences. Abhinav Vishnu, Amith R. Mamidala, Sundeep Narravula, Dhabaleswar K. Panda. IPDPS 2007, 1-8. |
| 193 | Designing Efficient Asynchronous Memory Operations Using Hardware Copy Engine: A Case Study with I/OAT. Karthikeyan Vaidyanathan, Wei Huang, Lei Chai, Dhabaleswar K. Panda. IPDPS 2007, 1-8. |
| 192 | Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers. Karthikeyan Vaidyanathan, Sundeep Narravula, Pavan Balaji, Dhabaleswar K. Panda. IPDPS 2007, 1-6. |
| 191 | High Performance MPI on IBM 12x InfiniBand Architecture. Abhinav Vishnu, Brad Benton, Dhabaleswar K. Panda. IPDPS 2007, 1-8. |
| 190 | Improving Scalability of OpenMP Applications on Multi-core Systems Using Large Page Support. Ranjit Noronha, Dhabaleswar K. Panda. IPDPS 2007, 1-8. |
| 189 | Benefits of I/O Acceleration Technology (I/OAT) in Clusters. Karthikeyan Vaidyanathan, Dhabaleswar K. Panda. ISPASS 2007, 220-229. |
| 188 | pNFS/PVFS2 over InfiniBand: early experiences. Lei Chai, Xiangyong Ouyang, Ranjit Noronha, Dhabaleswar K. Panda. PDSW 2007, 5-11. |
| 187 | On using connection-oriented vs. connection-less transport for performance and scalability of collective and one-sided operations: trade-offs and impact. Amith R. Mamidala, Sundeep Narravula, Abhinav Vishnu, Gopalakrishnan Santhanaraman, Dhabaleswar K. Panda. PPOPP 2007, 46-54. |
| 186 | MPI-2 One-Sided Usage and Implementation for Read Modify Write Operations: A Case Study with HPCC. Gopalakrishnan Santhanaraman, Sundeep Narravula, Amith R. Mamidala, Dhabaleswar K. Panda. PVM/MPI 2007, 251-259. |
| 185 | Virtual machine aware communication libraries for high performance computing. Wei Huang, Matthew J. Koop, Qi Gao, Dhabaleswar K. Panda. SC 2007, 9. |
| 184 | DMTracker: finding bugs in large-scale parallel programs by detecting anomaly in data movements. Qi Gao, Feng Qin, Dhabaleswar K. Panda. SC 2007, 15. |
| 183 | Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP. Pavan Balaji, Wu-chun Feng, Sitha Bhagvat, Dhabaleswar K. Panda, Rajeev Thakur, William Gropp. SC 2007, 35. |
| 182 | Nomad: migrating OS-bypass networks in virtual machines. Wei Huang, Jiuxing Liu, Matthew J. Koop, Bülent Abali, Dhabaleswar K. Panda. VEE 2007, 158-168. |
| 2006 |
| 181 | MPI over uDAPL: Can High Performance and Portability Exist Across Architectures?. Lei Chai, Ranjit Noronha, Dhabaleswar K. Panda. CCGRID 2006, 19-26. |
| 180 | Designing Efficient Cooperative Caching Schemes for Multi-Tier Data-Centers over RDMA-enabled Networks. Sundeep Narravula, Hyun-Wook Jin, Karthikeyan Vaidyanathan, Dhabaleswar K. Panda. CCGRID 2006, 401-408. |
| 179 | Design of High Performance MVAPICH2: MPI2 over InfiniBand. Wei Huang, Gopalakrishnan Santhanaraman, Hyun-Wook Jin, Qi Gao, Dhabaleswar K. Panda. CCGRID 2006, 43-48. |
| 178 | Exploiting RDMA operations for Providing Efficient Fine-Grained Resource Monitoring in Cluster-based Servers. Karthikeyan Vaidyanathan, Hyun-Wook Jin, Dhabaleswar K. Panda. CLUSTER 2006. |
| 177 | Designing High Performance and Scalable MPI Intra-node Communication Support for Clusters. Lei Chai, Albert Hartono, Dhabaleswar K. Panda. CLUSTER 2006. |
| 176 | DDSS: A Low-Overhead Distributed Data Sharing Substrate for Cluster-Based Data-Centers over Modern Interconnects. Karthikeyan Vaidyanathan, Sundeep Narravula, Dhabaleswar K. Panda. HiPC 2006, 472-484. |
| 175 | High Performance Block I/O for Global File System (GFS) with InfiniBand RDMA. Shuang Liang, Weikuan Yu, Dhabaleswar K. Panda. ICPP 2006, 391-398. |
| 174 | Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand. Qi Gao, Weikuan Yu, Wei Huang, Dhabaleswar K. Panda. ICPP 2006, 471-478. |
| 173 | A case for high performance computing with virtual machines. Wei Huang, Jiuxing Liu, Bülent Abali, Dhabaleswar K. Panda. ICS 2006, 125-134. |
| 172 | Bridging the Ethernet-Ethernot Performance Gap. Pavan Balaji, Wu-chun Feng, Dhabaleswar K. Panda. IEEE Micro (26): 24-40 (2006). |
| 171 | High Performance Remote Memory Access Communication: The ARMCI Approach. Jarek Nieplocha, Vinod Tipparaju, Manojkumar Krishnan, Dhabaleswar K. Panda. IJHPCA (20): 233-253 (2006). |
| 170 | NIC-based reduction algorithms for large-scale clusters. Fabrizio Petrini, Adam Moody, Juan Fernández Peinador, Eitan Frachtenberg, Dhabaleswar K. Panda. IJHPCN (4): 122-136 (2006). |
| 169 | Efficient SMP-aware MPI-level broadcast over InfiniBand's hardware multicast. Amith R. Mamidala, Lei Chai, Hyun-Wook Jin, Dhabaleswar K. Panda. IPDPS 2006. |
| 168 | Adaptive connection management for scalable MPI over InfiniBand. Weikuan Yu, Qi Gao, Dhabaleswar K. Panda. IPDPS 2006. |
| 167 | Designing next generation data-centers with advanced communication protocols and systems services. Pavan Balaji, Karthikeyan Vaidyanathan, Sundeep Narravula, Hyun-Wook Jin, Dhabaleswar K. Panda. IPDPS 2006. |
| 166 | Shared receive queue based scalable MPI design for InfiniBand clusters. Sayantan Sur, Lei Chai, Hyun-Wook Jin, Dhabaleswar K. Panda. IPDPS 2006. |
| 165 | Asynchronous zero-copy communication for synchronous sockets in the sockets direct protocol (SDP) over InfiniBand. Pavan Balaji, Sitha Bhagvat, Hyun-Wook Jin, Dhabaleswar K. Panda. IPDPS 2006. |
| 164 | Benefits of high speed interconnects to cluster file systems: a case study with Lustre. Weikuan Yu, Ranjit Noronha, Shuang Liang, Dhabaleswar K. Panda. IPDPS 2006. |
| 163 | RDMA read based rendezvous protocol for MPI over InfiniBand: design alternatives and benefits. Sayantan Sur, Hyun-Wook Jin, Lei Chai, Dhabaleswar K. Panda. PPOPP 2006, 32-39. |
| 162 | Efficient Shared Memory and RDMA Based Design for MPI_Allgather over InfiniBand. Amith R. Mamidala, Abhinav Vishnu, Dhabaleswar K. Panda. PVM/MPI 2006, 66-75. |
| 161 | Scalable systems software - A software based approach for providing network fault tolerance in clusters with uDAPL interface: MPI level design and performance evaluation. Abhinav Vishnu, Prachi Gupta, Amith R. Mamidala, Dhabaleswar K. Panda. SC 2006, 85. |
| 160 | MPI and communication - High-performance and scalable MPI over InfiniBand with reduced memory usage: an in-depth performance analysis. Sayantan Sur, Matthew J. Koop, Dhabaleswar K. Panda. SC 2006, 105. |
| 159 | High Performance VMM-Bypass I/O in Virtual Machines. Jiuxing Liu, Wei Huang, Bülent Abali, Dhabaleswar K. Panda. USENIX Annual Technical Conference, General Track 2006, 29-42. |
| 2005 |
| 158 | Architecture for caching responses with multiple dynamic dependencies in multi-tier data-centers over InfiniBand. Sundeep Narravula, Pavan Balaji, Karthikeyan Vaidyanathan, Hyun-Wook Jin, Dhabaleswar K. Panda. CCGRID 2005, 374-381. |
| 157 | Can high performance software DSM systems designed with InfiniBand features benefit from PCI-Express?. Ranjit Noronha, Dhabaleswar K. Panda. CCGRID 2005, 945-952. |
| 156 | Supporting iWARP Compatibility and Features for Regular Network Adapters. Pavan Balaji, Hyun-Wook Jin, Karthikeyan Vaidyanathan, Dhabaleswar K. Panda. CLUSTER 2005, 1-10. |
| 155 | Swapping to Remote Memory over InfiniBand: An Approach using a High Performance Network Block Device. Shuang Liang, Ranjit Noronha, Dhabaleswar K. Panda. CLUSTER 2005, 1-10. |
| 154 | Head-to-TOE Evaluation of High-Performance Sockets over Protocol Offload Engines. Pavan Balaji, Wu-chun Feng, Qi Gao, Ranjit Noronha, Weikuan Yu, Dhabaleswar K. Panda. CLUSTER 2005, 1-10. |
| 153 | Performance Evaluation of MM5 on Clusters with Modern Interconnects: Scalability and Impact. Ranjit Noronha, Dhabaleswar K. Panda. Euro-Par 2005, 134-145. |
| 152 | Supporting MPI-2 One Sided Communication on Multi-rail InfiniBand Clusters: Design Challenges and Performance Benefits. Abhinav Vishnu, Gopalakrishnan Santhanaraman, Wei Huang, Hyun-Wook Jin, Dhabaleswar K. Panda. HiPC 2005, 137-147. |
| 151 | High Performance RDMA Based All-to-All Broadcast for InfiniBand Clusters. Sayantan Sur, Uday Bondhugula, Amith R. Mamidala, Hyun-Wook Jin, Dhabaleswar K. Panda. HiPC 2005, 148-157. |
| 150 | Performance Characterization of a 10-Gigabit Ethernet TOE. Wu-chun Feng, Pavan Balaji, C. Baron, Laxmi N. Bhuyan, Dhabaleswar K. Panda. Hot Interconnects 2005, 58-63. |
| 149 | Can Memory-Less Network Adapters Benefit Next-Generation InfiniBand Systems?. Sayantan Sur, Abhinav Vishnu, Hyun-Wook Jin, Wei Huang, Dhabaleswar K. Panda. Hot Interconnects 2005, 45-50. |
| 148 | LiMIC: Support for High-Performance MPI Intra-node Communication on Linux Cluster. Hyun-Wook Jin, Sayantan Sur, Lei Chai, Dhabaleswar K. Panda. ICPP 2005, 184-191. |
| 147 | High performance support of parallel virtual file system (PVFS2) over Quadrics. Weikuan Yu, Shuang Liang, Dhabaleswar K. Panda. ICS 2005, 323-331. |
| 146 | Evaluating InfiniBand Performance with PCI Express. Jiuxing Liu, Amith R. Mamidala, Abhinav Vishnu, Dhabaleswar K. Panda. IEEE Micro (25): 20-29 (2005). |
| 145 | Designing Zero-Copy Message Passing Interface Derived Datatype Communication Over InfiniBand: Alternative Approaches and Performance Evaluation. Gopalakrishnan Santhanaraman, Jiesheng Wu, Wei Huang, Dhabaleswar K. Panda. IJHPCA (19): 129-142 (2005). |
| 144 | High Performance Broadcast Support in LA-MPI Over Quadrics. Weikuan Yu, Sayantan Sur, Dhabaleswar K. Panda, Rob T. Aulwes, Richard L. Graham. IJHPCA (19): 453-463 (2005). |
| 143 | Selective preemption strategies for parallel job scheduling. Rajkumar Kettimuthu, Vijay Subramani, Srividya Srinivasan, Thiagaraja Gopalsamy, Dhabaleswar K. Panda, P. Sadayappan. IJHPCN (3): 122-152 (2005). |
| 142 | Scheduling of MPI-2 One Sided Operations over InfiniBand. Wei Huang, Gopalakrishnan Santhanaraman, Hyun-Wook Jin, Dhabaleswar K. Panda. IPDPS 2005. |
| 141 | Design and Implementation of Open MPI over Quadrics/Elan4. Weikuan Yu, Timothy S. Woodall, Richard L. Graham, Dhabaleswar K. Panda. IPDPS 2005. |
| 140 | Analysis of Design Considerations for Optimizing Multi-Channel MPI over InfiniBand. Lei Chai, Sayantan Sur, Hyun-Wook Jin, Dhabaleswar K. Panda. IPDPS 2005. |
| 139 | Performance Modeling of Subnet Management on Fat Tree InfiniBand Networks using OpenSM. Abhinav Vishnu, Amith R. Mamidala, Hyun-Wook Jin, Dhabaleswar K. Panda. IPDPS 2005. |
| 138 | On the provision of prioritization and soft QoS in dynamically reconfigurable shared data-centers over InfiniBand. Pavan Balaji, Sundeep Narravula, Karthikeyan Vaidyanathan, Hyun-Wook Jin, Dhabaleswar K. Panda. ISPASS 2005, 280-289. |
| 137 | Exploiting NIC architectural support for enhancing IP-based protocols on high-performance networks. Hyun-Wook Jin, Pavan Balaji, Chuck Yoo, Jin-Young Choi, Dhabaleswar K. Panda. J. Parallel Distrib. Comput. (65): 1348-1365 (2005). |
| 136 | Designing a Portable MPI-2 over Modern Interconnects Using uDAPL Interface. Lei Chai, Ranjit Noronha, Prachi Gupta, G. Brown, Dhabaleswar K. Panda. PVM/MPI 2005, 200-208. |
| 135 | Efficient Hardware Multicast Group Management for Multiple MPI Communicators over InfiniBand. Amith R. Mamidala, Hyun-Wook Jin, Dhabaleswar K. Panda. PVM/MPI 2005, 388-398. |
| 134 | Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand. Wei Huang, Gopalakrishnan Santhanaraman, Hyun-Wook Jin, Dhabaleswar K. Panda. PVM/MPI 2005, 191-199. |
| 2004 |
| 133 | High performance MPI-2 one-sided communication over InfiniBand. Weihang Jiang, Jiuxing Liu, Hyun-Wook Jin, Dhabaleswar K. Panda, William Gropp, Rajeev Thakur. CCGRID 2004, 531-538. |
| 132 | Unifier: unifying cache management and communication buffer management for PVFS over InfiniBand. Jiesheng Wu, Pete Wyckoff, Dhabaleswar K. Panda, Robert B. Ross. CCGRID 2004, 523-530. |
| 131 | Designing high performance DSM systems using InfiniBand features. Ranjit Noronha, Dhabaleswar K. Panda. CCGRID 2004, 467-474. |
| 130 | NIC-based offload of dynamic user-defined modules for Myrinet clusters. Adam Wagner, Hyun-Wook Jin, Dhabaleswar K. Panda, Rolf Riesen. CLUSTER 2004, 205-214. |
| 129 | Efficient Barrier and Allreduce on InfiniBand clusters using multicast and adaptive algorithms. Amith R. Mamidala, Jiuxing Liu, Dhabaleswar K. Panda. CLUSTER 2004, 135-144. |
| 128 | State of InfiniBand in designing HPC clusters, storage/file systems, and datacenters. Dhabaleswar K. Panda. CLUSTER 2004, 3. |
| 127 | Scalable, high-performance NIC-based all-to-all broadcast over Myrinet/GM. Weikuan Yu, Dhabaleswar K. Panda, Darius Buntinas. CLUSTER 2004, 125-134. |
| 126 | Towards provision of quality of service guarantees in job scheduling. Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda. CLUSTER 2004, 245-254. |
| 125 | Fast and Scalable Startup of MPI Programs in InfiniBand Clusters. Weikuan Yu, Jiesheng Wu, Dhabaleswar K. Panda. HiPC 2004, 440-449. |
| 124 | Efficient and Scalable All-to-All Personalized Exchange for InfiniBand-Based Clusters. Sayantan Sur, Hyun-Wook Jin, Dhabaleswar K. Panda. ICPP 2004, 275-282. |
| 123 | Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study. Qingda Lu, Jiesheng Wu, Dhabaleswar K. Panda, P. Sadayappan. ICPP Workshops 2004, 538-545. |
| 122 | Microbenchmark Performance Comparison of High-Speed Cluster Interconnects. Jiuxing Liu, B. Chandrasekaran, Weikuan Yu, Jiesheng Wu, Darius Buntinas, Sushmitha P. Kini, Dhabaleswar K. Panda, Pete Wyckoff. IEEE Micro (24): 42-51 (2004). |
| 121 | Optimisation and performance evaluation of mechanisms for latency tolerance in remote memory access communication on clusters. Jarek Nieplocha, Vinod Tipparaju, Manojkumar Krishnan, Gopalakrishnan Santhanaraman, Dhabaleswar K. Panda. IJHPCN (2): 198-209 (2004). |
| 120 | Application-bypass reduction for large-scale clusters. Adam Wagner, Darius Buntinas, Ron Brightwell, Dhabaleswar K. Panda. IJHPCN (2): 99-109 (2004). |
| 119 | High Performance RDMA-Based MPI Implementation over InfiniBand. Jiuxing Liu, Jiesheng Wu, Dhabaleswar K. Panda. International Journal of Parallel Programming (32): 167-198 (2004). |
| 118 | Implementing Efficient and Scalable Flow Control Schemes in MPI over InfiniBand. Jiuxing Liu, Dhabaleswar K. Panda. IPDPS 2004. |
| 117 | Design and Implementation of MPICH2 over InfiniBand with RDMA Support. Jiuxing Liu, Weihang Jiang, Pete Wyckoff, Dhabaleswar K. Panda, David Ashton, Darius Buntinas, William Gropp, Brian R. Toonen. IPDPS 2004. |
| 116 | Fast and Scalable MPI-Level Broadcast Using InfiniBand's Hardware Multicast Support. Jiuxing Liu, Amith R. Mamidala, Dhabaleswar K. Panda. IPDPS 2004. Web SearchBibTeXDownload |
| 115 | Efficient and Scalable Barrier over Quadrics and Myrinet with a New NIC-Based Collective Message Passing Protocol. Weikuan Yu, Darius Buntinas, Richard L. Graham, Dhabaleswar K. Panda. IPDPS (cs.DC/0402027) (2004). Web SearchBibTeXDownload |
| 114 | Host-Assisted Zero-Copy Remote Memory Access Communication on InfiniBand. Vinod Tipparaju, Gopalakrishnan Santhanaraman, Jarek Nieplocha, Dhabaleswar K. Panda. IPDPS 2004. Web SearchBibTeXDownload |
| 113 | High Performance Implementation of MPI Derived Datatype Communication over InfiniBand. Jiesheng Wu, Pete Wyckoff, Dhabaleswar K. Panda. IPDPS 2004. Web SearchBibTeXDownload |
| 112 | Sockets Direct Protocol over InfiniBand in clusters: is it beneficial?. Pavan Balaji, Sundeep Narravula, Karthikeyan Vaidyanathan, Savitha Krishnamoorthy, Jiesheng Wu, Dhabaleswar K. Panda. ISPASS 2004, 28-35. Web SearchBibTeXDownload |
| 111 | Zero-Copy MPI Derived Datatype Communication over InfiniBand. Gopalakrishnan Santhanaraman, Jiesheng Wu, Dhabaleswar K. Panda. PVM/MPI 2004, 47-56. Web SearchBibTeXDownload |
| 110 | Efficient Implementation of MPI-2 Passive One-Sided Communication on InfiniBand Clusters. Weihang Jiang, Jiuxing Liu, Hyun-Wook Jin, Dhabaleswar K. Panda, Darius Buntinas, Rajeev Thakur, William D. Gropp. PVM/MPI 2004, 68-76. Web SearchBibTeXDownload |
| 109 | Building Multirail InfiniBand Clusters: MPI-Level Design and Performance Evaluation. Jiuxing Liu, Abhinav Vishnu, Dhabaleswar K. Panda. SC 2004, 33. Web SearchBibTeXDownload |
| 2003 |
| 108 | Application-Bypass Broadcast in MPICH over GM. Darius Buntinas, Dhabaleswar K. Panda, Ron Brightwell. CCGRID 2003, 2-9. Web SearchBibTeXDownload |
| 107 | Optimizing Mechanisms for Latency Tolerance in Remote Memory Access Communication on Clusters. Jarek Nieplocha, Vinod Tipparaju, Manojkumar Krishnan, Gopalakrishnan Santhanaraman, Dhabaleswar K. Panda. CLUSTER 2003, 138-147. Web SearchBibTeXDownload |
| 106 | Designing Next Generation Clusters with Infiniband: Opportunities and Challenges. Dhabaleswar K. Panda. CLUSTER 2003. Web SearchBibTeXDownload |
| 105 | Supporting Efficient Noncontiguous Access in PVFS over InfiniBand. Jiesheng Wu, Pete Wyckoff, Dhabaleswar K. Panda. CLUSTER 2003, 344. Web SearchBibTeXDownload |
| 104 | Application-Bypass Reduction for Large-Scale Clusters. Adam Wagner, Darius Buntinas, Dhabaleswar K. Panda, Ron Brightwell. CLUSTER 2003, 404-411. Web SearchBibTeXDownload |
| 103 | MIBA: A Micro-Benchmark Suite for Evaluating InfiniBand Architecture Implementations. B. Chandrasekaran, Pete Wyckoff, Dhabaleswar K. Panda. Computer Performance Evaluation / TOOLS 2003, 29-46. Web SearchBibTeXDownload |
| 102 | Design and Implementation of MPICH2 over InfiniBand with RDMA Support. Jiuxing Liu, Weihang Jiang, Pete Wyckoff, Dhabaleswar K. Panda, David Ashton, Darius Buntinas, William Gropp, Brian R. Toonen. CoRR (cs.AR/0310059) (2003). Web SearchBibTeXDownload |
| 101 | Exploiting Non-blocking Remote Memory Access Communication in Scientific Benchmarks. Vinod Tipparaju, Manojkumar Krishnan, Jarek Nieplocha, Gopalakrishnan Santhanaraman, Dhabaleswar K. Panda. HiPC 2003, 248-258. Web SearchBibTeXDownload |
| 100 | QoS-Aware Middleware for Cluster-Based Servers to support Interactive and Resource-Adaptive Applications. S. Senapathi, B. Chandrasekaran, D. Stredney, H. Shen, Dhabaleswar K. Panda. HPDC 2003, 205-215. Web SearchBibTeXDownload |
| 99 | Impact of High Performance Sockets on Data Intensive Applications. Pavan Balaji, Jiesheng Wu, Tahsin M. Kurç, Ümit V. Çatalyürek, Dhabaleswar K. Panda, Joel H. Saltz. HPDC 2003, 24-33. Web SearchBibTeXDownload |
| 98 | PVFS over InfiniBand: Design and Performance Evaluation. Jiesheng Wu, Pete Wyckoff, Dhabaleswar K. Panda. ICPP 2003, 125-132. Web SearchBibTeXDownload |
| 97 | High Performance and Reliable NIC-Based Multicast over Myrinet/GM-2. Weikuan Yu, Darius Buntinas, Dhabaleswar K. Panda. ICPP 2003, 197-204. Web SearchBibTeXDownload |
| 96 | High performance RDMA-based MPI implementation over InfiniBand. Jiuxing Liu, Jiesheng Wu, Sushmitha P. Kini, Pete Wyckoff, Dhabaleswar K. Panda. ICS 2003, 295-304. Web SearchBibTeXDownload |
| 95 | Efficient Collective Operations Using Remote Memory Operations on VIA-Based Clusters. Rinku Gupta, Pavan Balaji, Dhabaleswar K. Panda, Jarek Nieplocha. IPDPS 2003, 46. Web SearchBibTeXDownload |
| 94 | Implementing TreadMarks over GM on Myrinet: Challenges, Design Experience, and Performance Evaluation. Ranjit Noronha, Dhabaleswar K. Panda. IPDPS 2003, 200. Web SearchBibTeXDownload |
| 93 | Optimizing Synchronization Operations for Remote Memory Communication Systems. Darius Buntinas, Amina Saify, Dhabaleswar K. Panda, Jarek Nieplocha. IPDPS 2003, 199. Web SearchBibTeXDownload |
| 92 | Fast Collective Operations Using Shared and Remote Memory Access Protocols on Clusters. Vinod Tipparaju, Jarek Nieplocha, Dhabaleswar K. Panda. IPDPS 2003, 84. Web SearchBibTeXDownload |
| 91 | QoPS: A QoS Based Scheme for Parallel Job Scheduling. Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda. JSSPP 2003, 252-268. Web SearchBibTeXDownload |
| 90 | Towards NIC-based intrusion detection. Matthew Eric Otey, Srinivasan Parthasarathy, Amol Ghoting, G. Li, Sundeep Narravula, Dhabaleswar K. Panda. KDD 2003, 723-728. Cited by 23Web SearchBibTeXDownload |
| 89 | Fast and Scalable Barrier Using RDMA and Multicast Mechanisms for InfiniBand-Based Clusters. Sushmitha P. Kini, Jiuxing Liu, Jiesheng Wu, Pete Wyckoff, Dhabaleswar K. Panda. PVM/MPI 2003, 369-378. Web SearchBibTeXDownload |
| 88 | Scalable NIC-based Reduction on Large-scale Clusters. Adam Moody, Juan Fernández, Fabrizio Petrini, Dhabaleswar K. Panda. SC 2003, 59. Web SearchBibTeXDownload |
| 87 | Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics. Jiuxing Liu, B. Chandrasekaran, Jiesheng Wu, Weihang Jiang, Sushmitha P. Kini, Weikuan Yu, Darius Buntinas, Pete Wyckoff, Dhabaleswar K. Panda. SC 2003, 58. Web SearchBibTeXDownload |
| 2002 |
| 86 | Efficient Barrier Using Remote Memory Operations on VIA-Based Clusters. Rinku Gupta, Vinod Tipparaju, Jarek Nieplocha, Dhabaleswar K. Panda. CLUSTER 2002, 83. Web SearchBibTeXDownload |
| 85 | High Performance User Level Sockets over Gigabit Ethernet. Pavan Balaji, Piyush Shivam, Pete Wyckoff, Dhabaleswar K. Panda. CLUSTER 2002, 179-186. Web SearchBibTeXDownload |
| 84 | Impact of On-Demand Connection Management in MPI over VIA. Jiesheng Wu, Jiuxing Liu, Pete Wyckoff, Dhabaleswar K. Panda. CLUSTER 2002, 152-159. Web SearchBibTeXDownload |
| 83 | Tutorial 2: InfiniBand Architecture and Where it is Headed. Dhabaleswar K. Panda. Hot Interconnects 2002, 157-158. Web SearchBibTeXDownload |
| 82 | A Reliable Multicast Algorithm for Mobile Ad Hoc Networks. Thiagaraja Gopalsamy, Mukesh Singhal, Dhabaleswar K. Panda, P. Sadayappan. ICDCS 2002, 563-570. Web SearchBibTeXDownload |
| 81 | HIPIQS: A High-Performance Switch Architecture Using Input Queuing. Rajeev Sivaram, Craig B. Stunkel, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (13): 275-289 (2002). Web SearchBibTeXDownload |
| 80 | Workshop Introduction. Mohamed Ould-Khaoua, Fikret Erçal, Andreas Uhl, Peter Graham, Wah Chiu. IPDPS 2002. Web SearchBibTeXDownload |
| 79 | Protocols and Strategies for Optimizing Performance of Remote Memory Operations on Clusters. Jarek Nieplocha, Vinod Tipparaju, Amina Saify, Dhabaleswar K. Panda. IPDPS 2002. Web SearchBibTeXDownload |
| 78 | Can User-Level Protocols Take Advantage of Multi-CPU NICs?. Piyush Shivam, Pete Wyckoff, Dhabaleswar K. Panda. IPDPS 2002. Web SearchBibTeXDownload |
| 77 | MPI/IO on DAFS over VIA: Implementation and Performance Evaluation. Jiesheng Wu, Dhabaleswar K. Panda. IPDPS 2002. Web SearchBibTeXDownload |
| 76 | Active Network Interface: Opportunities and Challenges. Dhabaleswar K. Panda. LCN 2002, 605-608. Web SearchBibTeXDownload |
| 75 | Feature estimation for efficient streaming. Naveen Kumar Polapally, Raghu Machiraju, Dhabaleswar K. Panda. VolVis 2002, 107-114. Web SearchBibTeXDownload |
| 2001 |
| 74 | Implementing TreadMarks over VIA on Myrinet and Gigabit Ethernet: Challenges, Design Experience, and Performance Evaluation. Mohammad Banikazemi, Jiuxing Liu, Dhabaleswar K. Panda, P. Sadayappan. ICPP 2001, 167-174. Web SearchBibTeXDownload |
| 73 | NIC-Based Rate Control for Proportional Bandwidth Allocation in Myrinet Clusters. Abhishek Gulati, Dhabaleswar K. Panda, P. Sadayappan, Pete Wyckoff. ICPP 2001, 305-312. Web SearchBibTeXDownload |
| 72 | Hybrid Algorithms for Complete Exchange in 2D Meshes. N. S. Sundar, D. N. Jayasimha, Dhabaleswar K. Panda, P. Sadayappan. IEEE Trans. Parallel Distrib. Syst. (12): 1201-1218 (2001). Web SearchBibTeXDownload |
| 71 | MPI-LAPI: An Efficient Implementation of MPI for IBM RS/6000 SP Systems. Mohammad Banikazemi, Rama Govindaraju, Robert Blackmore, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (12): 1081-1093 (2001). Web SearchBibTeXDownload |
| 70 | Efficient Multicast on Irregular Switch-Based Cut-Through Networks with Up-Down Routing. Ram Kesavan, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (12): 808-828 (2001). Web SearchBibTeXDownload |
| 69 | Architectural Support for Efficient Multicasting in Irregular Networks. Rajeev Sivaram, Ram Kesavan, Dhabaleswar K. Panda, Craig B. Stunkel. IEEE Trans. Parallel Distrib. Syst. (12): 489-513 (2001). Web SearchBibTeXDownload |
| 68 | Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations. Amit Singhal, Mohammad Banikazemi, P. Sadayappan, Dhabaleswar K. Panda. IPDPS 2001, 71. Web SearchBibTeX |
| 67 | Fast NIC-Based Barrier over Myrinet/GM. Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan. IPDPS 2001, 52. Web SearchBibTeX |
| 66 | Performance Benefits of NIC-Based Barrier on Myrinet/GM. Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan. IPDPS 2001, 166. Web SearchBibTeX |
| 65 | VIBe: A Micro-benchmark Suite for Evaluating Virtual Interface Architecture (VIA) Implementations. Mohammad Banikazemi, Jiuxing Liu, S. Kutlug, P. Sadayappan, H. Shah, Dhabaleswar K. Panda. IPDPS 2001, 24. Web SearchBibTeX |
| 64 | Design Alternatives for Virtual Interface Architecture and an Implementation on IBM Netfinity NT Cluster. Mohammad Banikazemi, Bülent Abali, Lorraine Herger, Dhabaleswar K. Panda. J. Parallel Distrib. Comput. (61): 1512-1545 (2001). Web SearchBibTeXDownload |
| 63 | Adaptive Routing on the New Switch Chip for IBM SP Systems. Bülent Abali, Craig B. Stunkel, Jay Herring, Mohammad Banikazemi, Dhabaleswar K. Panda, Cevdet Aykanat, Yucel Aydogan. J. Parallel Distrib. Comput. (61): 1148-1179 (2001). Web SearchBibTeXDownload |
| 62 | EMP: zero-copy OS-bypass NIC-driven gigabit ethernet message passing. Piyush Shivam, Pete Wyckoff, Dhabaleswar K. Panda. SC 2001, 57. Web SearchBibTeXDownload |
| 2000 |
| 61 | Fast Collective Communication Algorithms for Reflective Memory Network Clusters. Vijay Moorthy, Dhabaleswar K. Panda, P. Sadayappan. CANPC 2000, 100-114. Web SearchBibTeXDownload |
| 60 | Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination Messages. Darius Buntinas, Dhabaleswar K. Panda, José Duato, P. Sadayappan. CANPC 2000, 115-129. Web SearchBibTeXDownload |
| 59 | Comparison and Evaluation of Design Choices for Implementing the Virtual Interface Architecture (VIA). Mohammad Banikazemi, Bülent Abali, Dhabaleswar K. Panda. CANPC 2000, 145-161. Web SearchBibTeXDownload |
| 58 | Can Scatter Communication Take Advantage of Multidestination Message Passing?. Mohammad Banikazemi, Dhabaleswar K. Panda. HiPC 2000, 204-211. Web SearchBibTeXDownload |
| 57 | Characterization and Enhancement of Static Mapping Heuristics for Heterogeneous Systems. Praveen Holenarsipur, Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan. HiPC 2000, 37-48. Web SearchBibTeXDownload |
| 56 | Characterization and Enhancement of Dynamic Mapping Heuristics for Heterogeneous Systems. Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan. ICPP Workshops 2000, 437. Web SearchBibTeXDownload |
| 55 | Balancing Web Server Load for Adaptable Video Distribution. Arindam Paul, Wu-chi Feng, Dhabaleswar K. Panda, P. Sadayappan. ICPP Workshops 2000, 469. Web SearchBibTeXDownload |
| 54 | Implementing Multidestination Worms in Switch-Based Parallel Systems: Architectural Alternatives and Their Impact. Rajeev Sivaram, Craig B. Stunkel, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (11): 794-812 (2000). Web SearchBibTeXDownload |
| 53 | Efficient Virtual Interface Architecture (VIA) Support for the IBM SP Switch-Connected NT Clusters. Mohammad Banikazemi, Vijay Moorthy, Dhabaleswar K. Panda, Lorraine Herger, Bülent Abali. IPDPS 2000, 33-42. Web SearchBibTeXDownload |
| 52 | Adaptive Routing in RS/6000 SP-Like Bidirectional Multistage Interconnection Networks. Mohammad Banikazemi, Dhabaleswar K. Panda, Craig B. Stunkel, Bülent Abali. IPDPS 2000, 43. Web SearchBibTeXDownload |
| 1999 |
| 51 | Low Latency Message-Passing for Reflective Memory Networks. Matthew G. Jacunski, Vijay Moorthy, Peter P. Ware, Manoj Pillai, Dhabaleswar K. Panda, P. Sadayappan. CANPC 1999, 211-224. Web SearchBibTeXDownload |
| 50 | Communication Modeling of Heterogeneous Networks of Workstations for Performance Characterization of Collective Operations. Mohammad Banikazemi, Jayanthi Sampathkumar, Sandeep Prabhu, Dhabaleswar K. Panda, P. Sadayappan. Heterogeneous Computing Workshop 1999, 125. Web SearchBibTeXDownload |
| 49 | Exploiting the Benefits of Multiple-Path Network DSM Systems: Architectural Alternatives and Performance Evaluation. Donglai Dai, Dhabaleswar K. Panda. IEEE Trans. Computers (48): 236-244 (1999). Web SearchBibTeXDownload |
| 48 | Multidestination Message Passing in Wormhole k-ary n-cube Networks with Base Routing Conformed Paths. Dhabaleswar K. Panda, Sanjay Singal, Ram Kesavan. IEEE Trans. Parallel Distrib. Syst. (10): 76-96 (1999). Web SearchBibTeXDownload |
| 47 | Multiple Multicast with Minimized Node Contention on Wormhole k-ary n-cube Networks. Ram Kesavan, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (10): 371-393 (1999). Web SearchBibTeXDownload |
| 46 | Implementing Efficient MPI on LAPI for IBM RS/6000 SP Systems: Experiences and Performance Evaluation. Mohammad Banikazemi, Rama Govindaraju, Robert Blackmore, Dhabaleswar K. Panda. IPPS/SPDP 1999, 183-190. Web SearchBibTeXDownload |
| 45 | Low-Latency Message Passing on Workstation Clusters using SCRAMNet. Vijay Moorthy, Matthew G. Jacunski, Manoj Pillai, Peter P. Ware, Dhabaleswar K. Panda, Thomas W. Page Jr., P. Sadayappan, V. Nagarajan, Johns Daniel. IPPS/SPDP 1999, 148-152. Web SearchBibTeXDownload |
| 44 | All-to-All Broadcast on Switch-Based Clusters of Workstations. Matthew G. Jacunski, P. Sadayappan, Dhabaleswar K. Panda. IPPS/SPDP 1999, 325-329. Web SearchBibTeXDownload |
| 1998 |
| 43 | Efficient Collective Communication on Heterogeneous Networks of Workstations. Mohammad Banikazemi, Vijay Moorthy, Dhabaleswar K. Panda. ICPP 1998, 460-467. Web SearchBibTeXDownload |
| 42 | Where to Provide Support for Efficient Multicasting in Irregular Networks: Network Interface or Switch?. Rajeev Sivaram, Ram Kesavan, Dhabaleswar K. Panda, Craig B. Stunkel. ICPP 1998, 452-459. Web SearchBibTeXDownload |
| 41 | Impact of Adaptivity on the Behaviour of Networks of Workstations under Bursty Traffic. Federico Silla, Manuel P. Malumbres, José Duato, Donglai Dai, Dhabaleswar K. Panda. ICPP 1998, 88-95. Web SearchBibTeXDownload |
| 40 | Efficient Broadcast and Multicast on Multistage Interconnection Networks Using Multiport Encoding. Rajeev Sivaram, Dhabaleswar K. Panda, Craig B. Stunkel. IEEE Trans. Parallel Distrib. Syst. (9): 1004-1028 (1998). Web SearchBibTeXDownload |
| 39 | Alleviating Consumption Channel Bottleneck in Wormhole-Routed k-ary n-Cube Systems. Debashis Basak, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (9): 481-496 (1998). Web SearchBibTeXDownload |
| 38 | HIPIQS: A High-Performance Switch Architecture Using Input Queuing. Rajeev Sivaram, Craig B. Stunkel, Dhabaleswar K. Panda. IPPS/SPDP 1998, 134-143. Web SearchBibTeXDownload |
| 37 | Designing communication strategies for heterogeneous parallel systems. Ravi Prakash, Dhabaleswar K. Panda. Parallel Computing (24): 2035-2052 (1998). Web SearchBibTeXDownload |
| 1997 |
| 36 | Multicast on Irregular Switch-Based Networks with Wormhole Routing. Ram Kesavan, Kiran Bondalapati, Dhabaleswar K. Panda. HPCA 1997, 48-57. Web SearchBibTeXDownload |
| 35 | Optimal Multicast with Packetization and Network Interface Support. Ram Kesavan, Dhabaleswar K. Panda. ICPP 1997, 370-377. Web SearchBibTeXDownload |
| 34 | How Much Does Network Contention Affect Distributed Shared Memory Performance?. Donglai Dai, Dhabaleswar K. Panda. ICPP 1997, 454. Web SearchBibTeXDownload |
| 33 | Bandwidth-Optimal Complete Exchange on Wormhole-Routed 2D/3D Torus Networks: A Diagonal-Propagation Approach. Yu-Chee Tseng, Ting-Hsien Lin, Sandeep K. S. Gupta, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (8): 380-396 (1997). Web SearchBibTeXDownload |
| 32 | A Reliable Hardware Barrier Synchronization Scheme. Rajeev Sivaram, Craig B. Stunkel, Dhabaleswar K. Panda. IPPS 1997, 274-280. Web SearchBibTeXDownload |
| 31 | Implementing Multidestination Worms in Switch-Based Parallel Systems: Architectural Alternatives and their Impact. Craig B. Stunkel, Rajeev Sivaram, Dhabaleswar K. Panda. ISCA 1997, 50-61. Web SearchBibTeXDownload |
| 30 | Special Issue on Workstation Clusters and Network-Based Computing: Guest Editors' Introduction. Dhabaleswar K. Panda, Lionel M. Ni. J. Parallel Distrib. Comput. (40): 1-3 (1997). Web SearchBibTeXDownload |
| 29 | Multicasting in Irregular Networks with Cut-Through Switches Using Tree-Based Multidestination Worms. Rajeev Sivaram, Dhabaleswar K. Panda, Craig B. Stunkel. PCRCW 1997, 39-54. Web SearchBibTeXDownload |
| 28 | How Can We Design Better Networks for DSM Systems?. Donglai Dai, Dhabaleswar K. Panda. PCRCW 1997, 171-184. Web SearchBibTeXDownload |
| 27 | Multicasting on Switch-Based Irregular Networks Using Multi-drop Path-Based Multidestination Worms. Ram Kesavan, Dhabaleswar K. Panda. PCRCW 1997, 217-230. Web SearchBibTeXDownload |
| 26 | Designing High-Performance Communication Subsystems: Top Five Problems to Solve and Five Problems Not to Solve During the Next Five Years (Panel). Dhabaleswar K. Panda. PCRCW 1997, 153-158. Web SearchBibTeXDownload |
| 25 | Simulation of Modern Parallel Systems: A CSIM-based Approach. Dhabaleswar K. Panda, Debashis Basak, Donglai Dai, Ram Kesavan, Rajeev Sivaram, Mohammad Banikazemi, Vijay Moorthy. Winter Simulation Conference 1997, 1013-1020. Web SearchBibTeXDownload |
| 1996 |
| 24 | Designing Processor-Cluster Based Systems: Interplay Between Organizations and Broadcasting Algorithms. Debashis Basak, Dhabaleswar K. Panda. ICPP, Vol. 1 1996, 271-274. Web SearchBibTeX |
| 23 | Reducing Cache Invalidation Overheads in Wormhole Routed DSMs Using Multidestination Message Passing. Donglai Dai, Dhabaleswar K. Panda. ICPP, Vol. 1 1996, 138-145. Web SearchBibTeX |
| 22 | Minimizing Node Contention in Multiple Multicast on Wormhole k-ary N-Cube Networks. Ram Kesavan, Dhabaleswar K. Panda. ICPP, Vol. 1 1996, 188-195. Web SearchBibTeX |
| 21 | A Trip-Based Multicasting Model in Wormhole-Routed Networks with Virtual Channels. Yu-Chee Tseng, Dhabaleswar K. Panda, Ten-Hwang Lai. IEEE Trans. Parallel Distrib. Syst. (7): 138-150 (1996). Web SearchBibTeXDownload |
| 20 | Designing Clustered Multiprocessor Systems under Packaging and Technological Advancements. Debashis Basak, Dhabaleswar K. Panda. IEEE Trans. Parallel Distrib. Syst. (7): 962-978 (1996). Web SearchBibTeXDownload |
| 19 | Hybrid Algorithms for Complete Exchange in 2D Meshes. N. S. Sundar, D. N. Jayasimha, Dhabaleswar K. Panda, P. Sadayappan. International Conference on Supercomputing 1996, 181-188. Web SearchBibTeXDownload |
| 18 | Benefits of Processor Clustering in Designing Large Parallel Systems: When and How?. Debashis Basak, Dhabaleswar K. Panda, Mohammad Banikazemi. IPPS 1996, 286-290. Web SearchBibTeXDownload |
| 1995 |
| 17 | Fast Barrier Synchronization in Wormhole k-ary n-cube Networks with Multidestination Worms. Dhabaleswar K. Panda. HPCA 1995, 200-209. Web SearchBibTeXDownload |
| 16 | Global reduction in wormhole k-ary n-cube networks with multidestination exchange worms. Dhabaleswar K. Panda. IPPS 1995, 652-659. Web SearchBibTeXDownload |
| 15 | An efficient scheme for complete exchange in 2D tori. Yu-Chee Tseng, Sandeep K. S. Gupta, Dhabaleswar K. Panda. IPPS 1995, 532-536. Web SearchBibTeXDownload |
| 1994 |
| 14 | Designing Large Hierarchical Multiprocessor Systems under Processor, Interconnection, and Packaging Advancements. Debashis Basak, Dhabaleswar K. Panda. ICPP (1) 1994, 63-66. Web SearchBibTeX |
| 13 | Clustering and Intra-Processor Scheduling for Explicitly-Parallel Programs on Distributed-Memory Systems. Vibha A. Dixit-Radiya, Dhabaleswar K. Panda. IPPS 1994, 609-616. Web SearchBibTeX |
| 12 | Multidestination Message Passing Mechanism Conforming to Base Wormhole Routing Scheme. Dhabaleswar K. Panda, Sanjay Singal, Pradeep Prabhakaran. PCRCW 1994, 131-145. Web SearchBibTeXDownload |
| 1993 |
| 11 | Impact of Multiple Consumption Channels on Wormhole Routed k-ary n-cube Networks. Shobana Balakrishnan, Dhabaleswar K. Panda. IPPS 1993, 163-167. Web SearchBibTeX |
| 10 | A Trip-Based Multicasting Model for Wormhole-Routed Networks with Virtual Channels. Yu-Chee Tseng, Dhabaleswar K. Panda. IPPS 1993, 276-283. Web SearchBibTeX |
| 9 | Barrier Synchronization in Distributed-Memory Multiprocessing Using Rendezvous Primitives. Sandeep K. S. Gupta, Dhabaleswar K. Panda. IPPS 1993, 501-505. Web SearchBibTeX |
| 8 | Task Assignment on Distributed-Memory Systems with Adaptive Wormhole Routing. Vibha A. Dixit-Radiya, Dhabaleswar K. Panda. SPDP 1993, 674-681. Web SearchBibTeX |
| 7 | Scalable Architectures with k-ary n-Cube Cluster-c organization. Debashis Basak, Dhabaleswar K. Panda. SPDP 1993, 780-787. Web SearchBibTeX |
| 1991 |
| 6 | Message Vectorization for Converting Multicomputer Programs to Shared-Memory Multiprocessors. Dhabaleswar K. Panda, Kai Hwang. ICPP (1) 1991, 204-211. Web SearchBibTeX |
| 5 | Architectural Design of Orthogonal Multiprocessor for Multidimensional Information Processing. Kai Hwang, Dhabaleswar K. Panda. J. Inf. Sci. Eng. (7): 459-485 (1991). Web SearchBibTeXDownload |
| 4 | Fast Data Manipulation in Multiprocessors Using Parallel Pipelined Memories. Dhabaleswar K. Panda, Kai Hwang. J. Parallel Distrib. Comput. (12): 130-145 (1991). Web SearchBibTeXDownload |
| 1990 |
| 3 | Algorithm-Driven Simulation and Performance Projection of a RISC-based Orthogonal Multiprocessor. Sharad Mehrotra, Chien-Ming Cheng, Kai Hwang, Michel Dubois, Dhabaleswar K. Panda. ICPP (3) 1990, 244-253. Web SearchBibTeX |
| 2 | OMP: a RISC-based multiprocessor using orthogonal-access memories and multiple spanning buses. Kai Hwang, Michel Dubois, Dhabaleswar K. Panda, S. Rao, Shisheng Shang, Aydin Üresin, W. Mao, H. Nair, M. Lytwyn, F. Hsieh, J. Liu, Sharad Mehrotra, Chien-Ming Cheng. ICS 1990, 7-22. Cited by 18Web SearchBibTeXDownload |
| 1988 |
| 1 | A Parallel-Serial Binary Arbitration Scheme for Collision-Free Multi-Access Techniques. Dhabaleswar K. Panda, T. Viswanathan. Computer Networks (15): 217-223 (1988). Web SearchBibTeXDownload |