Publications 

Books Authored  

B1.        W. Gropp, S. Huss-Lederman, A. Lumsdaine, E. Lusk, B. Nitzberg, W. Saphir, and M. Snir. MPI--The Complete Reference. Volume 2 - The MPI-2 Extensions. The MIT Press, 364 pp.,1998. (url)

B2.        M. Snir, S. Otto, S. Huss-Lederman, D. Walker and J. Dongarra. MPI--The Complete Reference. Volume 1 - The MPI-1 Core. The MIT Press, 1996. (url)

B3.         

Revisions 

B4.        M. Snir, S. Otto, S. Huss-Lederman, D. Walker and J. Dongarra. MPI--The Complete Reference. Volume 1 - The MPI-1 Core, 2nd edition. The MIT Press, 446 pp.,1998. (url)

Books Edited or Co-Edited

B5.        Susan L. Graham, Marc Snir and Cynthia A. Patterson (Editors). Getting up to Speed -- The Future of Supercomputing. National Academies Press, 306 pp., 2005. (url

Chapters in Books 

CB1.    Marc Snir. Distributed Memory Multiprocessor. In David Padua (ed.) Encyclopedia of Parallel Computing. Springer 2011.

CB2.    M. Snir. Reduce and Scan. In David Padua (ed.) Encyclopedia of Parallel Computing. Springer 2011.

CB3.    P. Corbett, D. Feitelson, S. Fineberg, Y.n Hsu, B. Nitzberg, .J.-P. Prost, M. Snir, B. Traversat, and P. Wong. Overview of the MPI-IO Parallel I/O Interface.  In Rajkumar Buyya, Hai Jin and Toni Cortes (eds.). High Performance Mass Storage and Parallel I/O: Technologies and Applications. Wiley, 477-487, November 2001. 

CB4.    Gottlieb, R. Grishman, C.K. Kruskal, K.P. McAuliffe, L. Rudolph, and M. Snir. The NYU Ultracomputer - Designing a MIMD, shared-memory parallel machine. IEEE 9th Ann. Int. Symp. on Computer Architecture, 1982. Reprinted in 25 years of the international symposia on Computer architecture (selected papers). 239-254, 1998. 

CB5.    P. Corbett, D. Feitelson, S. Fineberg, Y. Hsu, J.-P. Prost, M. Snir, B. Traversat and P. Wong. Overview of the MPI-IO parallel I/O interface. In . R. Jain J. Werth and J. C. Browne (eds.)  Input/Output in Parallel and Distributed Computer Systems. Kluwer, 127-146, June 1996. 

CB6.    T. Agerwala, J. Mirza and M. Snir. Designing a scalable parallel system: the IBM SP2. In J. Dongarra, L. Grandinetti, G. Joubert and J. Kowalik (eds.) High Performance Computing: Technology, Methods and Applications. Advances in Parallel Computing Series, Vol. 10, Elsevier, Amsterdam 1995. 

CB7.    Kruskal and M. Snir. Optimal interconnection networks of parallel processors: The importance of being square. In Y. Yemini (ed.) Current Advances in Distributed Computing and Computer Communications. 91-113, 1987.

.

Articles in Journals 

J1.         Babak Behzad, Surendra Byna, Prabhat, and Marc Snir. Optimizing I/O Performance of HPC Applications with Autotuning. ACM Trans. Parallel Computing 5(4) Article 15 (March 2019), 27 pages. (pdf)

J2.         Meng Hao, Weizhe Zhang, You Zhang, Marc Snir, Laurence T Yang. Automatic generation of benchmarks for I/O-intensive parallel applications, Journal of Parallel and Distributed Computing, 124. 1-13, 2019. (pdf)

J3.         S. Di, H. Guo, R. Gupta, E. R. Pershey, M. Snir  and  Franck Cappello. Exploring Properties and Correlations of Fatal Events in a Large-Scale HPC System. IEEE Transactions on Parallel and Distributed Systems 30(2) 36`-374 (2018). (pdf)

J4.         J. Calhoun, F. Cappello, L. N. Olson, M. Snir and W. D. Gropp. Exploring the Feasibility of Lossy Compression for PDE Simulations.   International Journal of High-Performance Computing Applications 33(2) 397-410 (2018) (pdf)

J5.         S. Seo, A. Amer, P. Balaji, C. Bordage, G. Bosilca, A. Brooks, P. Carns, A. Castello, D. Genet, T. Herault, P. Jindal, L. V. Kale, S. Krishnamoorthy, J. Lifflander, H. Lu, E. Meneses, M. Snir, Y. Sun, and P. Beckman. Argobots: A Lightweight, Low-Level Threading and Tasking Framework.  IEEE Transactions on Parallel and Distributed Systems 29(3) 512-526 (2018) (pdf)

J6.         H.-V. Dang, M. Snir and W. Gropp, Eliminating contention bottlenecks in multithreaded MPI, Parallel Computing (69) 1-23, 2017. (pdf)

J7.         W. Zhang, M. Hao, and M. Snir. Predicting HPC parallel program performance based on LLVM compiler. Cluster Computing 20, 1179-1192 (2017). (pdf)

J8.         M. Dorier, G. Antoniu , F. Cappello, R. Cisneros, O. Yildiz, ,  S. Ibrahim, T. Peterka and L. Orf. Damaris: Addressing Performance Variability in Data Management for Post-Petascale Simulations. ACM Transactions on Parallel Processing 3(3) 15:1-15:43 (2016) (pdf)

J9.         J. Dinan, P. Balaji, D. Goodell, D. Miller, M. Snir and R. Thakur, Enabling MPI Interoperability Through Flexible Communication Endpoints. International Journal of High-Performance Computing 28(14), 390-405 (2014). (pdf)

J10.      Dubey, S. R. Brandt, R. Brower, M. Giles, P. Hovland, D.Q. Lamb, F. Löffler, B. Norris, B.W. O'Shea, C. Rebbi, M. Snir, R. Thakur, R and P. Tzeferacos.. Software Abstractions and Methodologies for HPC Simulation Codes on Future Architectures. Journal of Open Research Software 2(1) p.e14, July 2014 DOI: http://dx.doi.org/10.5334/jors.aw (pdf)

J11.      J. Zhang, B. Behzad, and M. Snir, Design of a Multithreaded Barnes-Hut Algorithm for Multicore Clusters. IEEE Transaction on Parallel and Distributed Computing, 26(7), 1861-1873 (2015). (pdf)

J12.      F. Cappello, A. Geist, W. Gropp, S. Kale, W. Kramer and M. Snir. Toward Exascale Resilience: 2014 update. Supercomputing Frontiers and Innovations, 1(1), 4-27 (2014) (pdf)

J13.      M. Snir, R.W. Wisniewski, J.A. Abraham, S.V. Adve, S. Bagchi, P. Balaji, J. Belak, P. Bose, F. Cappello, B. Carlson, A.A. Chien, P. Coteus, N.A. Debardeleben, P. Diniz, C. Engelmann, M. Erez, S. Fazzari, A. Geist, R. Gupta, F. Johnson, S. Krishnamoorthy, S. Leyffer, D. Liberty, S. Mitra, T. Munson, R. Schreiber, J. Stearley and E.V. Hensbergen. Addressing Failures in Exascale Computing. The International Journal of High-Performance Computing Application, (28)2, 127 – 171 (2014). (pdf)

J14.      S. Li, T. Hoefler, C. Hu and M. Snir, Improved MPI collectives for MPI processes in shared address spaces. Cluster Computing, 7(4), 11390-1155 (2014).  (pdf)

J15.      W. Gropp and M. Snir. Programming for Exascale Computers, Journal of Computing in Science and Engineering, (15)6, 27-35 (2013). (pdf)

J16.      Gainaru, F. Cappello, M. Snir and W. Kramer. Failure prediction for HPC systems and applications Current situation and open issues. International Journal of High-Performance Computing Applications, 27(3), 273-282 (2013). (pdf)

J17.      F. Cappello, A.Geist, W. Gropp, L. Kale, W. Kramer and M. Snir. Toward Exascale Resilience. International Journal of High Performance Computing Applications 23, 374-388 (2009). (pdf)

J18.      B. Catanzaro, A. Fox, K. Keutzer, D. Patterson, B.-Y. Su, M. Snir, K.Olukotum, P. Hanrahan and H. Chafi. Ubiquitous Parallel Computing from Berkeley, Illinois and Stanford. IEEE Micro 3(4), 41-45 (2010). (pdf)

J19.      W. Gropp and M. Snir. On the Need for a Consortium of Capability Centers. The International Journal of High-Performance Computing Applications, 23(4), 413-420 (2009). (pdf)

J20.      F. Cappello, A. Geist, W. Gropp, L. Kale, W. Kramer, and M. Snir. Toward Exascale Resilience. The International Journal of High-Performance Computing Applications, 23(4) 374-388 (2009). (pdf)

J21.      N. V. Sahinidis, M.T. Harandi, M.T. Heath, L. Murphy, M. Snir, R.P. Wheeler and C.F. Zukoski. Establishing a master's degree programme in Bioinformatics: challenges and opportunities. IEE Proceedings Systems Biology, 152(4), 269-275 (2005). (pdf)

J22.      M. Snir and D. A. Bader. A framework for measuring supercomputer productivity. International Journal for High Performance Computing Applications, 18, 399-416 (2004). (pdf )

J23.      M. Snir. A note on n-body computations with cutoffs. Theory of Computing Systems 37, 295-318 (2004). (pdf)

J24.      Almasi, G.S., Cascaval, C., Castanos, J.G., Denneau, M., Donath, W., Eleftheriou, M., Giampapa, M., Ho, H., Lieber, D., Moreira, J.E., Newns, D., Snir, M. and Warren, H.S., Jr. Demonstrating the scalability of a molecular dynamics application on a Petaflops computer. International Journal of Parallel Programming 30(4), 317-351 (2002). (pdf)

J25.      F. Allen, G. Almasi, W. Andreoni, D. Beece, B. J. Berne, A. Bright, J. Brunheroto, C. Cascaval, J. Castanos, P. Coteus, P. Crumley, A. Curioni, M. Denneau, W. Donath, M. Eleftheriou, B. Fitch, B. Fleischer, C. J. Georgiou, R. Germain, M. Giampapa, D. Gresh, M. Gupta, R. Haring, H. Ho, P. Hochschild, S. Hummel, T. Jonas, D. Lieber, G. Martyna, K. Maturu, J. Moreira, D. Newns, M. Newton, R. Philhower, T. Picunko, J. Pitera, M. Pitman, R. Rand, A. Royyuru, V. Salapura, A. Sanomiya, R. Shah, Y. Sham, S. Singh, M. Snir, F. Suits, R. Swetz, W. C. Swope, N. Vishnumurthy, T. J. C. Ward, H. Warren, and R. Zhou. Blue Gene: a vision for protein science using a petaflop supercomputer. IBM System Journal 40(2), 310-327 (2001). (pdf)

J26.      E. Demaine, I. Foster, K. Kesselman and M. Snir. Generalized Communicators in the Message Passing Interface. IEEE Trans. on Parallel and Distributed Systems 12(6), 610-616 (2001). (pdf)

J27.      M. D. Theys, S. Ali, H. J. Siegel, M. Chandy, K. Hwang, K. Kennedy, L. Sha, K. G. Shin, M. Snir, L. Snyder and T. Sterling. What Are the Top Ten Most Influential Parallel and Distributed Processing Concepts of the Past Millenium? Journal of Parallel and Distributed Computing 61, 1827–1841 (2001). (pdf)

J28.      J. E. Moreira, S. P. Midkiff, M. Gupta, P. V. Artigas, M. Snir, and R. D. Lawrence. Java programming for high-performance numerical computing. IBM Systems Journal 39(1), 21-56 (2000). (pdf)

J29.      S.P. Midkiff, J. E. Moreira and M. Snir. Optimizing array reference checking in Java programs. IBM Systems Journal 37(3), 409-453 (1998). (pdf)

J30.      E. Upfal, S. Pelegrin and M. Snir. Randomized routing with shorter paths. IEEE Trans. on Parallel and Distributed Systems 7(4), 356-362 (1996). (pdf)

J31.      J.J. Dongarra, S.W. Otto, M. Snir, and D.W. Walker. A Message Passing Standard for MPP and Workstations. Comm. of the ACM 39(7), 84-90 (1996). (pdf)

J32.      V. Bala, J. Bruck, R. Cypher, P. Elustondo, A. Ho, C-T. Ho, V. Bala, S. Kipnis, and M. Snir. CCL: A portable and tunable collective communication library for scalable parallel computers. IEEE Trans. on Parallel and Distributed Systems 6(2), 154-164 (1995). (pdf)

J33.      P.F. Corbett, D.G. Feitelson, J-P. Prost, G.S. Almasi, S.J. Baylor, A.S. Bolmarchich, Y. Hsu, J. Satran, M. Snir, R. Colao, B. Herr, J. Kavaky, T. R. Morgan, and A. Zlotek. Parallel File System for the IBM SP Computers. IBM Systems Journal 34(2), 222-248 (1995). (pdf)

J34.      D. Frye, K. Gildea, P. Hochschild, and M. Snir. The communication software and parallel environment of the IBM SP2. IBM Systems Journal 34(2), 205-221 (1995). (pdf)

J35.      T. Agerwala, J. Martin, J. Mirza, D. Sadler, D. Dias, and M. Snir. SP2 System architecture. IBM Systems Journal 34(2), 152-184 (1995). Republished in Turning Points in Computing: 1962-1999. IBM Systems Journal 38(2), 414-446 (1999). (pdf)

J36.      B. Schieber and M. Snir. Calling names on nameless networks. Information and Computation 113(1), 80-101 (1994). (pdf)

J37.      V. Bala, J. Bruck, R. Bryant, B. Cypher, P. de Jong, P. Elustondo, D. Frye, A. Ho, C-T Ho, G. Irwin, S. Kipnis, R. Lawrence and M. Snir. The IBM External User Interface for Scalable Parallel Systems. Parallel Computing 20(4), 445-462 (1994). (pdf)

J38.      Message Passing Interface Forum. MPI: a message-passing interface standard. Int. J. of Supercomputer Applications and High-Performance Computing, 8(3/4), 167-414 (1994). (pdf)

J39.      P. Raghavan and M. Snir. Memory versus randomization in on-line algorithms. IBM J. of Research and Development 38(6), 683-708 (1994). (pdf)

J40.      D. Coppersmith, P. G. Doyle, P. Raghavan and M. Snir. Random walks on weighted graphs, and applications to on-line algorithms. J. ACM 40(3), 421-453 (1993). (pdf)

J41.      High Performance FORTRAN Forum. High performance FORTRAN language specification. Scientific Programming 2(1), 1-170 (1993). (pdf)

J42.      C. Kruskal and M. Snir. Cost-performance tradeoffs for communication networks. Discrete Applied Mathematics 37-38, 359-385 (1992). (pdf)

J43.      D. Zernik, M. Snir and D. Malki. Using visualization tools to understand concurrency. IEEE Software 9(3), 87-92 (1992). (pdf)

J44.      C.M. Pancake, R. Bailey, D. Barkai, B. Smith, J. Brandenburg, J. Dongarra, M. Kalos, M. Snir, and D. Gelenter.  What should we expect from parallel language standards (editorial), International Journal of Supercomputer Applications, 6(1), Spring 1991, 112-117. (pdf)

J45.      M. Snir. Size-depth tradeoffs for monotone arithmetic circuits. Theoretical Computer Science 82(1), 85-93 (1991). (pdf)

J46.      H. Attiya and M. Snir. Better computing on the anonymous ring. J. Algorithms 12(2), 204-238 (1991). (pdf)

J47.      Aggarwal, A. K. Chandra and M. Snir. Communication Complexity of PRAMs. Theoretical Computer Science 71, 3-28 (1990). (pdf)

J48.      C. Kruskal, L. Rudolph and M. Snir. A complexity theory of efficient parallel algorithms. Theoretical Computer Science 71, 95-132 (1990). (pdf)

J49.      C. Kruskal, L. Rudolph and M. Snir. Techniques for parallel manipulation of sparse matrices. Theoretical Computer Science 64, 135-157 (1989). (pdf)

J50.      C. Kruskal, L. Rudolph and M. Snir. Efficient parallel algorithms for graph problems. Algorithmica 5, 43-64 (1990). (pdf)

J51.      D. Shasha and M. Snir. Efficient and correct execution of parallel programs that share memory. ACM Transactions on Programming Languages and Systems 10, 282-312 (1988). (pdf)

J52.      H. Attiya, M. Snir and M. Warmuth. Computing in an anonymous ring. J. ACM, 35(4), 845-875 (1988). (pdf)

J53.      C. Kruskal, L. Rudolph and M. Snir. Efficient synchronization on multiprocessors with shared memory. Transactions on Programming Languages and Systems 10(4), 579-601 (1988). (pdf)

J54.      C. Kruskal, M. Snir, and A. Weiss. On the distribution of waiting times in clocked multistage interconnection networks. IEEE Trans. on Computers 37(11), 1337-1353 (1988). (pdf)

J55.      Marc Snir. Exact Balancing is not Always Good. Information Processing Letters 22, 99-102 (1986). (pdf)

J56.      M. Snir. Depth-size tradeoffs for parallel prefix computation. Journal of Algorithms 7, 185-201 (1986). (pdf)

J57.      C. Kruskal and M. Snir. A Unified Theory of Interconnection Network Structure. Theoretical Computer Science 48, 75-94 (1986). (pdf)

J58.      C. Kruskal, L. Rudolph and M. Snir. The power of parallel prefix. IEEE Trans. on Computers C-35, 965-968 (1985). (pdf)

J59.      M. Snir. On parallel searching. SIAM Journal on Computing 14, 688-708 (1985). (pdf)

J60.      M. Snir. Lower bounds on probabilistic decision trees. Theoretical Computer Science 38, 69-82 (1985). (pdf)

J61.      U. Manber, S. Moran and M. Snir. Applications of Ramsey's theorem to decisions tree complexity. J. ACM 32, 938-949 (1985). (pdf)

J62.      Gottlieb, R. Grishman, C.K. Kruskal, K.P. McAuliffe, L. Rudolph, and M. Snir. The NYU Ultracomputer - Designing a MIMD, shared-memory parallel computer. IEEE Trans. on Computers C-32, 175-189 (1983). (pdf)

J63.      Y. Perl and M. Snir. Circuit partitioning with size and connection constraints. Networks 13, 365-376 (1983). (pdf)

J64.      JC.K. Kruskal and M. Snir. The performance of multistage interconnection networks for multiprocessors. IEEE Trans. on Computers C-32, 1091-1098 (1983). (pdf)

J65.      M. Jerrum and M. Snir. Some exact complexity results for straight-line computations over semirings. J. ACM 29(3), 874-897 (1982). (pdf)

J66.      M. Snir. Comparison between linear forms can help. Theoretical Computer Science 19, 321-330 (1982). (pdf)

J67.      H. Gaifman and M. Snir. Probabilities over rich languages, testing and randomness. J. of Symbolic Logic 47, 495-548 (1982). (pdf)

J68.      M. Snir. On the complexity of simplifying quadratic forms. Inf. Proc. Letters 12, 217-220 (1981). (pdf)

J69.      E. Shamir and M. Snir. On the depth complexity of formulas. Mathematical System Theory 13, 301-322 (1980). (pdf)

J70.      M. Snir. The covering problem of complete uniform hypergraphs. Discrete Math. 27, 103-105 (1979). (pdf)

J71.      M. Snir and A. Barak. A direct approach to the parallel evaluation of rational expressions with a small number of processors. IEEE Trans. on Computers C-26, 933-937 (1977). (pdf)

Articles in Conference Proceedings      

  

C1.        C. Wang, J. Sun, M. Snir, K. Mohror and E. Gonsiorowski. Recorder 2.0: Efficient Parallel I/O Tracing and Analysis. The IEEE International Workshop on High-Performance Storage, May 2020. (pdf)

C2.        N. Dryden, N. Maruyama, T. Moon, T. Benson, M. Snir, B. Van Essen. Channel and Filter Parallelism for Large-Scale CNN Training. The Supercomputing Conference (SC19) Denver 2019. (pdf)

C3.        R. Dathathri, G. Gill, L. Huang,    V. Jatala, K. Pingali, Keshav, V-K Nandivada, H-V Dang, and M. Snir. Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics. The 28th International Conference on Parallel Architectures and Compilation Techniques (PACT2019), Seattle, Sept 2019. (pdf)

C3.        S. Di, H. Guo, E. Pershey, M. Snir and F. Cappello. Characterizing and Understanding HPC Job Failures over the 2K-day Life of IBM Blue Gene/Q System. The 49th IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2019), 473-484, June 2019,  (pdf) 

C4.        N. Dryden, N. Maruyama, T. Benson, T. Moon, M. Snir and B. Van Essen. Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained Parallelism. 33rd IEEE International Parallel and Distributed Processing Symposium, 210-220, May 2019. (pdf)

C5.        S. Leyffer, S. Wild, M. Fagan, M. Snir, K. Palem, K. Yoshii, and H. Finkel. Doing Moore with Less--Leapfrogging Moore's Law with Inexactness for Supercomputing. The 3rd International Workshop on Post Moore's Era Supercomputing (PMES 2018), Nov 2018. (pdf)

C6.        N. Dryden, N. Maruyama. T. Moon, T. Benson, A. Yoo, M. Snir, B. Van Essen. Aluminum: An Asynchronous, GPU-aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC systems. 4th Workshop on Machine Learning in HPC Environments (MNLHPC18), Nov. 2018. (pdf)

C7.        C. Wang, N. Dryden, F. Cappello and Marc Snir. Neural network based silent error detector. IEEE International Conference on Cluster Computing (Cluster 2018), Belfast UK, 168-178, September 2018.  (best paper) (pdf) 

C8.        H-V Dang and M. Snir, FULT: Fast User-Level Thread Scheduling Using Bit-Vectors. 7th International Conference on Parallel Processing (ICPP 2018), Aug 2018. (pdf)

C9.        R. Dathathri, G. Gill, L. Hoang, HV Dang, A. Brooks, N. Dryden, A. Lenharth, M. Snir and K. Pingali, Gluon: A Communication Optimizing Substrate for Distributed Heterogeneous Graph Analytics. Symposium on Programming Language Design and Implementation (PLDI), 752-768, June 2018. (pdf)

C10.     H-V Dang, A. Brooks, N. Dryden, M. Snir, R. Dathathri, G. Gill, A. Lenharth, L. Hoang, K. Pingali, A Lightweight Communication Runtime for Distributed Graph Analytics, 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 980-989, May 2018. (pdf) 

C11.     S. Di, R. Gupta, M. Snir, and F. Cappello. LogAider: A tool for mining potential correlations in HPC Log Events. 17th International Symposium on Cluster, Cloud and Grid Computing (CCGRID), 442-451, 2017. (pdf)

C12.     J. Calhoun, M. Snir, L. Olson, and W. Gropp, Towards a More Complete Understanding of SDC Propagation. 26th International Symposium on Parallel and Distributed Computing, 131-142, 2017. (pdf)

C13.     L. Bautista-Gomez, A. Gainaru, S. Perarnau, D. Tiwari, S. Gupta, C. Engelmann, F. Cappello, and M. Snir. Reducing waste in extreme scale systems through introspective analysis. 2016 IEEE International Parallel and Distributed Processing Symposium, 212-221, May 2016. (pdf)

C14.     M. Fagan, J. Schlachter, K. Yoshii, S. Leyffer, K. Palem, M. Snir, S. M. Wild, and C. Enz. Overcoming the Power Wall by Exploiting Inexactness and Emerging COTS Architectural Features Trading Precision for Improving Application Quality. 29th international IEEE System-on-Chip Conference (SOCC 2016), Sept 2016. (pdf)

C15.     H-V Dang, M. Snir, and W. Gropp, Towards millions of communicating threads. EuroMPI 2016: Proceedings of the 23rd European MPI Users' Group Meeting, Sept  2016 (Best paper) (pdf)

C16.     A. Brooks, H-V Dang, N. Dryden, and M. Snir. PPL: An abstract runtime system for hybrid parallel programming. First International Workshop on Extreme Scale Programming Models and Middleware (ESPM2 2015), Nov. 2015. (pdf)

C17.     Behzad, S. Byna, Prabhat, and M. Snir, Pattern-driven Parallel I/O Tuning. 10th Parallel Data Storage Workshop (PDSW15), Nov. 2015. (pdf)

C18.     Behzad, S. Byna, S. Wild, Prabhat, and M. Snir. Dynamic Model-driven Parallel I/O Performance Tuning. IEEE 2015 IEEE International Conference on Cluster Computing (Cluster), 184-193, Sept. 2015. (pdf)

C19.     J. Calhoun, M. Snir, L. Olson, and M. Garzaran, Understanding the Propagation of Error Due to a Silent Data Corruption in a Sparse Matrix Vector Multiply. IEEE International Conference on Cluster Computing (Cluster), 541-542, 2015. (pdf)

C20.     A. Sasidharan, J. Dennis, and M. Snir. A general Space-filling Curve Algorithm for Partitioning 2D Meshes. IEEE 17th International Conference on High-Performance Computing and Communication, 875-879, August 2015. (pdf)

C21.     S. Perarnau, R. Thakur, K. Iskra, K. Raffenetti, F. Cappello, R. Gupta, P. Beckman, M. Snir, H. Hoffmann, M. Schulz, and B. Rountree. Distributed Monitoring and Management of Exascale Systems in the Argo Project. IFIP International Conference on Distributed Applications and Interoperable Systems (DAIS), (LNCS, volume 9038) 173-178, 2015. (pdf).

C22.     J. Calhoun, L. Olson, M. Snir, and William Gropp. Towards a More Fault Resilient Multigrid Solver. Symposium on High Performance Computing/ Spring Simulation Conference (HPC/SpringSim),  2015 (pdf)

C23.     Gainaru, G. Aupy, A. Benoit, F. Cappello, Y. Robert, and M. Snir. Scheduling the I/O of HPC applications under congestion, IEEE International Parallel and Distributed Processing Symposium (IPDPS), 1013-1022.  2015. (pdf)

C24.     Behzad, H-V Dang, F. Hariri, W. Zhang, and M. Snir. 2014. Automatic generation of I/O kernels for HPC applications. 9th Parallel Data Storage Workshop (PDSW '14), 2014.  (pdf)

C25.     Behzad, S. Byna, S. M. Wild, Prabhat and M. Snir, Improving Parallel I/O Autotuning with Performance Modeling. HPDC, 2014 23rd international symposium on High-performance parallel and distributed computing, 253-256, 2014. (pdf)

C26.     J. Calhoun, L. Olson, and M. Snir, FlipIt: An LLVM Based Fault Injector for HPC. 7th Workshop on Resiliency in High Performance Computing in Clusters, Clouds, and Grids in conjunction with the 20th International European Conference on Parallel and Distributed Computing (Euro-Par) in Porto, Portugal, August 25-29, 2014. (pdf)

C27.     M. Snir, The future of supercomputing. 28th ACM international conference on Supercomputing, 2014 (pdf)

C28.     M. Snir. Programming models for Extreme-Scale computing. ACM Symposium on Principles of Distributed Computing, 3-3, 2013. (pdf)

C29.     B. Behzad, H. Vu, T. Luu, J. Huchette, S. Byna, Prabhat, R. Aydt, Q. Koziol, and M. Snir, Taming Parallel I/O Complexity with Auto-Tuning, SC13, #68, November 2013. (pdf)

C30.     J. Dinan, P. Balaji, D. Goodell, D. Miller, M. Snir and R. Thakur, Enabling MPI Interoperability Through Flexible Communication Endpoints, EuroMPI’13, 20th European MPI Users' Group Meeting, 13-18, Sept. 2013, Madrid, Spain. (pdf)

C31.     S. Li, T. Hoefler and M. Snir.  NUMA-Aware Shared Memory Collective Communication for MPI, HPDC '13: Proceedings of the 22nd international symposium on High-performance parallel and distributed computing, 85-96, June 2013, New York, NY. (pdf)

C32.     A. Yee and M. Snir. A Faster FFT in the Mid-West. 2013 SIAM Conference on Computational Science and Engineering (CSE13), Feb. 2013, Boston, Massachusetts. 

C33.     Gainaru, F. Cappello, W. Kramer and M. Snir. Fault prediction under the microscope: A closer look into HPC systems. 2012 International Conference for High Performance Computing, Networking, Storage and Analysis (SC12), Nov. 2012, Salt Lake City, Utah. (pdf)

C34.     M. Dorier, G. Antoniu, F. Cappello, M. Snir, and L. Orf. Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O. 2012 IEEE International Conference on Cluster Computing (Cluster 2012), 155-163, Sept. 2012, Beijing, China. (pdf)

C35.     F. Kjolstad, T. Hoefler, and M. Snir. Automatic datatype generation and optimization. 17th ACM SIGPAN symposium on Principles and Practice of Parallel Programming (PPoPP), 327-328, 3012. ACM. (pdf)

C36.     Guermouche, A., Ropars, T., Snir, M., and Cappello, F. (2012, May). HydEE: Failure Containment without Event Logging for Large Scale Send-Deterministic MPI Applications. In IEEE 26th International Parallel & Distributed Processing Symposium (IPDPS), 1216-1227, 2012. (pdf)

C37.     F. Cappello, M. Jacquelin, L. Marchal, Y. Robert and M. Snir. Comparing archival policies for Blue Waters. 18th Annual International Conference on High Performance Computing (HiPC 2011), 1-10, Dec 2011. (pdf) 

C38.     T. Hoefler, W. Gropp, W. Kramer, and M.Snir. 2011. Performance modeling for systematic performance tuning. In State of the Practice Reports (SC '11). ACM, New York, NY, USA, Article 6, 12 pages. (pdf)

C39.     J. Zhang, B. Behzak and M. Snir, Optimizing the Barnes-Hut Algorithm in UPC. The Supercomputing conference (SC11), Nov. 2011. (pdf)

C40.     T. Hoefler and M. Snir. 2011. Performance engineering: a must for petascale and beyond. In 3rd international workshop on Large-scale system and application performance (LSAP '11), 1-2, June 2011. (pdf)

C41.     T. Hoefler and M. Snir. Writing parallel libraries in MPI - common practices, issues and extensions. EuroMPI’11, 18th European MPI Users' Group Meeting, 345-355, Sept. 2011. (pdf) 

C42.     T. Hoefler and M.Snir, Generic Topology Mapping Strategies for Large-scale Parallel Architectures. International Conference on Supercomputing (ICS11), 75-84, May 2011. (pdf)

C43.     Guermouche, T. Ropars, E. Brunet, M.Snir and F. Cappello. Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic Message Passing Applications. 25th  IEEE International Parallel & Distributed Processing Symposium (IPDPS 2011), 989-1000, May 2011. (pdf)

C44.     F. B. Kjolstad, D. Dig, G. Acevedo and M. Snir. Transformations for Class Immutability.  33rd International Conference on Software Engineering (ICSE'11), 61-70 May 2011.

C45.     F. B. Kjolstad, D. Dig and M. Snir. Bringing the HPC Programmer’s IDE into the 21st Century through Refactoring. ACM Workshop on Concurrency for the Application Programmer (CAP), Oct. 2010. (pdf) 

C46.     F. Cappello, A. Guermouche and M. Snir. On Communication Determinism in HPC Applications. International Conference on Computer Communication Networks (ICCN 2010), 1-8, August 2010. (pdf)

C47.     R. Bocchino, S. Heumann, N. Honarmand, R. Komuravelli, J. Overbey, P. Simmons, H. Sung, M. Vakilian, S. V. Adve, V. S. Adve, D. Dig, and M. Snir, A Language for Deterministic-by-Default Parallel Programming. 15th Workshop on Compilers for Parallel Computing (CPC), July 2010. (pdf)

C48.     F.  Kjolstad and M. Snir, Ghost Cell Pattern. Workshop on Parallel Programming Patterns (ParaPlop) June 2010.

C49.     R. Johnson and M. Snir, N-body Pattern Language. Workshop on Parallel Programming Patterns (ParaPlop) June 2009 

C50.     M. Snir. Universal parallel computing research center at Illinois:  Making parallel programming synonymous with programming. IEEE Hot Chips 21st Symposium (HCS), 2009. (pdf)

C51.     R. Bocchino, V. Adve, S. Adve and M. Snir. Parallel Programming Must Be Deterministic By Default , 1st Usenix Workshop on Hot Topics in Parallelism (HotPar), 4-4, March 2009. (pdf)

C52.     J. Yu, M. Garzaran and M. Snir. ESoftCheck: Removal of Non-vital Checks for Fault Tolerance. 2009 International Symposium on Code Generation and Optimization (CGO2009), 35-46, Seattle, WA, March 2009. (pdf)

C53.     J. Yu, M. Garzaran, and M. Snir, Efficient software checking for fault tolerance. IEEE International Symposium on Parallel and Distributed Processing (IPDPS 2008), 1-5, 2008. (pdf)

C54.     M. Wei, C. Jiang and M. Snir. Programming Patterns for Architecture-Level Software Optimizations on Frequent Pattern Mining. IEEE 23rd International Conference on Data Engineering (ICDE 2007), 336-345, April 2007. (pdf)

C55.     Jiang and M. Snir. Automatic Tuning Matrix Multiplication on Graphic Hardware. 14th International Conference on Parallel Architectures and Compilation Techniques (PACT 2005), St. Louis MO, 185-194, Sept 2005. (pdf)

C56.     Mingliang Wei, Marc Snir, Josep Torrellas and R. Brett Tremaine. “A Near-Memory Processor for Vector, Streaming and Bit Manipulation Networks.” The 2nd Watson Conference on Interaction between Architecture, Circuits and Compilers (P=ac2), Yorktown Heights NY, 32-41, Sept 2005. (pdf)

C57.     G.Almasi, C. Cascaval, J. G. Castanos, M. Denneau, W. Donath, M. Eleftheriou, M. Giampapa, H. Ho, D. Lieber, J. E. Moreira, M. Snir and H. S. Warren Jr.. Demonstrations of the scalability of a molecular dynamic application on a petaflop computer. 15th ACM International Conference on Supercomputing (ICS), 393-406, June 2001. (pdf)

C58.     E. Wu, A. Bolmarcich, M. Snir, D. Wootton, F. Parpia, A. Chen, E. Lusk and W. Gropp. From Trace Generation to Visualization: A Performance Framework for Distributed Parallel Systems.  SC00 International Conference for High Performance Computing, Networking, Storage and Analysis , Nov. 2000. (pdf)

C59.     S. P. Midkiff,  J. E. Moreira and M. Snir. Java for numerically intensive computing: from flops to gigaflops. Seventh Symposium on the Frontiers of Massively Parallel Computation(Frontiers '99), 251 – 257 (1999). (pdf)

C60.     E. Kattamuri, B-H Lim, P. Pattnaik and M. Snir. PRISM: an integrated architecture for scalable shared memory. Fourth International Symposium on High Performance Computer Architecture (HPCA) Las Vegas, NV, USA 1-4 Feb. 1998, 140-151. (pdf)

C61.     E. Kattamuri, B-H Lim, P Pattnaik and M Snir. A design for a Scalable Shared Memory. Innovative Architecture for Future Generation High-Performance Processors and Systems, 29, Oct 97. Also published in an ASCI Blue Pacific white paper, Supercomputing 97. (pdf)

C62.     CB-H Lim, P. Heidelberger, P. Pattnaik and M. Snir, Message proxies for efficient, protected communication on SMP clusters. 3rd International Symposium on High Performance Computing Architectures (HPCA), Feb. 1997, 116-127. (pdf)

C63.     Marc Snir. Scalable Parallel Systems: Past, Present and Future (from an IBM Perspective). 3rd International Conference on Massively Parallel Processing Using Optical Interconnections, Oct. 1996, 33-35. (pdf) 

C64.     Geist, W. Gropp, S.Huss-Lederman, A. Lumsdaine, E.Lusk, W. Saphir, T. Skellum and M.Snir. MPI-2: extending the Message-Passing Interface. Second International Euro-Par Conference, Lyon, France, August 1996. Published in Lecture Notes in Computer Science 1(1123), 128-135, L. Bouge, P. Fraigniaud, A. Mignotte and Y. Robert (eds.), Springer Verlag, 1996. (pdf)

C65.     Foster, C. Kesselman and M. Snir. Generalized Communicators in the Message Passing Interface. Second MPI Developer's Conference Message passing interface MPI Developer's Conference. Notre-Dame, IN, July 96, 42-49. (pdf)

C66.     H. Franke, C. Wu, M. Riviere, P. Pattnaik, and M. Snir. MPI Programming Environment for IBM SP1/SP2. 15th International Conference on Distributed Computing Systems, Vancouver, B.C. Canada, May-June 1995, 127-135. (pdf)

C67.     P. Corbett, D. Feitelson, S. Fineberg, Y. Hsu, B. Nitzberg, J-P Prost, M. Snir, B. Traversat and Parkson Wong. Overview of the MPI-IO Parallel I/O Interface. IPPS'95 Workshop on Input/Output in Parallel and Distributed Systems (IOPADS), 1-15, April 95. (pdf)

C68.     V. Bala, J. Bruck, R. Cypher, P. Elustondo, A. Ho, C-T. Ho, V. Bala, S. Kipnis, and M. Snir. CCL: A portable and tunable collective communication library for scalable parallel computers. 8th Int. Parallel Processing Symposium, 835-844, April 1994. (pdf)

C69.     H. Franke, P. Hochschild, P. Pattnaik, and M. Snir. An efficient implementation of MPI. IFIP WG10.3  Working Conference on Programming Environments for Massively Parallel Distributed Systems, April 1994. (pdf)

C70.     H. Franke, P. Hochschild, P. Pattnaik, J.P. Prost and M. Snir. MPI-F: an MPI prototype implementation on IBM SP1. Workshop on Environments and Tools for Parallel Scientific Computing, Townsend TN, May 1994. (pdf )

C71.     H. Franke, P. Hochschild, P. Pattnaik and M. Snir. MPI-F: An efficient implementation of MPI on IBM SP1. 23rd Annual International Conference on Parallel Processing, Aug. 1994, 197-201, vol. 3. (pdf)

C72.     P.F. Corbett, D.G. Feitelson, J-P. Prost, and M. Snir. User-friendly and efficient parallel I/O using the Vesta parallel file system. Transputers '94 International Conference, Advanced Research and Industrial Applications, Sept. 94, 23-38. (pdf)

C73.     H. Franke, P. Hochschild, P. Pattnaik, J-P. Prost, M. Snir. MPI on IBM SP1/SP2: current status and future directions. Scalable Parallel Libraries Conference II, 39-48, Oct. 1994. (pdf)

C74.     V. Bala, S. Kipnis, L. Rudolph, and M. Snir. Designing Efficient, Scalable, and Portable Collective Communication Libraries. SIAM '93 Conference on Parallel Processing for Scientific Computing, Norfolk, VA, 862-872, March 1993. (pdf)

C75.     M. Snir. Scalable Parallel Computing -- The IBM 9076 Scalable POWERParallel 1. Symposium on Parallel Algorithms and Architectures (SPAA), Schloss-Vellen, June 1993, 42-42. (pdf)

C76.     E. Upfal, S. Pelegrin and M. Snir. Randomized routing with shorter paths. Symposium on Parallel Algorithms and Architectures (SPAA) Schloss-Vellen, June 1993, 283-293. (pdf)

C77.     M. Snir. Issues and directions in scalable parallel computing. 11th ACM Symposium on Principles of Distributed Computing, 12th Annual ACM Symposium on Principles of Distributed Computing, 1993, 21-28. (pdf)

C78.     M. Snir. Scalable Parallel Computers and Scalable Parallel Codes: From Theory to Practice. First Heinz Nixdorf Symposium: Parallel Architectures and Their Efficient Use, Paderborn, November 1992. Published in Lecture Notes in Computer Science 678, F. Meyer auf der Heide, B. Monien and A.L. Rosenberg (Eds.), Springer-Verlag Berlin 1993, 176-184. (pdf)

C79.     Malki and M. Snir. Nicke - C extensions for programming on distributed memory machines. Presented at ICASE Workshop on Languages, Compilers and Run-Time Environments for Distributed Memory Machines, May 1990. Published in Languages, Compilers and Run-Time Environments for Distributed Memory Machines, J. Saltz and P. Mehotra (eds.), Elsevier North-Holland, 1992, 103-118. (pdf)

C80.     Zernik, L. Rudolph and M. Snir. Aspects of the parallel program execution: work, time and the current state. Fifth Israel Conference on Computer Systems and Software Engineering (1991) 162-172. 

C81.     Coppersmith, P, G. Doyle, P. Raghavan and M.Snir. Random walks on weighted graphs, and applications to on-line algorithms. 22th Annual ACM Symposium on Theory of Computing (STOC), May 1990, 369-378. (pdf)

C82.     Aggarwal, A. K. Chandra and M. Snir. On communication latency in PRAM computations. 1st Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), Santa Fe NM, 11-21. (pdf)

C83.     P. Raghavan and M. Snir. Memory versus randomization in on-line algorithms. 16th International Colloquium on Automata, Languages and Programming (ICALP), Stresa, Italy, July 1989, 687-703. (pdf)

C84.     Kruskal and M. Snir. Cost-bandwidth tradeoffs for communication networks. 1st Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), Santa Fe NM, June 1989, 32-41. (pdf)

C85.     Schieber and M. Snir. Calling names on nameless networks. 8th Symp. on Principles of Distributed Computing (PODC), Edmonton, Alberta, Canada, Aug. 1989, 319-328. (pdf)

C86.     H. Attiya and M. Snir. Better computing on the anonymous ring. 3rd Aegean Workshop on Computing, AWOC 88 Corfu, Greece, June 1988, 329-338. (pdf)

C87.     C. Kruskal, L. Rudolph and M. Snir. A complexity theory of parallel efficient algorithms. Automata, Languages and Programm47ing, 8th Colloquium, Tampere, Finland, July 1988, 333-346. (pdf)

C88.     M. Snir. Parallel Computation Models -- Some Useful Questions. Proc. of the IBM Workshop on Opportunities and Constraints of Parallel Computing, J.S. Sanz (ed.), Dec. 1988, 139-145. (pdf)

C89.     Aggarwal, B. Alpern, A. Chandra and M. Snir. A model for hierarchical memory. 19th Annual ACM Symposium on Theory of Computing, May 1987, 305-314. (pdf)

C90.     Aggarwal, A. Chandra and M. Snir,. Hierarchical memory with block transfer. 28th Symposium on Foundations of Computer Science, Oct. 1987, 204-216. (pdf)

C91.     R. Kenner, M. Snir, and P. Teller, TLB consistency on highly-parallel shared-memory multiprocessors. 21st Hawaii Int. Conf. on System Sciences, Jan. 1988, 184-192. (pdf)

C92.     Kruskal, L. Rudolph and M. Snir, Techniques for parallel manipulation of sparse matrices. parallel algorithms for sparse matrices. Int. Symp. on High Performance Computer Systems, Paris, Dec. 1987, 3-14. (pdf)

C93.     Kruskal, L. Rudolph and M. Snir. Efficient synchronization on multiprocessors with shared memory. 6th Symp. on Principles of Distributed Computing (PODC), Aug. 1986, 218-228. (pdf)

C94.     Kruskal, L. Rudolph and M. Snir. Efficient parallel algorithms for graph problems. Int. Conference on Parallel Processing (ICPP), Aug. 1986, 869-876. (pdf)

C95.     C. Kruskal, M. Snir, and A. Weiss. On the distribution of waiting times in clocked multistage interconnection networks. Int. Conference on Parallel Processing (ICPP), Aug. 1986, 12-19. (pdf)

C96.     M. Snir. Communicating with shared variables in multiprocessors: hardware-software tradeoffs. 1st Israeli Conference on Computer System Engineering, June 1986. 

C97.     J. Edler, A. Gottlieb, C. Kruskal, K. McAuliffe, L. Rudolph, M. Snir, P. Teller, and J. Wilson. Issues Related to MIMD, Shared-Memory Computers: The NYU Ultracomputer Approach. 12th Annual Symp. on Computer Architecture (ISCA), Boston, June 1985, 126-135. (pdf)

C98.     C. Attiya, M. Snir and M. Warmuth. Computing in an anonymous ring. 5th Symp. on Principles of Distributed Computing, Aug. 1985, 196-203. (pdf)

C99.     S. Dickey, R. Kenner, J. Solworth, and M. Snir. A VLSI Combining Network for the NYU Ultracomputer. IEEE Proc. of the International Conference on Computer Design, Port Chester, October 1985, 110-113. (pdf)

C100.  C. Kruskal and M. Snir. The importance of being square. 11th annual IEEE Symposium on Computer Architecture (ISCA), June 1984, 91-98. (pdf)

C101.  C. Kruskal, M. Snir, and A. Weiss. On the distribution of delays in buffered multistage interconnection networks for uniform and nonuniform traffic. Int. Conf. on Parallel Processing (ICPP), Aug. 1984, 215-219. (pdf)

C102.  U. Manber, S. Moran and M. Snir. Applications of Ramsey's theorem to decisions tree complexity. Annual Symp. on Foundations of Computer Science (FOCS), Singer Island, FL, Nov. 1984, 69-82. (pdf)

C103.  Gottlieb, R. Grishman, C.K. Kruskal, K.P. McAuliffe, L. Rudolph, and M. Snir. The NYU Ultracomputer - A paradigm for future large-scale parallel processors. National Conference on Communications (NCC), 1982. 

C104.  Y. Perl and M. Snir. Circuit Partitioning with size and connection constraints. Princeton Conf. on Information Sciences and Systems, Princeton NJ, 1982, 80-84. (pdf)

C105.  C. Kruskal and M. Snir. Some results on packet-switching networks for multiprocessing. Princeton Conf. on Information Sciences and Systems, Princeton NJ, 1982, 305-310. (pdf)

C106.  M. Snir. On parallel searching. ACM Symposium on Principles of Distributed Computing (PODC), 242-253, Ottawa, Canada, August 1982, 242-253. (pdf)

C107.  M. Snir. Proving lower bounds for linear decision trees. 8th International Colloquium on Automata, Language and Programming (ICALP), Acre, Israel, July 1981, 305-315. (pdf)

C108.  Gottlieb, R. Grishman, C.K. Kruskal, K.P. McAuliffe, L. Rudolph, and M. Snir. The NYU Ultracomputer - a general-purpose parallel processor. SPIE's 25th Ann. Int. Symp., San Diego, Aug. 1981 (Invited paper). (pdf)

C109.  M. Snir, I/O limitations on multi-chip VLSI systems. 19th Allerton Conference on Communication, Control, and Computing, Sept. 1981. Allerton IL, 224-233 (pdf) 

C110.  Gottlieb, R. Grishman, C.K. Kruskal, K.P. McAuliffe, L. Rudolph, and M. Snir. The NYU Ultracomputer - Designing a MIMD, shared-memory parallel machine. IEEE 9th Annual International Symposium on Computer Architecture (ISCA), Austin TX, April 1982, 27-42. Republished in ISCA’98 25 years of the international symposia on Computer architecture (selected papers), 1998, 239-254. (pdf)

C111.  M. Snir. On the size complexity of monotone formulas. 7th International Colloquium on Automata, Language and Programming (ICALP), Noordwijkerhout, the Netherlands, July 1980, 621-631. (pdf)

C112.  H. Gaifman and M. Snir. Probabilities Defined on Arithmetical Languages: Induction and Randomness. Logic Colloquium ’77, Wroclaw, Poland, August 1977. 

C113.  Barak and M. Snir. Near optimal algorithms for the parallel evaluation of linear recurrences.  13th Allerton Conference on Circuit and System Theory, Oct. 1975, Allerton, IL, 867-872. (pdf)

 

Blogs

L1.     Marc Snir, Multi-core and Parallel Programming: Is the Sky Falling? CCC Blog, 11/17/2008

L2.     Blogtalk radio Intel

L3.     Jack Dongarra et. al. International Exascale Software Project Roadmap

 

Magazine Articles and reports

M1.      Marc Snir, The Informal Guide to ACM Fellow Nominations. CACM, Vol. 60, No. 7, July 2017, pages 32-34.

M2.      Marc Snir, Supercomputers, Clouds and the Exascale Frontier. CIO Review, March 2 2015

M3.      Al Geist, Bob Lucas, Marc Snir, Shekhar Borkar , Eric Roman, Mootaz Elnozahy , Bert Still, Andrew Chien, Robert Clay, John Wu, Christian Engelmann, Nathan DeBardeleben, Rob Ross, Larry Kaplan, Martin Schulz, Mike Heroux, Sriram Krishnamoorthy, Lucy Nowell, Abhinav Vishnu, Lee-Ann Talley. Final Report of the U.S. Department of Energy Fault Management Workshop, June 2012

M4.      Pete Beckman, Ron Brightwell, Bronis R. de Supinski, Maya Gokhale, Steven Hofmeyr, Sriram Krishnamoorthy, Mike Lang, Barney Maccabe, John Shalf and Marc Snir. Exascale Operating Systems and Runtime Software Report, Sponsored by the U.S. Department of Energy, Of­fice of Science, Of­fice of Advanced Scienti­fic Computing Research, December 2012.

M5.      Stephen Booth, Dan Campbell, Andrew Chien, Richard Lethin, Lenore Mullin, Arun Rodrigues, Ron Sass, John Shalf, Marc Snir, Tom Sterling. Report of the Workshop on Architectures II: Exascale, and Beyond: Configuring, Reasoning, and Scaling. Sponsored by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Aug 2011.

M6.      Jack Dongarra, Pete Beckman, Terry Moore, Patrick Aerts, Giovanni Aloisio, Jean-Claude Andre, David Barkai, Jean-Yves Berthou, Taisuke Boku, Bertrand Braunschweig, Franck Cappello, Barbara Chapman, Xuebin Chi, Alok Choudhary, Sudip Dosanjh, Thom Dunning, Sandro Fiore, Al Geist, Bill Gropp, Robert Harrison, Mark Hereld, Michael Heroux, Adolfy Hoisie, Koh Hotta, Zhong Jin, Yutaka Ishikawa, Fred Johnson, Sanjay Kale, Richard Kenway, David Keyes, Bill Kramer, Jesus Labarta, Alain Lichnewsky, Thomas Lippert, Bob Lucas, Barney Maccabe, Satoshi Matsuoka, Paul Messina, PeterMichielse, BerndMohr,Matthias S.Mueller,Wolfgang E. Nagel, Hiroshi Nakashima, Michael E Papka, Dan Reed, Mitsuhisa Sato, Ed Seidel, John Shalf, David Skinner, Marc Snir, Thomas Sterling, Rick Stevens, Fred Streitz, Bob Sugar, Shinji Sumimoto, William Tang, John Taylor, Rajeev Thakur, Anne Trefethen, Mateo Valero, Aad van der Steen, Jeffrey Vetter, Peg Williams, Robert Wisniewski and Kathy Yelick. The International Exascale Software Project roadmap. The International Journal of High Performance Computing Applications 25(1) 3–60, 2011  

M7.      Marc Snir, Bill Gropp and Peter Kogge, Exascale Research: Preparing for the Post-Moore Era, White paper, 2011

M8.      Saman Amarasinghe, Mary Hall, Richard Lethin, Keshav Pingali, Dan Quinlan, Vivek Sarkar, John Shalf, Robert Lucas, Katherine Yelick, Pavan Balaji, Pedro C. Diniz, Alice Koniges, Marc Snir and Sonia R. Sachs, Exascale Programming Challenges. Report of the 2011 Workshop on Exascale Programming Challenges. Sponsored by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research (ASCR), July 2011

M9.      Marc Snir, Computer and information science and engineering: one discipline, many specialties.  Communication of the ACM, March 2011

M10.   Oswaldo Trelles, Pjotr Prins, Marc Snir and Ritsert Jansen. Big data, but are we ready? Correspondence, Nature Reviews genetics, 8/2/2011

M11.   Marc Snir and Telle Whitney, Advice to Members Seeking ACM Distinction, CACM July 2010, pg. 40-41

M12.   Martha E. Pollack and Marc Snir. Best Practices in Promotion and Tenure of Interdisciplinary Faculty, Computing Research News, 20(4), 2008

M13.   Marc Snir, “Brain Surgery for the Masses.” HPCWire, Vol. 16, No. 8, Feb. 23rd 2007

M14.   Marc Snir, U.S. Computer Science Delegation Visits China, Computing Research News, Vol. 18/No. 5, Nov. 2006

M15.   Graham, Susan L., Snir, Marc and Patterson, Cynthia A. Bolstering “U.S. Supercomputing.” Issues in Science and Technology, v 21, n 4, Summer, 2005, p 28-32. 

M16.   Susan L. Graham and Marc Snir, “The NRC Report on the Future of Supercomputing.” Cyberinfrastructure Technology Watch Quarterly, Feb. 2005 (9 pages).

M17.   Mitchell D Theis, Shoukat Ali, Howard Jay Siegel, Many Chandy, Kai Hwang, Ken Kennedy, Lui Sha, Kang G Shin and Marc Snir, “What are the Top Ten Most Influential Parallel and Distributed Processing Concepts of the Past Millenium?” Journal of Parallel and Distributed Computing 61, 2001, 1827-1841.

M18.   D. Reed, C. Catlett, A. Choudhari, D. Kotz, and M. Snir, “Parallel I/O: Getting ready for prime time” (editorial). IEEE Parallel and Distributed Technology 3(2), Summer 95, 64-71. Also in IEEE Computer Architecture Technical Committee Newsletter, Winter 94-95, 45-55.