Sparsity/Bebop
-
Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply
- Richard Vuduc, James W. Demmel, Katherine A. Yelick,
Shoaib Kamil, Rajesh Nishtala, Benjamin Lee.
- SC 2002 (High Performance
Networking and Computing, commonly called "Supercomputing").
Baltimore, November 2002.
- Available in pdf (834k)
| Gzip'd PostScript (2.7M)
-
Automatic Performance Tuning and Analysis of Sparse
Triangular Solve
- Richard Vuduc, Shoaib Kamil, Jen Hsu, Rajesh Nishtala,
James W. Demmel, Katherine A. Yelick.
- ICS 2002:
Workshop
on Performance Optimization via High-Level Languages and
Libraries. New York, June 22-26, 2002.
- Available in pdf (548k)
| Gzip'd PostScript (1.2M)
-
Optimizing Sparse Matrix-Vector Multiplication for Register Reuse
- E. Im and K. A. Yelick
- International Conference on Computational Science, San Francisco,
California, May 2001.
- (Postscript)
-
Optimizing Sparse Matrix Kernels for Data Mining
- E. Im and K. A. Yelick
- Proceedings of the Text Mine Workshop
- Chicago, IL, April 2001.
- (Postscript)
-
Optimizing Sparse Matrix Vector Multiplication on SMPs
- E. Im and K. A. Yelick
- SIAM Conf. Parallel Processing for Scientific Computing, San
Antonio, TX, March 1999.
- (Postscript)
-
Model-based Memory Hierarchy Optimizations for Sparse Matrices
- E. Im and K. A. Yelick
- Workshop on Profile and Feedback-Directed Compilation, Paris,
France, October 1998.
- (Postscript)
Intelligent RAM (IRAM)
-
Memory-Intensive Benchmarks: IRAM vs. Cache-Based Machines,
- Brian R. Gaeke, Parry Husbands, Xiaoye S. Li, Leonid Oliker,
Katherine A. Yelick, and Rupak Biswas.
- Proceedings of the International Parallel and Distributed Processing
Symposium (IPDPS). Ft. Lauderdale, FL.
- April, 2002.
- Available in PDF.
-
Hardware/Compiler Co-development for an Embedded Media Processor,
- C. Kozyrakis, D. Judd, J. Gebis, S. Williams, D. Patterson, K. Yelick,
- Proceedings of the IEEE, vol. 89, no. 11, November 2001 (p. 1694-709).
- Draft available in PDF.
-
Exploiting On-Chip Memory Bandwidth in the VIRAM Compiler,
- D. Judd, K. Yelick, C. Kozyraki, D. Martin, and D. Patterson,
- Second Workshop on Intelligent Memory Systems, Cambridge,
November 2000.
- Available in Postscript.
-
Performance Analysis of an H.263 Video Encoder on VIRAM,
- T. Nguyen, A. Zakhor and K. Yelick
- International Conference on Image Processing (ICIP),
- Vancouver, B.C., Canada, September 2000.
- Available in PDF
-
Efficient FFTs on IRAM
- Thomas, R. and Yelick, K.
- First Workshop on Media Processors and DSPs,
November 15, 1999.
- Postscript available.
-
Scalable processors in the billion-transistor era: IRAM
- Kozyrakis, C.E., Perissakis, S., Patterson, D., Anderson, T.,
Asanovic, K., Cardwell, N., Fromm, R., Golbus, J., Gribstad, B.,
Keeton, K., Thomas, R., Treuhaft, N., Yelick, K.
- Computer, vol.30, (no.9), IEEE Comput. Soc,
Sept. 1997. p.75-8.
- Available in PDF.
-
The Energy Efficiency of IRAM Architectures
- R. Fromm, S. Perissakis, N. Cardwell, D. Patterson, T. Anderson,
and K. Yelick
- Proceedings of the 24th Annual International Conference on
Computer Architecture, June 1997.
- Available in Postscript.
-
A Case for Intelligent DRAM: IRAM
- D. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton,
C. Kozyrakis, R. Thomas, and K. Yelick.
- IEEE Micro, April 1997, pp. 34-44.
Also appeared as an Award Paper, Hot Chips VIII , August 1996.
- Available in PDF or
Postscript.
-
Intelligent RAM (IRAM): Chips that remember and compute
- D. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton,
C. Kozyrakis, R. Thomas, and K. Yelick.
- Proceedings of the 1997 IEEE International Solid-State Circuits
Conference, February 1997, pp. 224-225.
- Available in PDF or
Postscript.
Clusters (includes ISTORE and ROC)
-
ROC-1: Hardware Support for Recovery-Oriented Computing.
- Oppenheimer, D., A. Brown, J. Beck, D. Hettena, J. Kuroda,
N. Treuhaft, D.A. Patterson, and K. Yelick.
- IEEE Transactions on Computers Special Issue on
Embedded Fault-Tolerant Computer Systems, Jul.-Aug., 2001.
- Available in PDF
-
Cluster I/O with River: Making the Fast Case Common
- R. H. Arpaci-Dusseau, E. Anderson, N. Treuhaft, D. E. Culler,
J. M. Hellerstein, D. A. Patterson, and K. A. Yelick
- Workshop on I/O in Parallel and Distributed Systems, Atlanta, GA,
May 1999.
- Postscript available.
Parallel Applications
-
Performance Modeling and Composition: A Case Study in
Cell Simulation
- Seve G. Steinberg, Jun Yang, and Katherine Yelick, IPPS '96
April 1996.
- Abstract,
Postscript available.
-
Parallelizing the Phylogeny Problem
- J. Jones and K. Yelick, Supercomputing '95
December 1995.
- Abstract,
Postscript available.
-
Connected Components on Distributed Memory Machines
- A. Krishnamurthy, S. Lumetta, D. Culler, and K. Yelick,
June 1994.
- Abstract,
Postscript available.
-
Parallel Timing Simulation on a Distributed Memory Multiprocessor
- Chih-Po Wen and Katherine Yelick, International Conference on
Computer Aided Design, Santa Clara, California, November 1993.
- Abstract,
Postscript available.
-
Implementing an Irregular Application on a Distributed Memory
Multiprocessor
- Soumen Chakrabarti and Katherine Yelick, ACM Symposium on
Principles and Practice of Parallel Programming, San Diego,
California, June 1993.
- Abstract,
Postscript available.
-
A Parallel Completion Procedure for Term Rewriting Systems
- Katherine Yelick and Stephen J. Garland, Conference on
Automated Deduction , June 1992.
- Abstract,
Postscript available.
Compilation
-
Analyses and Optimizations for Shared Address Space Programs
- A. Krishnamurthy and K. Yelick
- Journal of Parallel and Distributed Computation, 1996.
- Postscript available.
-
Optimizing Parallel Programs with
Explicit Synchronization
- Arvind Krishnamurthy and Katherine Yelick,
Programming Language Design and Implementation,
La Jolla, California, June 1995.
- Abstract,
Postscript available.
-
Optimizing Parallel SPMD Programs
- Arvind Krishnamurthy and Katherine Yelick,
Seventh Annual Workshop on Languages and Compilers for Parallel
Computing, Ithaca, New York, August 1994.
- Abstract,
Postscript available.
-
Compiling Sequential Programs for Speculative Parallelism
- Chih-Po Wen and Katherine Yelick, International Conference on
Parallel and Distributed Systems, National Taiwan University,
Taiwan, December 1993.
- Abstract,
Postscript available.
Scheduling and Load Balancing
-
Models and Scheduling Algorithms for Mixed Data and Task Parallel
Programs
- S. Chakrabarti, J. Demmel, and K. Yelick
- Journal of Parallel and Distributed Computing,
Vol. 47, pp. 168--184. 1997.
-
Modeling the Benefits of Mixed Data and Task Parallelism
- Soumen Chakrabarti, James Demmel, and Katherine Yelick,
Symposium on Parallel Algorithms and Architectures, Santa Barbara,
California, July 1995.
- Abstract,
Postscript available.
-
Randomized Load Balancing for Tree Structured Computation
- Soumen Chakrabarti, Abhiram Ranade, and Katherine Yelick,
IEEE Scalable High Performance Computing Conference, Knoxville,
Tennessee, May 1994.
- Abstract,
Postscript available.
Distributed Data Structures & the Multipol Library
-
Portable Parallel Irregular Applications.
- K. Yelick, C.-P. Wen, S. Chakrabarti, E. Deprit,
J. Jones, A. Krishnamurthy, Workshop on Parallel Symbolic
Languages and Systems, Beaune, France, October 1995.
To appear in Lecture Notes in Computer Science.
- Abstract,
Postscript available.
-
Multipol: A Distributed Data Structure Library.
- S. Chakrabarti, E. Deprit, J. Jones, A. Krishnamurthy,
E.-J. Im, C.-P. Wen, and K. Yelick, UCB//CSD-95-879, July 1995.
- Abstract,
Postscript available.
-
Portable Runtime Support for Asynchronous Simulation
- Chih-Po Wen and Katherine Yelick, International Conference on
Parallel Processing, August 1995.
- Abstract,
Postscript available.
-
Portable Runtime Support for Asynchronous Simulation
- C.-P. Wen, S. Chakrabarti, E. Deprit,
A. Krishnamurthy and K. Yelick,
``Runtime Support for Portable Distributed Data Structures,''
Workshop on Languages, Compilers, and Runtime Systems for
Scalable Computers, May 1995.
- Postscript available.
-
Distributed Data Structures and Algorithms for Gröbner Basis
Computation
- Soumen Chakrabarti and Katherine Yelick, Lisp and Symbolic
Computation, Vol. 7, 1994.
- Abstract available.
-
Data Structures for Irregular Applications
- K. Yelick, S. Chakrabarti, E. Deprit, J. Jones, A. Krishnamurthy,
and C.-P. Wen, DIMACS Workshop on Parallel Algorithms for
Unstructured and Dynamic Problems, Piscataway, New Jersey, June 1993.
- Abstract,
Postscript available.
-
Programming Models for Irregular Applications
- Katherine Yelick.
Workshop on Languages and Compilers and Run-Time Environments
for Distributed Memory Multiprocessors, October 1992.
Also appeared in SIGPLAN Notices, January 1993.
- Postscript available.
-
A Survey of Portable Message Passing Libraries
- Chih-Po Wen and Katherine Yelick, unpublished manuscript,
October 15, 1992.
- Postscript available.
Parallel Languages: Split-C, Titanium, and UPC
-
An Evaluation of Current High Performance Networks,
- C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove,
P. Husbands, C. Iancu, M. Welcome, K. Yelick,
- International Parallel and Distributed Processing Symposium,
Nice, France, April 22-26, 2003.
- Available in PDF
-
Introduction to UPC and Language Specification,
- W. Carlson, J. Draper, D. Culler, K. Yelick, E. Brooks, and K. Warren,
- CCS-TR-99-157, IDA Center for Computing Sciences, 1999.
- Available in PDF
-
Titanium: A High-Performance Java Dialect
- K. A. Yelick, L. Semenzato, G. Pike, C. Miyamoto, B. Liblit, A.
Krishnamurthy, P. N. Hilfinger, S. L. Graham, D. Gay, P. Colella,
and A. Aiken
- Concurrency: Practice and Experience,
Vol. 10, No. 11-13, September-November 1998. An earlier version was
presented at the Workshop on Java for High-Performance Network Computing,
Palo Alto, CA, Feb. 1998.
- Postscript available.
-
Empirical Evaluation of Global Memory Support on the Cray-T3D and
Cray-T3E
- A. Krishnamurthy, D. Culler, and K. Yelick
- UCB//CSD-98-991, 1998.
- Postscript available.
-
Evaluation of Architectural Support for Global
Address-Based Communication in Large-Scale Parallel Machines
- Arvind Krishnamurthy, Klaus E. Schauser, Chris Scheiman, Randy Wang,
David Culler, and Katherine Yelick,
Proceedings of Architecture Support on Programming Languages and
Operating Systems, Cambridge, MA, November 1996.
- Postscript available.
-
Empirical Evaluation of the CRAY-T3D: A Compiler Perspective
- Remzi H. Arpaci, David E. Culler, Arvind Krishnamurthy,
Steve G. Steinberg, and Katherine Yelick,
International Symposium on Computer Architecture,
Santa Margherita Ligure, Italy, June 1995.
- Abstract,
Postscript available.
-
Parallel Programming in Split-C
- D. Culler, A. Dusseau, S. Goldstein, A. Krishnamurthy, S. Lumetta,
T. von Eicken, and K. Yelick, Supercomputing, Portland, Oregon,
November 1993.
- Abstract,
Postscript available.
Symbolic Computation
-
On the Correctness of a Distributed Memory Gröbner Basis
Algorithm
- Soumen Chakrabarti and Katherine Yelick, International Conference
on Rewriting Techniques and Applications, Montreal, Canada, June 1993.
- Abstract,
Postscript available.
-
Compiling Verilog into Finite State Machines
- S.-T. Cheng, R. Brayton, G. York, K. Yelick, A. Saldanha.
International Verilog Conference, 1995.
- Abstract,
Postscript available.
-
Using Abstraction in Explicitly Parallel Programs
- Katherine A. Yelick, MIT Laboratory for Computer Science,
July 1991, TR-507. (Revised from PhD Thesis, December 1990.)
- Abstract,
Postscript available.
-
A Generalized Approach to Equational Unification
- Katherine A. Yelick, MIT Laboratory for Computer Science,
August 1985, TR-344.