next up previous
Next: Introduction

Constraints on Parallelism Beyond
10 Instructions Per Cycle

John G. Cleary, Richard H. Littin, J. A. David McWha and Murray W. Pearson
Dept. of Computer Science, University of Waikato,
Private Bag 3105, Hamilton, New Zealand
{jcleary, rhl, jadm, mpearson}@cs.waikato.ac.nz

Abstract:

The problem of extracting Instruction Level Parallelism at levels of 10 instructions per clock and higher is considered. Two different architectures which use speculation on memory accesses to achieve this level of performance are reviewed. It is pointed out that while this form of speculation gives high potential parallelism it is necessary to retain execution state so that incorrect speculation can be detected and subsequently squashed. Simulation results show that the space to store such state is a critical resource in obtaining good speedup. To make good use of the space it is essential that state be stored efficiently and that it be retired as soon as possible. A number of techniques for extracting the best usage from the available state storage are introduced.

Keywords: instruction level parallelism, speculation




Submitted to the 25th International Symposium on Computer Architecture (ISCA'98).
next up previous
Next: Introduction

Richard H Littin
Tue Nov 25 14:43:27 NZDT 1997