# Systems Design Volume II of VLSI Systems Design for Digital Signal Processing

B. A. Bowen

W. R. Brown

# Systems Design Volume II of VLSI Systems Design for Digital Signal Processing

#### B. A. Bowen

Department of Systems and Computer Engineering Carleton University Ottawa



Prentice-Hall, Inc., Englewood Cliffs, New Jersey 07632

Library of Congress Cataloging in Publication Data (Revised for vol. 2)

BOWEN, B. A. (date)

VLSI systems design for digital signal processing.

Includes bibliographies and indexes. Contents: v. 1. Signal processing and signal processors—v. 2. Systems design.

1. Signal processing—Digital Techniques.
2. Integrated circuits—Very large scale integration.
1. Brown, W. R. (William Roy) 11. V.L.S.I. systems design for digital signal processing. III. Title.
TK7868.D5B68 1982 621.3819'592 81-1385-

ISBN 0-13-942706-6 (v. 1) ISBN 0-13-881418-X (v. 2)

LV61114

Editorial/production supervision and interior design: Diana Drew

Cover design: Bruce Kenselaar

Manufacturing buyer: Gordon Osbourne

© 1985 by Prentice-Hall, Inc., Englewood Cliffs, New Jersey 07632

All rights reserved. No part of this book may be reproduced, in any form or by any means, without permission in writing from the publisher.

The author and publisher of this book have used their best efforts in preparing this book. These efforts include the development, research, and testing of the theories and programs to determine their effectiveness. The author and publisher make no warranty of any kind, expressed or implied, with regard to these programs or the documentation contained in this book. The author and publisher shall not be liable in any event for incidental or consequential damages in connection with, or arising out of, the furnishing, performance, or use of these programs.

Printed in the United States of America

10 9 8 7 6 5 4 3 2 1

#### D-13-881418-X Ol

Prentice-Hall International (UK) Limited, London
Prentice-Hall of Australia Pty. Limited, Sydney
Editora Prentice-Hall do Brasil, Ltda., Rio de Janeiro
Prentice-Hall Canada Inc., Toronto
Prentice-Hall Hispanoamericana, S.A., Mexico
Prentice-Hall of India Private Limited, New Delhi
Prentice-Hall of Japan, Inc., Tokyo
Prentice-Hall of Southeast Asia Pte. Ltd., Singapore
Whitehall Books Limited, Wellington, New Zealand

# **Dedication**

This book is dedicated to all "systems designers," those noble souls who reverse the natural laws of entropy and create structure in the universe.

Our approach to design is predicated on a natural law which we jointly discovered,

If you don't know where you are going, you won't know when you get there

with several corollaries, the most notable being

C:1 (Brown's corollary):

It won't matter in which direction you go.

C:2 (Bowen's observation)

Murphy will be your travelling companion.

Thus, what is contained in this book is a simple message: You can't design (or choose) a complex piece of hardware to execute a complex algorithm unless you know what you want done—in detail.

This book offers designers four things:

A methodology for charting your course; A tactical arsenal for getting you there; Some mechanisms for helping you along the way; and Some means of proving that you have arrived.

# Global Table of Contents

# Volume 1: Signal Processing and Signal Processors Introduction to Volume I

- Digital Signals
- 2 Linear Systems and Digital Filters
- 3 Detection and Estimation
- Digital Signal Processing Algorithms and Techniques
- **5** From Processing to Processors
- 6 Performance Measures and Limitations
- 7 Signal Processors

## **Volume 2: Systems Design**

### Introduction to Volume II

- I From Processing to Processors—An Overview
- 2 Modelling and Describing Concurrency
- 3 Design Philosophies
- Logical Design: Issues and Pragmatics

#### GLOBAL TABLE OF CONTENTS

- Synthetic Aperture Radar: A Logical Design Case Study
- **6** Hardware Selection
- Partitioning and Allocation
- Processor Architectures: Quantitative Analysis
- Design Issues

Appendix: Ada Code for Basis Operations

# Acknowledgments

Professor Raymond Buhr and Dennis Mackinnon provided us with an introduction to Ada and convincing evidence of its (her) suitability as a descriptive language for systems in general. Much of the structure in Chapter 2, and many of the examples, has been profoundly influenced by their perspectives.

Milan Kutchta extended our early ideas of architectural efficiency in a project undertaken as part of his M. Eng. program. His insight and ruthless logic provided a sharp focus on our presentation of this subject.

Examples of designs were worked out by several graduate students; Spiros Boucouris, Ravikanti Devender, Rafik Gourban, and Kostas Siamolus spent many hours checking results and proposing changes.

There are many others who survived several graduate courses based on emerging notes, and we acknowledge the influence of everyone with whom we talked (perhaps the word is harangued) about our philosophy and approach. As we may have influenced them, they profoundly influenced us.

# Introduction to Volume II

٠,

This volume is devoted to establishing the following: first, that the design of systems is a process; and, second, that an appropriate design methodology, with all the support mechanisms, is necessary to overcome many of the problems inherent in this process. Most emphatically, this volume is not concerned with myriad implementation details at the electrical level. It is concerned with the logical progression of steps from the initial requirements analysis to the choice of the functional components. Below this, another kind of expertise is required, which is beyond the scope of this volume.

In Volume I, an extensive tutorial review of signal processing theory was presented. This review exposed the computational requirements of signal processing algorithms, as derived from the underlying mathematical studies. A cursory review of processors was also undertaken to demonstrate the mechanisms used to achieve high performance, constrained by technology and perhaps by fundamental architectural concepts.

In Volume II, we present an additional set of essential concepts which are a necessary part of every system designer's arsenal. We believe it is exceedingly important that the reader appreciate both the perspective that generated the design philosophy and the structure of the material to be presented before he/she proceeds. To achieve this appreciation, Chapter 1 restates the major elements of the material presented in Chapter 5 of Volume I, followed by an examination of the structure and philosophy of this volume.

# **Table of Contents**

Preface xiii

| Introduction to Volume II xvii                                                                                                                                                                                                                                                                                      |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| From Processing to Processors—                                                                                                                                                                                                                                                                                      |
| An Overview 1                                                                                                                                                                                                                                                                                                       |
| 1.0 Introduction 1 1.1 The Rearview Mirror: A Perspective of Volume I 2 1.1.0 Introduction 2 1.1.1 Signal Modelling 2 1.1.2 Basis Operations 3 1.1.3 Performance Requirements and Constraints 5 1.1.4 Perspective 6 1.1.5 Algorithms, Architecture, and VSLI 7 1.2 An Outline of Volume II 10 1.2.0 Introduction 10 |
|                                                                                                                                                                                                                                                                                                                     |

## 2 Modelling and Describing Concurrency 14

- 2.0 Introduction 14
- 2.1 Logical Components of Concurrent Systems 14
  - 2.1.0 Introduction 15
  - 2.1.1 Concurrency 16
  - 2.1.2 Processes 18

33

| 2.1.3 Critical Regions, Flags, and Semaphores 19        |   |
|---------------------------------------------------------|---|
| 2.1.4 Monitors 25                                       |   |
| 2.1.5 The Kernel: A Virtual Machine 28                  |   |
| 2.1.6 Hardware 30                                       |   |
| 2.1.7 Access Graphs 31                                  |   |
| 2.1.8 Summary 32                                        |   |
| 2.2 A Concurrent Systems Language: Ada 33               |   |
| 2.2.0 Introduction 33                                   |   |
| 2.2.1 The Logical Structure of Ada 34                   |   |
| 2.2.2 Packages, Tasks, and Rendezvous 36                |   |
| 2.2.3 Structure Diagrams 42                             |   |
| 2.2.4 Summary 46                                        |   |
| 2.3 Chapter Summary 46                                  |   |
| 2.4 References 47                                       |   |
|                                                         |   |
| Design Philosophies 48                                  |   |
| 3.0 Introduction 48                                     |   |
| 3.1 Classical Approaches to Design 49                   |   |
| 3.1.0 Introduction 49                                   |   |
| 3.1.1 A General System Perspective 50                   |   |
| 3.1.2 Structural Attributes of Design Methodologies 54  |   |
| 3.1.3 Design Strategies 57                              |   |
| 3.1.4 Design Tactics 64                                 |   |
| 3.1.5 Design Mechanisms 65                              |   |
| 3.1.6 General Features of Design Methodologies 67       |   |
| 3.1.7 Summary 68                                        |   |
| 3.2 A General Systems Design Methodology 69             |   |
| 3.2.0 Introduction 69                                   |   |
| 3.2.1 Edges-in Design 71                                |   |
| 3.2.2 Multileveled Systems Design 74                    |   |
| 3.2.3 Requirements Analysis and System Specification 78 |   |
| 3.2.4 Logical Design 89                                 |   |
| 3.2.5 Partitioning and Allocation to Hardware 96        |   |
| 3.2.6 Automated Design And Design Aids 99               |   |
| 3.2.7 Summary 101                                       |   |
| 3.3 Chapter Summary 102                                 |   |
| 3.4 Exercise 103                                        |   |
| 3.5 References 106                                      |   |
| J.J. Moleichicos 100                                    |   |
| Logical Design: Issues and Pragmatic                    |   |
| weren mearem raanca ama rraemana                        | • |

# 4

- 4.0 Introduction 107
- 4.1 A Logical System Perspective 109 4.1.0 Introduction 109

| 4.1.1  | Establishing the System Context 112         |     |
|--------|---------------------------------------------|-----|
| 4.1.2  | User-Visible System Operation 114           |     |
|        | Logical System Organization 118             |     |
| 4.1.4  |                                             |     |
| 4.2 Da | ta-Flow Analysis 121                        |     |
|        | Introduction 121                            |     |
| 4.2.1  | Data-Flow Graphs 122                        |     |
|        | Data Elements 125                           |     |
|        | Data-Flow Functions 127                     |     |
|        | Loading Analysis 129                        |     |
|        | Partitioning Considerations 132             |     |
|        | Data-Flow Machines 135                      |     |
| 4.2.7  | Summary 141                                 |     |
|        | tem Control 142                             |     |
| •      | Introduction 142                            | •   |
| 4.3.1  | From Data-Flow Graphs to Structure Diagrams | 143 |
|        | Control Requirements 144                    |     |
|        | Control Levels 145                          |     |
| 4.3.4  | Control Alternatives 146                    |     |
|        | Control Mechanisms 150                      |     |
|        | Control Organization 151                    |     |
| 437    | Structure Diggrams 152                      |     |

#### S Synthetic Aperture Radar:

4.3.8 Execution Graphs 166

## A Logical Design Case Study 171

- 5.0 Introduction 171
- 5.1 System Context 172

4.3.9 Summary 166 4.4 Chapter Summary 166

- 5.1.0 Introduction 172
- 5.1.1 SAR Processing 172
- 5.1.2 A Satellite SAR System 175
- 5.1.3 System Operation 1775.1.4 System Context Diagram 178
- 5.1.5 Summary 179
- 5.2 Data-Flow Analysis
  - 5.2.0 Introduction 180
  - 5.2.1 Data-Flow Decomposition 180

  - 5.2.2 Loading Analysis 1935.2.3 Partitioning Alternatives 207
  - 5.2.4 Summary 224

6

|        | ructure Diagrams and System Control 230            |
|--------|----------------------------------------------------|
|        | Introduction 230                                   |
| 5.3.1  | System Structure Diagrams: Top-Down 231            |
|        | System Structure Diagram: Bottom-Up 236            |
|        | Summary 240                                        |
| 5.4 Ch | apter Summary 240                                  |
|        |                                                    |
| Hardy  | vare Selection 242                                 |
|        | roduction 242                                      |
|        | ardware Selection—an Overview 243                  |
|        | Introduction 243                                   |
|        | Hardware Elements and Architecture Alternatives 24 |
| €.1.2  | Partitioning and Allocation 246                    |
| 6.1.3  | A Selection Algorithm 247                          |
| 6.1.4  | Summary 249                                        |
| 6.2 Mu | ultiple-Processor Architectures 250                |
| 6.2.0  | Introduction 250                                   |
| 6.2.1  | Elements of Distributed Systems 252                |
| 6.2.2  | Interconnection Network Topologies 253             |
| 6.2.3  | Interconnection Topologies for DSP 260             |
| 6.2.4  | Overhead and Bottlenecks 263                       |
| 6.2.5  | Summary 265                                        |
| 6.3 Ha | rdware Structure Diagrams 266                      |
| 6.3.0  | Introduction 266                                   |
| 6.3.1  | Physical System Elements 269                       |
|        | Interconnection Topology Representation 271        |
|        | A Simple von Neumann Processor 275                 |
|        | The T-ASP 277                                      |
| 6.3.5  | Summary 283                                        |
|        | apter Summary 286                                  |
|        |                                                    |
| Partit | ioning and Allocation 288                          |
| ,      |                                                    |
|        | roduction 288                                      |
|        | rtitioning and Allocation—an Iterative Process 289 |
|        | Introduction 289                                   |
|        | Partitioning Approaches 290                        |
|        | Allocation to Hardware 291                         |
|        | A Simple Performance Model 294                     |
|        | An Iterative Approach 297                          |
|        | Summary 299                                        |
|        | e SAR System 299                                   |
| 7.2.0  | Introduction 299                                   |

9

| 7.2.1 Partitioning the SAR Structure Diagram 300 7.2.2 Allocation to a Hardware Architecture 300 7.2.3 Evaluation and Discussion 309 7.2.4 Summary 315 7.3 Chapter Summary 316 7.4 Reference 318                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Processor Architectures: Quantitative                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
| Analysis 319                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| 8.0 Introduction 319 8.1 Architectural Efficiency 320 8.1.0 Introduction 320 8.1.1 Performance and Efficiency 321 8.1.2 Algorithms and Efficiency 324 8.1.3 Matching Algorithms to Architectures 325 8.1.4 Allocation and Selection Mechanisms 331 8.1.5 Summary 333 8.2 Mapping Algorithms to Hardware 333 8.2.0 Introduction 333 8.2.1 A Mapping Procedure 324 8.2.2 Basis Operations: Computational Mapping Examples 338 8.2.3 Radix-N FFT: Examples 350 8.2.4 Summary 352 8.3 Basis Operation Processors 353 8.3.0 Introduction 353 8.3.1 FIR Filters 353 8.3.2 IIR Filters 354 8.3.3 FFT Butterfly 362 8.3.4 Multifunction Architectures 370 8.3.5 Summary 374 8.4 Chapter Summary 378 8.5 Exercises 379 |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| Design Issues 381                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |
| <ul> <li>9.0 Introduction 381</li> <li>9.1 The Design Environment: The Real World 381</li> <li>9.2 The Design Approach 382</li> <li>9.3 User Requirements and Specifications 382</li> <li>9.4 Functional Partitions 383</li> <li>9.5 Hardware Allocation 384</li> </ul>                                                                                                                                                                                                                                                                                                                                                                                                                                       |

- 9.6 Implementation 385
- 9.7 Validation: Acceptance Criteria 385
- 9.8 The Moving Target: Technology 386
- 9.9 The Principles of Design 386
- 9.10 Summary 387

# Appendix: Ada Code for Basis Operations 388

- A.1 Basic Functions 388
- A.2 Data Manipulation Functions 394
- A.3 Input/Output Functions 397
- A.4 Control and Synchronization Functions 401

### Index 408

# Preface

In Volume I the basis of digital signal processing and high-performance digital processor architectures was reviewed. In this volume we introduce a comprehensive design methodology for creating digital signal processing systems; this design methodology is self-contained and applies to the design of processing systems in general.

Design involves, at many stages, an "intellectual leap" in order to conceive and create new structures where none existed before. This leap cannot be taught. In a sense, all that can be taught is the location and the structure of sound platforms from which to leap—and also, perhaps, how to recover gracefully from landing in a swamp.

Many portions of this book represent attempts to convey a philosophy of systems design. Philosophical discourse, as with other such high-level human affairs, is difficult and will invariably provoke disagreements. In a very large sense, it is unimportant whether you agree with our approach or not—the important issue is that, as a systems designer, you must establish such a perspective. Disagreeing with someone else's approach often provides the synergism for establishing your own version.

The success of your designs will provide the final judgment: You can't argue with the facts, although you can occasionally cloud the issue.

# From Processing to Processors—An Overview

#### 1.0 INTRODUCTION

This book is the second of a two-volume set. The overall purpose of these two volumes is to present a coherent philosophy and methodology for the design of high-performance digital signal processing systems that exploit the advances being made in very large scale integration (VLSI) circuit technology.

In Volume I, we presented an overview of the basic theory of digital signal processing, an introduction to the basic concepts of high-performance processors, and a brief examination of the evolution of digital signal processors and integrated circuit technology growth trends. In this volume, we are concerned with bridging the gap between theoretical problem analysis and the physical implementation of the theory, through a systematic process of systems design.

Part A of Volume I examined the basic concepts of digital signals and the mathematical theory on which digital signal processing is based. We also reviewed some common techniques and algorithms of digital signal processing. Part A concluded with a discussion of the general attributes of any particular digital signal processing application area and introduced the overall problem of moving from theory to implementation.

The observation that the majority of digital signal processors implement a small set of basic processing operations leads to a particular perspective of the design problem. These processing operations have traditionally been further broken down to expose their dependence on the repetitious execution of the fundamental arithmetic operations of multiplication and addition. However, recent advances in integrated circuit technology have led to the availability of components of far greater complexity than simple multipliers and adders.

Thus, digital signal processor design efforts must concentrate on achieving higher throughput rates for processors organized around the execution of more complex fundamental operations. We note, in addition, that the rate of advance-

ment of integrated circuit technology has been exponential for over a decade, and projections indicate that it will remain so for at least the immediate future. Digital components of both very high execution speed and very high functional complexity are assured. Thus, the overall problem of processor design, or processing system design, seems more appropriately viewed in terms of an algorithm partitioning more closely aligned with the functional complexity of modern components, that is, viewed from the perspective of creating interconnected structures of VLSI components that in many cases are processors in their own right. From such a perspective emerges the design philosophy and methodology which forms the subject matter of this volume

In this chapter, we review the perspective, established in Volume I, of current trends in signal processing and signal processors and then outline the structure and philosophy of this volume.

# 1.1 THE REARVIEW MIRROR: A PERSPECTIVE OF VOLUME I

#### 1.1.0 Introduction

The move from signal processing theory to physical implementation is invariably made within the context of some particular application area or range of related applications. Indeed, the number of specific fields, such as radar, sonar, image processing, speech processing, and digital communications, to name only a few, in which digital signal processing techniques are being applied has been steadily increasing. The primary reasons for this proliferation are, of course, the steadily increasing performance capabilities of digital systems and the declining cost/performance ratios.

In Chapter 5 of Volume I, we noted several general attributes of an intended application that influence the selection or design of implementation hardware. These attributes were categorized in terms of signal models, the functions required to manipulate the signals, and the performance requirements and constraints imposed. We will review, briefly, these attributes in the following.

#### 1.1.1 Signal Modelling

Two general aspects of signal modelling influence implementation decisions: basic signal parameters and information content modelling. The basic signal parameters such as frequency content, dynamic range, and signal-to-noise ratio requirements drive decisions on digital signal representation such as sample rate, sample quantization, anti-alias filtering, etc. The information-content model assumed for a digital signal forms the basis for determining what processing operations are to be performed on it. The information-content model of a signal is directly related to the application and specifies such characteristics as whether the signal is considered

to be a random or a deterministic signal. This form of signal modelling is often concerned with the assumed characteristics of the signal source, the transmission medium, and transducer characteristics. From these signal models come the definitions of the required processing algorithms. Modelling is, therefore, the most fundamental part of the whole structure of deriving a suitable processor. We will not be concerned with the activity of signal modelling in this book—however, this is not to understate its importance. Indeed, we shall see that system modelling is the fundamental basis of system design.

#### 1.1.2 Basis Operations

Based on the signal models and the specific problems or goals of the various application areas, the required sequences of processing operations are formulated in terms of signal processing theory. This specification of processing requirements is carried out in terms of the basic mathematical tools of signal processing theory, as discussed in detail in Part A of Volume 1. In general, the specification of the signal processing requirements amounts to the specification of the order in which the signal is to be manipulated to get the information of interest into a desired form or representation. The actual signal manipulations tend to be based on a relatively small set of basic signal processing operations such as convolution, correlation, difference-equation calculations, discrete Fourier transform (DFT) coefficient calculations, vector or matrix arithmetic operations, etc. The problem at hand must be specified in terms of the appropriate combination of these operations to accomplish the required processing.

This view of processing requirements specification is of central importance, since it provides a common set of processing function types needed to carry out digital signal processing. These common processing functions, which we designate as the basis operations of digital signal processing (DSP), are summarized in the following list.

#### 1. Difference equation calculations

$$y(n) = \sum_{k=0}^{N} a_k x(n-k) - \sum_{k=1}^{M} b_k y(n-k)$$
 (1.1)

This equation represents the general computational requirement for a recursive or infinite impulse response (IIR) filtering operation, where x(n) is the input sequence and y(n) is the filtered output sequence. The parameters N, M,  $a_k$ , and  $b_k$  define the actual transfer function or equivalently the phase and amplitude response of the filter. If the coefficients  $b_k$  are all zero, then equation (1.1) reduces to the familiar form of a finite convolution sum representing a nonrecursive or finite impulse response (FIR) digital filtering operation:

$$y(n) = \sum_{k=0}^{N} a_k x(n-k)$$
 (1.2)