aca2 01 new
DESCRIPTION
Advanced Computer ArchitectureTRANSCRIPT
![Page 1: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/1.jpg)
Sumit Mittu Assistant Professor, CSE/IT
Lovely Professional University
www.tinyurl.com/askSumit
![Page 2: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/2.jpg)
What is this course all about and
Why study this course?
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 2
![Page 3: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/3.jpg)
CSE539: Advanced Computer Architecture
Chapter 1
Parallel Computer Models
Book: “Advanced Computer Architecture – Parallelism, Scalability, Programmability”, Hwang & Jotwani
Sumit Mittu
Assistant Professor, CSE/IT Lovely Professional University
![Page 4: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/4.jpg)
In this chapter…
• THE STATE OF COMPUTING
• MULTIPROCESSORS AND MULTICOMPUTERS
• MULTIVECTOR AND SIMD COMPUTERS
• PRAM AND VLSI MODELS
• ARCHITECTURAL DEVELOPMENT TRACKS
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 4
![Page 5: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/5.jpg)
THE STATE OF COMPUTING Computer Development Milestones
• How it all started… o 500 BC: Abacus (China) – The earliest mechanical computer/calculating device.
• Operated to perform decimal arithmetic with carry propagation digit by digit
o 1642: Mechanical Adder/Subtractor (Blaise Pascal)
o 1827: Difference Engine (Charles Babbage)
o 1941: First binary mechanical computer (Konrad Zuse; Germany)
o 1944: Harvard Mark I (IBM)
• The very first electromechanical decimal computer as proposed by Howard Aiken
• Computer Generations o 1st 2nd 3rd 4th 5th
o Division into generations marked primarily by changes in hardware and software technologies
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 5
![Page 6: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/6.jpg)
THE STATE OF COMPUTING Computer Development Milestones
• First Generation (1945 – 54) o Technology & Architecture:
• Vacuum Tubes
• Relay Memories
• CPU driven by PC and accumulator
• Fixed Point Arithmetic
o Software and Applications:
• Machine/Assembly Languages
• Single user
• No subroutine linkage
• Programmed I/O using CPU
o Representative Systems: ENIAC, Princeton IAS, IBM 701
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 6
![Page 7: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/7.jpg)
THE STATE OF COMPUTING Computer Development Milestones
• Second Generation (1955 – 64) o Technology & Architecture:
• Discrete Transistors
• Core Memories
• Floating Point Arithmetic
• I/O Processors
• Multiplexed memory access
o Software and Applications:
• High level languages used with compilers
• Subroutine libraries
• Processing Monitor
o Representative Systems: IBM 7090, CDC 1604, Univac LARC
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 7
![Page 8: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/8.jpg)
THE STATE OF COMPUTING Computer Development Milestones
• Third Generation (1965 – 74) o Technology & Architecture:
• IC Chips (SSI/MSI)
• Microprogramming
• Pipelining
• Cache
• Look-ahead processors
o Software and Applications:
• Multiprogramming and Timesharing OS
• Multiuser applications
o Representative Systems: IBM 360/370, CDC 6600, T1-ASC, PDP-8
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 8
![Page 9: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/9.jpg)
THE STATE OF COMPUTING Computer Development Milestones
• Fourth Generation (1975 – 90) o Technology & Architecture:
• LSI/VLSI
• Semiconductor memories
• Multiprocessors
• Multi-computers
• Vector supercomputers
o Software and Applications:
• Multiprocessor OS
• Languages, Compilers and environment for parallel processing
o Representative Systems: VAX 9000, Cray X-MP, IBM 3090
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 9
![Page 10: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/10.jpg)
THE STATE OF COMPUTING Computer Development Milestones
• Fifth Generation (1991 onwards) o Technology & Architecture:
• Advanced VLSI processors
• Scalable Architectures
• Superscalar processors
o Software and Applications:
• Systems on a chip
• Massively parallel processing
• Grand challenge applications
• Heterogeneous processing
o Representative Systems: S-81, IBM ES/9000, Intel Paragon, nCUBE 6480, MPP, VPP500
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 10
![Page 11: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/11.jpg)
THE STATE OF COMPUTING Elements of Modern Computers
• Computing Problems
• Algorithms and Data Structures
• Hardware Resources
• Operating System
• System Software Support
• Compiler Support
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 11
![Page 12: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/12.jpg)
THE STATE OF COMPUTING Evolution of Computer Architecture
• The study of computer architecture involves both the following: o Hardware organization
o Programming/software requirements
• The evolution of computer architecture is believed to have started with von Neumann architecture o Built as a sequential machine
o Executing scalar data
• Major leaps in this context came as… o Look-ahead, parallelism and pipelining
o Flynn’s classification
o Parallel/Vector Computers
o Development Layers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 12
![Page 13: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/13.jpg)
THE STATE OF COMPUTING Evolution of Computer Architecture
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 13
![Page 14: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/14.jpg)
THE STATE OF COMPUTING Evolution of Computer Architecture
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 14
• Fig 1.3
![Page 15: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/15.jpg)
THE STATE OF COMPUTING Evolution of Computer Architecture
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 15
![Page 16: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/16.jpg)
THE STATE OF COMPUTING Evolution of Computer Architecture
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 16
![Page 17: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/17.jpg)
THE STATE OF COMPUTING System Attributes to Performance
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 17
• Machine Capability and Program Behaviour
• Peak Performance
• Turnaround time
• Cycle Time, Clock Rate and Cycles Per Instruction (CPI)
• Performance Factors o Instruction Count, Average CPI, Cycle Time, Memory Cycle Time and No. of memory cycles
• System Attributes o Instruction Set Architecture, Compiler Technology, Processor Implementation and control,
Cache and Memory Hierarchy
• MIPS Rate, FLOPS and Throughput Rate
• Programming Environments – Implicit and Explicit Parallelism
![Page 18: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/18.jpg)
THE STATE OF COMPUTING System Attributes to Performance
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 18
• Cycle Time (processor) 𝜏
• Clock Rate 𝑓 = 1/𝜏
• Average no. of cycles per instruction 𝐶𝑃𝐼
• No. of instructions in program 𝐼𝑐
• CPU Time 𝑇 = 𝐼𝑐 × 𝐶𝑃𝐼 × 𝜏
• Memory Cycle Time 𝑘𝜏
• No. of Processor Cycles needed 𝑝
• No. of Memory Cycles needed 𝑚
• Effective CPU Time 𝑇 = 𝐼𝑐 × (𝑝 + 𝑚 × 𝑘) × 𝜏
![Page 19: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/19.jpg)
THE STATE OF COMPUTING System Attributes to Performance
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 19
• MIPS Rate
• 𝜏 =𝐼
𝑐
𝑇×106=
𝑓
𝐶𝑃𝐼×106 =𝑓×𝐼
𝑐
𝐶×106
• Throughput Rate
• 𝑊𝑝 =𝑀𝐼𝑃𝑆×106
𝐼𝑐
=𝑓
𝐶𝑃𝐼×𝐼𝑐
![Page 20: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/20.jpg)
THE STATE OF COMPUTING System Attributes to Performance
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 20
• A benchmark program contains 450000 arithmetic instructions, 320000 data transfer instructions and 230000 control transfer instructions. Each arithmetic instruction takes 1 clock cycle to execute whereas each data transfer and control transfer instruction takes 2 clock cycles to execute. On a 400 MHz processors, determine: o Effective no. of cycles per instruction (CPI)
o Instruction execution rate (MIPS rate)
o Execution time for this program
![Page 21: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/21.jpg)
THE STATE OF COMPUTING System Attributes to Performance
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 21
Performance Factors
System Attributes
Instruction Count (Ic)
Average Cycles per Instruction (CPI) Processor Cycle Time (𝜏)
Processor Cycles per Instruction (CPI and p)
Memory References per Instruction (m)
Memory Access Latency (k)
Instruction-set Architecture
Compiler Technology
Processor Implementation and Control
Cache and Memory Hierarchy
![Page 22: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/22.jpg)
Multiprocessors and Multicomputers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 22
• Shared Memory Multiprocessors o The UMA Model
o The NUMA Model
o The COMA Model
o The CC-NUMA Model
• Distributed-Memory Multicomputers o The NORMA Machines
o Message Passing multicomputers
• Taxonomy of MIMD Computers
• Representative Systems o Multiprocessors: BBN TC-200, MPP, S-81, IBM ES/9000 Model 900/VF,
o Multicomputers: Intel Paragon XP/S, nCUBE/2 6480, SuperNode 1000, CM5, KSR-1
![Page 23: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/23.jpg)
Multiprocessors and Multicomputers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 23
![Page 24: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/24.jpg)
Multiprocessors and Multicomputers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 24
![Page 25: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/25.jpg)
Multiprocessors and Multicomputers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 25
![Page 26: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/26.jpg)
Multiprocessors and Multicomputers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 26
![Page 27: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/27.jpg)
Multiprocessors and Multicomputers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 27
![Page 28: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/28.jpg)
Multivector and SIMD Computers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 28
• Vector Processors o Vector Processor Variants
• Vector Supercomputers
• Attached Processors
o Vector Processor Models/Architectures
• Register-to-register architecture
• Memory-to-memory architecture
o Representative Systems:
• Cray-I
• Cray Y-MP (2,4, or 8 processors with 16Gflops peak performance)
• Convex C1, C2, C3 series (C3800 family with 8 processors, 4 GB main memory, 2 Gflops peak performance)
• DEC VAX 9000 (pipeline chaining support)
![Page 29: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/29.jpg)
Multivector and SIMD Computers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 29
![Page 30: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/30.jpg)
Multivector and SIMD Computers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 30
• SIMD Supercomputers o SIMD Machine Model
• S = < N, C, I, M, R >
• N: No. of PEs in the machine
• C: Set of instructions (scalar/program flow) directly executed by control unit
• I: Set of instructions broadcast by CU to all PEs for parallel execution
• M: Set of masking schemes
• R: Set of data routing functions
o Representative Systems:
• MasPar MP-1 (1024 to 16384 PEs)
• CM-2 (65536 PEs)
• DAP600 Family (up to 4096 PEs)
• Illiac-IV (64 PEs)
![Page 31: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/31.jpg)
Multivector and SIMD Computers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 31
![Page 32: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/32.jpg)
PRAM and VLSI Models
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 32
• Parallel Random Access Machines o Time and Space Complexities
• Time complexity
• Space complexity
• Serial and Parallel complexity
• Deterministic and Non-deterministic algorithm
o PRAM
• Developed by Fortune and Wyllie (1978)
• Objective:
o Modelling idealized parallel computers with zero synchronization or memory access overhead
• An n-processor PRAM has a globally addressable Memory
![Page 33: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/33.jpg)
PRAM and VLSI Models
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 33
![Page 34: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/34.jpg)
PRAM and VLSI Models
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 34
• Parallel Random Access Machines o PRAM Models
o PRAM Variants
• EREW-PRAM Model
• CREW-PRAM Model
• ERCW-PRAM Model
• CRCW-PRAM Model
o Discrepancy with Physical Models
• Most popular variants: EREW and CRCW
• SIMD machine with shared architecture is closest architecture modelled by PRAM
• PRAM Allows different instructions to be executed on different processors simultaneously. Thus, PRAM really operates in synchronized MIMD mode with shared memory
![Page 35: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/35.jpg)
PRAM and VLSI Models
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 35
• VLSI Complexity Model o The 𝑨𝑻𝟐 Model
• Memory Bound on Chip Area
• I/O Bound on Volume 𝑨𝑻
• Bisection Communication Bound (Cross-section area)
𝑨 𝑻 • Square of this area used as lower
bound
![Page 36: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/36.jpg)
Architectural Development Tracks
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 36
![Page 37: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/37.jpg)
Architectural Development Tracks
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 37
![Page 38: Aca2 01 new](https://reader034.vdocuments.us/reader034/viewer/2022050801/54817874b4af9fb9158b6073/html5/thumbnails/38.jpg)
Architectural Development Tracks
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 38