agenda : chip-level issuesbwrcs.eecs.berkeley.edu/classes/ee290c_s04/lectures/lect19_serpar... ·...

13
EE290C - Spring 2004 Advanced Topics in Circuit Design High-Speed Electrical Interfaces Lecture #19 Chip-Level Issues Serial/Parallel tradeoffs Clock distribution Jared Zerbe 3/30/04 2 Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key differentiators Multi-drop busses On-chip clocking Chip clock distribution, termination, jitter Failover clocking

Upload: vuanh

Post on 03-Mar-2019

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

1

EE290C - Spring 2004Advanced Topics in Circuit DesignHigh-Speed Electrical Interfaces

Lecture #19Chip-Level Issues

Serial/Parallel tradeoffsClock distribution

Jared Zerbe3/30/04

2

Agenda : Chip-Level IssuesSerial & parallel

Points of viewSharing functions & key differentiators

Multi-drop bussesOn-chip clocking

Chip clock distribution, termination, jitterFailover clocking

Page 2: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

2

3

Generic View : Point to Point Parallel Interfaces vs. Serial Links

Parallel InterfacesLinks logically grouped (bus)

Source-synchronous clock

Skew between links an issue

Serial LinksIndependent connections

Clock embedded with dataRequires encoding scheme overhead e.g. 8B/10BAsynchronous clocking

No link to link skew issues

Clock

Data

Clock

Data

Clock

Data I/O I/OData +Clock Clock

Data

Clock

Data I/O I/O

4

Serial/Parallel : Point of ViewThe best engineering solution doesn’t always winWhat makes sense in a system all depends on your point of view…Typical systems using links fall into one of three catagories

1. Legacy upgrades2. Future-legacy3. Green-field

Page 3: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

3

5

Legacy upgradeMake my existing system run faster

Have to live with environment available…including not only channel but clocks

You can have a 5-10 year span from BP design to Si!Interested in scaling performance while changing nothing else

Power, area, pins, etcTypically require low-performance mode to support legacy linksConventionally generations are 4x in performance

2x happens sometimes but is rareNot much parallel-bus scaling

6

Future LegacyGive me something that is “standard now”

…but can be upgraded laterAKA

“I don’t want to spend a lot of $$”“I don’t want to build a system which is going to be obsolete”

Not usually leading-edge performanceWeak parallel standards usually limit use of parallel busses here

Page 4: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

4

7

Green Field DesignA blank sheet or “green field” to design

Can look at completely new topology/architectureUsually interested in figure of merit

Highest Gb/mW or Gb/mm^2 or Gb/cm^2 or Gb/$$Cross-sectional bandwidth thru PCB

Doesn’t always mean cost-insensitive!Parallel can win here by sharing functions

Shared structures can be more efficientSystem vs. Silicon complexity can be traded off

8

Simple Parallel Clock Sharing

Source – synchronous clock, simple distributionIssue

Board data skews, on-chip clock tree skews limit to ~1-1.5G

Page 5: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

5

9

Parallel Sharing : PLL Phasors

Shared PLLsCan use a common-PLL and distribute phase-vectors to each linkEach link needs a phase-mixer/CDR for recovery

Yeung, JSSC 11/2000

10

Parallel Sharing : CDR Outputs

Shared CDRsSingle link phase-tracks, others follow

Page 6: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

6

11

Parallel Sharing : Power Supplies

Can regulate an entire shared power supplyExpensive, but has a big gain

An regulated inverter has %UI delay, not ps delay

Wei, JSSC 11/2000

12

Serial / Parallel tradeoffs

Shared EqualizationEqualization : similar traces mean similar S21’sStill requires a FIR per channel; can share coefficients

CommonCoefficients

Page 7: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

7

13

Serial & Parallel : Key Differentiators

Latency : a significant differenceSerialization and deserialization takes time

If the data is inherently parallel, it takes less time to send it parallelReally a line-rate vs. on-chip clock rate issuePlesiochronous often = coding = high-latency

When is this bit…

…available here?

14

Key Differentiators Con’tBandwidth granularity

What is the smallest chunk of bandwidth required?Parallel busses are big blocks

Source data : aggregation from many sources or one?CDRs : Plesiochronous or Mesochronous sources?

Cost functionChip to chip? Are chip I/O’s “free”?Board-to-board? Are connector pins expensive?Simple issues like this can greatly affect the tradeoff

Page 8: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

8

15

Agenda : Chip-Level IssuesSerial & parallel

Points of viewSharing functions & key differentiators

Multi-drop bussesOn-chip clocking

Chip clock distribution, termination, jitterFailover clocking

16

Multi-drop busses

Multiple devices sharing same interconnectEach driver sees Zo/2 when it drivesTopology is good for memory interconnect, some multiprocessor interconnectCan be very efficient use of wires viz. connectivityMust watch stub discontinuities

Page 9: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

9

17

Multi-Drop Channel Characteristics

Attenuation gets much worse as devices are addedMultiple poles at same frequency

Loss can be terrible at high speeds

0.00

0.20

0.40

0.60

0.80

1.00

1e+008 1e+009

Nor

mal

ized

am

plitu

de

Freq (Hz)

"low_CL""1_drop""2_drop""4_drop""8_drop"

18

Direct Rambus : Single-Ended + Reference

Single-ended open-drain current-mode driversVery efficient on device pin-count, channel-wiring Large amount of bandwidth from a single very small device

Common referenceNoise limits : how well this functions viz. differential

Page 10: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

10

19

Direct Rambus : Clocking

Source-synchronous clocking in both directionsDLLs remove common on-chip buffer delaysMinimized clock & data skews within the bus

20

Agenda : Chip-Level IssuesSerial & parallel

Points of viewSharing functions & key differentiators

Multi-drop bussesOn-chip clocking

Chip clock distribution, termination, jitterFailover clocking

Page 11: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

11

21

On Chip Clock Distribution

On a big chip there is distribution both before PLLsand after PLLsWhat is the PLL jitter characteristic?

LC VCO – notchRing VCO – Low-pass

Ref Clk PhasedetectorKpd

Icp

Icp R

C

VCOKvco/s

Clockbuffer

+−

105 106 107 108 109 1010

-30

-20

-10

0

10

frequency [Hz]

Noi

se tr

ansf

er fu

nctio

ns [d

B]

fromVCO supply

frominput clock

fromclock buffer supply

22

Why Does This Matter?

Distribution = buffering = delayDelay in a clock buffer = PSR jitter succeptability

CMOS : ~1% Vdd variation = ~1% in delay variationKeep your buffers short & your power supplies clean

Page 12: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

12

23

Buffered Distribution to PLLsPLLs with ring VCOs can filter high frequency buffer noise

But pass low frequency noiseSolution

Eliminate bufferingDesign buffers with low-frequency noise removed

24

Mansuri Clock Buffer Idea for PSR

Mansuri, JSSC 11/2003

Page 13: Agenda : Chip-Level Issuesbwrcs.eecs.berkeley.edu/Classes/EE290C_S04/lectures/Lect19_SerPar... · Agenda : Chip-Level Issues Serial & parallel Points of view Sharing functions & key

13

25

Clock Buffer Delay vs. VSG

26

Failover Clocking

Critical in some systems where reliability is keyNo single point of failure

Multiple clock phases are averaged to create input to PLL phase-detector