a new data concentrator for the cms muon barrel track finder

1
A New Data Concentrator for the CMS Muon Barrel Track Finder A. Triossi, M. Bellato, R. Isocrate, F. Montecassiano, S. Ventura – INFN Padova The muon trigger upgrade is not expected to be completed before the LHC resumes operation after LS1. Thus it is imperative to be able to commission the new muon trigger in parallel with the operation of the current trigger. For that reason, the trigger primitives from the CSC and RPC systems will be fully split into two paths, and those from the DT system will be split from a slice of the system. A portion of the links coming from the minicrates will be equipped with passive optical splitters in order to deploy and validate the TwinMux board. The portion will cover three sectors over two wheels, for a total of 192 split links, so to enclose all possible neighbour correlations that the track finding algorithm has to explore to correlate track segments Solder mask removed from one PCB (scrap of production). We made sure that PCBs were manufactured at 22 degrees relative to the wave of the dielectric material in order not to be parallel to high speed lanes Follow Us You can get updates on TwinMux developing and debugging here: https://cern.ch/CMS-TM7 https://cern.ch/TwinMux Firmware Blocks CSC DT RPC MPC SC LB CSC sorter DT sorter RPC sorter CSCTF DTTF PAC GMT CSC DT RPC Splitte r TwinMux Concent . GMT Endcap TF & sorter Barrel TF & sorter Overlap TF & sorter MPC mezzanin e CuOF OFCu LB Muon Trigger Upgrade Strategy The present muon trigger preserves the complementarity and redundancy of the three separate muon detection systems until they are combined at the input to the Global Trigger. In contrast, the upgrade to the muon trigger will utilize the redundancy of the three muon detection systems earlier in the trigger processing chain so as to obtain a more performant trigger with higher efficiency and better rate reduction. Recognizing that every additional hit along a muon trajectory further improves the fake rejection and muon momentum measurement, the upgrade seeks to combine muon hits at the input stage to the Muon Track-Finder layer rather than at its output. The segmentation of the track processing will still be done regionally in sectors of φ and η. For the upgrade it is expected a separate treatment of transition region between barrel and endcap (|η| ~ 1) in the algorithms and in the deployment of hardware processors as well. Therefore the processing will be subdivided in azimuth as well as in barrel, overlap, and endcap regions. The final sorting and ghost cancellation of muon candidates is also expected to be handled separately for each of the three regions in η 0.8 η 1.2 η Barr el Overla p Endcap TwinMux Splitte r. Overlap TF & sorter CuOF OFCu LB A further way for exploiting earlier the redundancy of the DT- RPC system consists in combining the primitive of the two systems in a “super” primitive at the TwinMux level. In such a case each TwinMux will receive the trigger data of one DT and one RPC sector. Virtex 7 XC7VX330T Receiver GTH Sel ect IO GTH De-mux Level Translator CML to LVD S Transmit ter Receiver MiniPod Snap12 Top Ground GTH_1 Ground GTH_2 Ground Power Split_1 Power Split_2 Ground Bottom Power Power TwinMux Timeline Where we are? Schematics Stack-up Layout Simulation Prototype Slice Splitting Minicrat e Cu OF OF Cu Sector Collecto r TwinMux DTTF MBTF USC UXC Optical Splitter o No schedule interference for achieving validation tests o 2015 – validation on a slice: 6 sectors (bottom part of YB-2 and YB-1) o 2016 – full deployment From CuOF To SC To TwinMux TwinMux will be a single slot double-width full-height μTCA board, equipped with 6 front panel SNAP12 RX and 2 Minipods TX for high speed data transmission (up to 13Gb/s). Based around a Xilinx Virex-7 FPGA, the TwinMux must achieve the merging of several 480Mb/s trigger links to higher speed serial links and compensate delays to provide BX alignment of the trigger data coming from the different inputs. Twelve of the 72 TwinMux inputs will be routable to GTH inputs to be able to handle the GOL based 1.6 Gb/s links. The expected latency is around 8 BX in total (2 BX for the deserialization, 5 BX for the output serialization and 2 BX to perform local operations, if preprocessing is not needed). Such latency is compatible with the TSC + OptoRX boards it will replace. To cover the full barrel, 60 TwinMux will be hosted in 6 μTCA crates, each of which equipped with an AMC13 M M C M ÷ 5 IBUFGDS BUFIO BUFR I S E R D E S 40MHz 800MHz lectIO in V7 speed grade -3 can de-serialize GOL links rializing /deserializing at 1.6Gbps in DDR 8/10b coding/decoding ly BUFIO can run up to 800 MHz sted in external loopback with a 32bit counter Three main blocks are responsible for data exchanging: from DT minicrates, from RPC link boards, and to barrel and overlap trigger processors. The data links from DT are deserialized by the SelectIO block of the standard pins, while RPC links are deserialized by the GTH transceivers. Both links need a BX alignment and RPC data needs also a uncompressing if a preprocessing on the TwinMux is performed. The transmitter blocks is responsible for data packing in accordance to the trigger processor data frame and it will exploit the GTH transceiver for an high speed serialization. TwinMux design foresee the possibility of having a double slow control: by means of PCI express lane on the backplane or by IPbus. Besides usual link status (unlocks, parity, signal detection), the available processing power in the FPGA’s will allow to host several improved monitoring tools, like for instance live trigger rate vs BX histograms layered on quality and muon sorting. The 10 Gb/s links to the AMC13 will enhance the probing of trigger raw data, allowing direct inclusion in the readout chain o Receiver from the Minicrates (480Mb/s) o Deserializer o BX alignment o Receiver from the LBs (1.6Gbps) o Deserializer o BX alignment o Uncompressing o Transmitter to the TP (8 Gb/s) o Data packing o Serializer o Slow control o TTC commands o PCIex IP/IPbus o DAQ link With the redesign of the DT trigger Sector Collectors, which handle trigger data from the 4 chambers belonging to each 30° φ sector of a wheel, all trigger primitives created within the DT chamber minicrates will be made available to the track finder processors. This, besides a slight improvement of θ hits information, allows skipping the sorting presently done within the minicrates while providing four segments per BX per chamber (instead of two). Each new Sector Collector, called the TwinMux because it is designed to handle the inputs from two DT sectors, will merge the 32 direct optical links coming from a sector into two 8 Gb/s links. The twinMux can also accept RPC data and hence can be used for receiving one DT and one RPC sector. In such case the total output corresponds to three 8 Gb/s links. In addition, through duplication, the data coming from each sector can also be synchronously fanned out to up to two additional destinations, providing a way to deliver the required neighbour information to each track-finder regional processor. This upstream duplication avoids the need of interconnections between processors. The duplication will also be used as a path to share data to the region of overlap with the endcap. Such new interconnection will replace the current coupling among the respective track-finders 32 80 bit/BX each SB 480 bit/BX MBTF (2+1)x8 Gbps MOTF MBTF SB SB SB LBM LBM LBM LBM 8 LVDS @ 480 Mbps 4 Server Boards LBM Twin Mux 5 5 Link Boards Master 1 GOL @ 1.6 Gbps 32 bit/BX each LBM SB MOTF Trigger Processors TwinMux Solution o One DT sector corresponds 32 LVDS lanes One RPC sector correspond to max 5 GOL links o Each sector routed at most to 4 destinations (2 barrel and 2 overlap TPs) o DT bandwidth to TP: 2x8Gbps RPC bandwidth to TP: 1x8Gbps Vadatech VT892 TwinMux TwinMux TwinMux TwinMux TwinMux AMC1 3 MCH1 TwinMux TwinMux TwinMux TwinMux TwinMux TwinMux Powe r Powe r JSM CPU o CMS µTCA crate o Dual star backplane o 60 double-width and full-height cards in 6 crates o Fabric A-B, Fabric D-G o FCLKA, TCLKA, TCLKB o Slow control o IPBUS, PCIEx, IPMI, JTAG o AMC13 commands o Connection to central DAQ for data spying Preliminary test of the firmware are already started on the characterization kit of the Virtex 7 speed grade -3. Eye diagrams of a 10Gb/s data links are collected at different level of clock jitter (0, 200, 500 ps)

Upload: marie

Post on 23-Feb-2016

39 views

Category:

Documents


0 download

DESCRIPTION

A New Data Concentrator for the CMS Muon Barrel Track Finder A. Triossi , M. Bellato , R. Isocrate , F. Montecassiano , S. Ventura – INFN Padova. Level Translator. De- mux. Virtex 7 XC7VX330T. GTH. GTH. Transmitter. Receiver. Snap12. Receiver. SelectIO. CML to LVDS. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A  New Data Concentrator for  the CMS  Muon Barrel Track  Finder

A New Data Concentrator for theCMS Muon Barrel Track Finder

A. Triossi, M. Bellato, R. Isocrate, F. Montecassiano, S. Ventura – INFN Padova

The muon trigger upgrade is not expected to be completed before the LHC resumes operation after LS1. Thus it is imperative to be able to commission the new muon trigger in parallel with the operation of the current trigger. For that reason, the trigger primitives from the CSC and RPC systems will be fully split into two paths, and those from the DT system will be split from a slice of the system. A portion of the links coming from the minicrates will be equipped with passive optical splitters in order to deploy and validate the TwinMux board. The portion will cover three sectors over two wheels, for a total of 192 split links, so to enclose all possible neighbour correlations that the track finding algorithm has to explore to correlate track segments

Solder mask removed from one PCB (scrap of production). We made sure that PCBs were manufactured at 22 degrees relative to the wave of the dielectric material in order not to be parallel to high speed lanes

Follow UsYou can get updates on TwinMux developing and debugging here:https://cern.ch/CMS-TM7https://cern.ch/TwinMux

Firmware Blocks

CSC DTRPC

MPC SCLB

CSC sorter DT sorterRPC sorter

CSCTF DTTFPAC

GMT

CSC DTRPC

Splitter TwinMuxConcent.

GMT

EndcapTF & sorter

BarrelTF & sorter

Overlap TF & sorter

MPC mezzanine

CuOF OFCuLB

Muon Trigger Upgrade StrategyThe present muon trigger preserves the complementarity and redundancy of the three separate muon detection systems until they are combined at the input to the Global Trigger. In contrast, the upgrade to the muon trigger will utilize the redundancy of the three muon detection systems earlier in the trigger processing chain so as to obtain a more performant trigger with higher efficiency and better rate reduction. Recognizing that every additional hit along a muon trajectory further improves the fake rejection and muon momentum measurement, the upgrade seeks to combine muon hits at the input stage to the Muon Track-Finder layer rather than at its output. The segmentation of the track processing will still be done regionally in sectors of φ and η. For the upgrade it is expected a separate treatment of transition region between barrel and endcap (|η|~1) in the algorithms and in the deployment of hardware processors as well. Therefore the processing will be subdivided in azimuth as well as in barrel, overlap, and endcap regions. The final sorting and ghost cancellation of muon candidates is also expected to be handled separately for each of the three regions in η

0.8 η

1.2 η

Barrel

Overlap

Endcap

TwinMux

Splitter.

Overlap TF & sorter

CuOF OFCuLB

A further way for exploiting earlier the redundancy of the DT-RPC system consists in combining the primitive of the two systems in a “super” primitive at the TwinMux level. In such a case each TwinMux will receive the trigger data of one DT and one RPC sector.

Virtex 7

XC7VX330T

Receiver GTH

SelectIO

GTH

De-mux LevelTranslator

CML to

LVDS

Transmitter

ReceiverMiniPod

Snap12

TopGroundGTH_1GroundGTH_2Ground

PowerSplit_1PowerSplit_2GroundBottom

PowerPower

TwinMux Timeline Where we are?

Schemati

cs

Stack-

up

Layout

Simulati

on

Prototype

Slice Splitting

Minicrate CuOF

OFCu

Sector Collector

TwinMux

DTTF

MBTF

USCUXC

Opti

cal S

plitt

er

o No schedule interference for achieving validation testso 2015 – validation on a slice: 6 sectors (bottom part of YB-2 and YB-1)o 2016 – full deployment

From CuOF To SC To TwinMux

TwinMux will be a single slot double-width full-height μTCA board, equipped with 6 front panel SNAP12 RX and 2 Minipods TX for high speed data transmission (up to 13Gb/s). Based around a Xilinx Virex-7 FPGA, the TwinMux must achieve the merging of several 480Mb/s trigger links to higher speed serial links and compensate delays to provide BX alignment of the trigger data coming from the different inputs. Twelve of the 72 TwinMux inputs will be routable to GTH inputs to be able to handle the GOL based 1.6 Gb/s links. The expected latency is around 8 BX in total (2 BX for the deserialization, 5 BX for the output serialization and 2 BX to perform local operations, if preprocessing is not needed). Such latency is compatible with the TSC + OptoRX boards it will replace. To cover the full barrel, 60 TwinMux will be hosted in 6 μTCA crates, each of which equipped with an AMC13

MMCM ÷5

IBUFGDS

BUFIO

BUFR

ISERDES

40MHz800MHz

o SelectIO in V7 speed grade -3 can de-serialize GOL linkso Serializing /deserializing at 1.6Gbps in DDR

+ 8/10b coding/decodingo Only BUFIO can run up to 800 MHzo Tested in external loopback with a 32bit counter

Three main blocks are responsible for data exchanging: from DT minicrates, from RPC link boards, and to barrel and overlap trigger processors. The data links from DT are deserialized by the SelectIO block of the standard pins, while RPC links are deserialized by the GTH transceivers. Both links need a BX alignment and RPC data needs also a uncompressing if a preprocessing on the TwinMux is performed. The transmitter blocks is responsible for data packing in accordance to the trigger processor data frame and it will exploit the GTH transceiver for an high speed serialization. TwinMux design foresee the possibility of having a double slow control: by means of PCI express lane on the backplane or by IPbus. Besides usual link status (unlocks, parity, signal detection), the available processing power in the FPGA’s will allow to host several improved monitoring tools, like for instance live trigger rate vs BX histograms layered on quality and muon sorting. The 10 Gb/s links to the AMC13 will enhance the probing of trigger raw data, allowing direct inclusion in the readout chain

o Receiver from the Minicrates (480Mb/s)o Deserializero BX alignment

o Receiver from the LBs (1.6Gbps)o Deserializero BX alignmento Uncompressing

o Transmitter to the TP (8 Gb/s)o Data packingo Serializer

o Slow controlo TTC commands o PCIex IP/IPbus

o DAQ link

With the redesign of the DT trigger Sector Collectors, which handle trigger data from the 4 chambers belonging to each 30° φ sector of a wheel, all trigger primitives created within the DT chamber minicrates will be made available to the track finder processors. This, besides a slight improvement of θ hits information, allows skipping the sorting presently done within the minicrates while providing four segments per BX per chamber (instead of two). Each new Sector Collector, called the TwinMux because it is designed to handle the inputs from two DT sectors, will merge the 32 direct optical links coming from a sector into two 8 Gb/s links. The twinMux can also accept RPC data and hence can be used

for receiving one DT and one RPC sector. In such case the total output corresponds to three 8 Gb/s links. In addition, through duplication, the data coming from each sector can also be synchronously

fanned out to up to two additional destinations, providing a way to deliver the required neighbour information to each track-finder regional processor. This upstream duplication avoids the need of

interconnections between processors. The duplication will also be used as a path to share data to the region of overlap with the endcap. Such new interconnection will replace the current coupling among the

respective track-finders

…32

80 bit/BX each SB

480 bit/BX

MBTF

(2+1)x8 Gbps

MOTF

MBTF

SBSBSB

LBMLBMLBMLBM

8 LVDS @ 480 Mbps

4 Se

rver

Boar

ds

LBM

TwinMux…

5

5 Li

nk B

oard

sM

aste

r

1 GOL @ 1.6 Gbps32 bit/BX each LBM

SB

MOTF

TriggerProcessors

TwinMux Solution

o One DT sector corresponds 32 LVDS lanesOne RPC sector correspond to max 5 GOL links

o Each sector routed at most to 4 destinations (2 barrel and 2 overlap TPs)o DT bandwidth to TP: 2x8Gbps

RPC bandwidth to TP: 1x8Gbps

Vadatech VT892

Twin

Mux

Twin

Mux

Twin

Mux

Twin

Mux

Twin

Mux AM

C1 3M

CH1

Twin

Mux

Twin

Mux

Twin

Mux

Twin

Mux

Twin

Mux

Twin

MuxPo

wer

Pow

er

JSM

CPU

o CMS µTCA crateo Dual star backplaneo 60 double-width and full-height cards in 6 crateso Fabric A-B, Fabric D-Go FCLKA, TCLKA, TCLKBo Slow control

o IPBUS, PCIEx, IPMI, JTAGo AMC13 commands

o Connection to central DAQ for data spying

Preliminary test of the firmware are already started on the characterization kit of the Virtex 7 speed grade -3. Eye diagrams of a 10Gb/s data links are collected at different level of clock jitter (0, 200, 500 ps)