Designing system interconnect isn't the most glamorous engineering job in the world. Indeed, to most system or digital designers, being assigned a backplane, interconnect, and connector selection is tantamount to being told to go stand in a corner with a pointy hat. In reality, what they need
Designers and system architects are realizing that the number of interconnections between functional blocks in systems is becoming unwieldy. Indeed, Moore's Law not only applies to the number of transistors that you can pack onto an IC, but also drives the amount of data to transfer between chips, modules, and systems. Classically, the digital information coding this data simply scales by increasing the number of parallel signals. This scaling continues until N interconnects multiply the bandwidth, which the maximum clock speed in the system dictates. The tricky part for the designer was to maintain phase alignment across N signal lines to a single clock line. A few years ago, clock speeds increased to the point at which phase alignment between bits and their clock was marginally operational, driving even wider system buses to accommodate information-transfer needs.
The PC industry, which silicon suppliers primarily drive, has recognized the ugly trend of I/O-count proliferation and recently sounded a death knell for parallel interfaces—first by eliminating the parallel interfaces to peripherals, such as DVD and hard disks, with SATA (serial ATA) and then by defining a serial peripheral interface such as PCI Express. SATA operates at a native rate of 1.5 and 3 Gbps and will soon deliver 6 Gbps; PCI Express delivers 2.5 Gbps per lane (available with one, four, and eight serial lanes), soon scaling to 5 Gbps per lane. The architects of these serial interfaces cleverly disguised them to make the interface indistinguishable to software, above the transport layer, from their legacy parallel implementations.
Chip manufacturers aren't the only ones reaping the benefits of serial interfaces. These interfaces reduce the number of pc-board and cable interconnects by as much as two orders of magnitude. But they may still deliver an increase in information sharing between points over their parallel ancestors, because they do not use a separate clock line. Figure 1 depicts an example layout of a PCI Express interface delivering twice the bandwidth in half the number of interconnects. Other serial-interface standards and data rates are also seeing a significant ramp in use (Table 1) .
Though standardized protocols facilitate midspan interconnections, as well as modular and interoperable system building blocks, a number of systems architects encapsulate standard protocols or data with signaling, switching, data integrity, and other channel overhead. In these proprietary systems, designers typically use a nonstandard baud rate, which in the past meant they had to implement their design in an ASIC. Now, designers can use FPGAs that accommodate both standard and proprietary serial protocols from 614 Mbps through 6.375 Gbps per lane, including "bonded" lanes in which data spans across multiple serial links, building the aggregate bandwidth by the number of lanes added.
Serial-protocol and baud-rate support, however, is only the tip of the iceberg. The motivation for economic and reliable interconnect does not stop at a silicon vendor's package or business interests. For serial links to succeed and grow in use, the circuitry to transport data must be fairly simple, low-power, and inexpensive. In many instances, the circuitry must fit into legacy chassis, and, for new systems, it must be "evergreened"—backward compatible and externally upgradable through simple plug-ins to quickly increase capacity, performance, and new features for customers, making it more difficult for competitors to take market share.
These economic, strategic, and business interests lend the appearance that the interconnect problem and its solutions are rather boring. To the contrary, it takes a substantial amount of design magic to make excessive interconnect and related problems disappear.
The challenge in most interconnects is to deliver a recognizable signal from the transmitter to the receiver. Most digital designers make a connection with any-sized wire and assume the receiver coincidentally understands the ones and zeros that the transmitter sends. In some cases, having been bitten by the lore of fast edges, designers take great care to either series-damp or parallel-terminate a signal to eliminate false triggering and thresholds. Designers rarely give consideration to impedance, matching, or trace length, because rise and fall times are longer than λ/10, and the loss budget from transmitting to receiving is on the order of -3 dB for most logic. Designers could accomplish greater interconnect bandwidths by simply making buses wider, possibly adding layers to pc boards in the systems, and possibly shipping a sledgehammer with every card to facilitate insertion into hundreds of connector pins at a few ounces of force apiece.
Although high-speed serial links may appear to deliver the Holy Grail of insertion force by reducing the number of interconnects by one or two orders of magnitude, they do not come without a price. The amount of consolidated bandwidth requires a high baud rate, mandating speedy rise and fall times. The system must deliver these high edge rates to the receiver by wires that must be insulated from each other. Unlike digital signals of less than 100 MHz, for which the resistance of interconnect wiring is the primary source of signal loss, at high edge rates (high frequency), a "skin effect" occurs. The electric fields in the wire cause electrons to conduct only in the "skin" of the conductor, thus dramatically increasing resistive loss over the bulk cross section of the conductor.
To further complicate matters, this crowding of electrons at the conductor surface produces intense electric fields in the interface between the conductor and the insulating dielectric that has energy-absorbing electrostatic dipoles. This dielectric-loss component dominates resistive loss for interconnect attenuation above 500 MHz for a representative trace geometry and materials set (Figure 2 ). In this example, the dielectric losses are twice that of the resistive loss at 2 GHz and diverge to almost five times the conductive loss at 10 GHz. Digital designers from the "Land of Ohm" must face nature's sleight of hand, in which energy at gigabit speeds becomes lost in a once benign and invisible medium: a conductor's insulation. In addition, familiarity, manufacturability, availability, legacy, reliability, and cost considerations anchor the constraint of using homogeneous FR4 as the board material of first choice for pc boards and backplanes.
The curves in Figure 2 model losses in only a microstrip and do not include the effects from connectors, vias, and impedance discontinuities. Simply adding a 150-mil-long via stub to a signal trace creates 65 dB of further losses at 9 GHz from that of a backplane trace with no vias (Figure 3 ). These signal "suckouts" result from the resonance of the nonconnected portion of the via (the "stub"), yielding an approximate resonant frequency, f, in gigahertz of this one-quarter-wave LC structure in FR4 of: f=k×1/(4×L×v), where k is a correction factor for stray reactance, L is the length of the via stub in inches, and v is the propagation velocity in nanoseconds per inch, which equals 174 psec/in. for FR4.
The novice faces the nonintuitive concept that a barrel of metal lacking an end-to-end electrical connection can resonate with strong electromagnetic fields (Figure 3a ), thereby annihilating a signal that happens to share a summing node with that via. However, by using the full length of the via as respective input and output ports or by using counterbored or backdrilled vias, designers can model the resulting attenuation curve for an interconnect (Figure 3b ) and measure the attenuation curve for an 1.25m trace on a GbX-connector-based, backdrilled, FR408 backplane from Molex (Figure 4 ). Although the via may appear to magically vanish, the backdrilling process simply tunes the resonant stub frequency to a substantially higher frequency and out of the information band of the transmitted signal.
After achieving the lowest possible attenuation, what does a signal look like after it traverses the backplane? Inspection of the backplane's attenuation curve in Figure 4 reveals that low frequencies are moderately attenuated, but there is also a linearly increasing attenuation of the signal with frequency. Severe signal degradation in the higher frequency components of a waveform—namely, in narrow, single-bit pulses and in its rising and falling edges—results in eye closure and ISI (intersymbol interference). For example, when you apply a PRBS (pseudorandom bit sequence) of high-speed NRZ data to a lossy channel, its high-frequency components become attenuated, and the system exhibits reduced amplitudes as pulses shrink to a single-bit width (Figure 5a ). Rise and fall times are lowpass-filtered to the point at which the signal cannot reach full amplitude before signaling an adjacent complementary bit. The attenuation curve intuitively suggests that a high-frequency PRBS pattern would result in eye closure. The waveform shown in Figure 5b , captured at the far end of the FR408 Molex backplane, confirms this suspicion.
Having improved the passive aspects of the attenuation curve, yet having achieved a disappointing closed eye at the receiver, there are two cards left for system designers to play. One is to use a significantly more expensive, higher performance board material that reduces the dominant dielectric-loss component, as the loss tangent of the material specifies. It would reduce the slope of the attenuation curve such that the far end of the backplane might achieve an eye opening. The resulting distance that you can drive on FR4 versus a lower loss material, with tan(δ)=0.01, contrasts with the yellow and green -20-dB-loss curves in Figure 6 .
Apart from reducing the slope of the attenuation curve through the use of exotic board materials, designers have a second choice that retains the low-cost, manufacturable aspects of FR4. The trick is to actively distort the frequency response of the overall system such that the serial channel has a flatter effective frequency response. Designers can accomplish this objective by either de-emphasizing low-frequency content or emphasizing high-frequency content of the transmitted and received signals themselves. Multiplying these channel responses and the harmonic content of a predistorted or equalized signal results in a flatter equivalent-frequency response of the signal from transmitter to receiver.
Why de-emphasize low-frequency content? In 1966, RL Johnson of RCA Labs
postulated the Johnson Limit: For a given transistor structure and process, the
product of breakdown voltage, V
To further open the eye, designers can boost the high-frequency content at the receiver. Here, it is possible for designers to tune (equalize) the response of the receiver amplifier so that it exhibits a higher gain at high frequencies (Figure 8 ). With careful design, system or board designers can dynamically program-in an optimized amount of boost, because excessive boost decreases the SNR of the received signal, and insufficient boost may result in insufficient edge rates and eye opening. Figure 9 shows the result of having an optimized level of equalization. The figure depicts a noncompensated eye and equalized eye, respectively, after a non-pre-emphasized PRBS signal has traversed 42 in. of FR4 at 6.375 Gbps.
Designers can adjust pre-emphasis, equalization, and transmitting-drive levels to compensate for the frequency-dependent attenuation effects of connectors, discontinuities, and losses in a backplane, or even those of cable. Figure 10 shows 18 ft of standard RJ45-connectorized Category 5 wire at 3.125 Gbps using FPGA pre-emphasis capabilities. To illustrate, Altera's Stratix II GX 90-nm FPGA with multigigabit transceivers has more than 5000 possible combinations, making optimization of signal integrity a nightmare in systems with multiple and varying channel lengths or properties. During the design phase of the system, designers can measure s-parameters for each card-slot position. They can then run the PELE (pre-emphasis-and-equalization-link-estimator) program using The MathWorks' ( www.mathworks.com ) Matlab code, which incorporates an FPGA-multigigabit-transceiver-specific model, to determine the optimal pre-emphasis, drive level, and equalization for each channel in the system. The dynamic configurability of an FPGA then comes into play. Designers can read the backplane-slot ID and then change settings to optimize the eye for each blade position without altering the configuration of the FPGA core. Designers can enable other system capabilities by using a small controller in the FPGA to automate the receiver equalizer, making it adaptive to system, environmental, and component variations.
FPGAs are well-suited to providing access to the density and performance that advanced process nodes offer, enabling system designers to develop highly integrated SOCs (systems on chips) in the shortest amount of time. FPGA vendors also offer debugged protocol IP, thus reducing R&D. FPGA packaging and design techniques mitigate simultaneous-switching-noise effects on all I/O, including serial links, allowing designers to use a large number of parallel I/O for devices including external memory and ASICs. To address the concerns of interconnecting large amounts of information, either intrasystem or intersystem, FPGAs are available now with large counts of multigigabit serial I/O. These high-speed, multigigabit transceivers have paced the system need of remaining on low-cost FR4 materials, and provide lower bit-error rate, higher interconnect bandwidth, and lower power dissipation than discrete serializer/deserializer devices.
As CMOS-process nodes continue to shrink below 100 nm, most, if not all, systems will use an FPGA with multigigabit serial interconnects or a structured ASIC for integrating systems onto a chip. The advanced signal-conditioning techniques that this article presents, which are available in high-end FPGAs, will appear rather mundane in the coming decades, particularly as device speeds push into the tens of gigahertz of operation and as the magical tricks of signal-integrity wizards become common knowledge.
Protocol |
Data rate (Gbps/lane) |
PCI Express 1.1, 2.0 |
2.5, 5 |
OIF CEI 6G |
6.25 |
Gigabit Ethernet |
1.25, 2.5 |
Serial Rapid I/O |
1.25, 2.5, 3.125 |
XAUI |
3.125, 3.7 |
SDI |
0.270, 1.488, 2.97 |
SONET |
0.62208 |
Fibre Channel |
1.063, 2.125, 4.25 |
Andy Turudic is a senior manager for Altera's high-end-FPGA-product line. He is currently investigating advanced, high-speed applications and signal integrity of multigigabit FPGAs in backplanes and cables. He has been involved in research, development, applications engineering, and marketing of high-speed serial communications, PLLs, and mixed-signal devices for more than 26 years. He holds a bachelor's degree in electrical engineering from the University of Windsor (Windsor, ON, Canada), holds eight US patents, and is a senior member of the IEEE.
The author would like to thank Steve McKinney at Mentor Graphics; Eric Bogatin at www.bethesignal.com ; Gourgen Oganessyan at Molex; and Leonard Dieguez, Tina Tran, Mark Flanigan, Naresh Raman, Venkat Yadavalli, Samson Tan, Michael Woo, and Dave Greenfield at Altera.