Options emerge for 10-Gbits/s chip-to-chip interfaces
EE Times: Latest News Options emerge for 10-Gbits/s chip-to-chip interfaces | |
Rich Hovey (08/02/2004 9:00 AM EDT) URL: http://www.eetimes.com/showArticle.jhtml?articleID=26100742 | |
Engineers have been rapidly increasing chip-to-chip I/O speeds in an effort to keep pace with the bandwidth needs of increasingly integrated silicon. Consequently, a variety of parallel and serial options at speeds up to 10 Gbits/second are becoming available that a designer would do well to evaluate carefully.
When selecting a chip-to-chip interface, factors that should be considered include size, power, number of required package signal balls and latency. However, when comparing 10 Gbit/s serializer/deserializer (serdes), the 10-Gbit attachment unit interface (XAUI), and system packet interface 4 (SPI-4) in dynamic mode, a 10-Gbit/s serdes will typically have the advantage on size, power and signal pins.
With respect to power consumption, a 10 Gbit/s serdes can be about two-thirds that of an XAUI interface and about one-third that of an SPI-4 interface. The physical size of a 10 Gbit/s serdes core can be about half and one-eighth that of XAUI and SPI-4 interfaces respectively.
The number of signal pins can vary depending on the package, but in general, the interface with fewer high-speed signal I/Os will have the advantage. For example, a 10-Gbit/s serdes uses four high-speed signal lines while XAUI uses 16 and SPI-4 uses 72.
Interfaces with more data lines generally provide links with lower latency at the PHY layer. This is because, at a high level, as you transmit and receive at a higher speed over a single line, more serialization and deserialization is being performed on the data before it is transmitted and after it is received.
A 10-Gbit/s serdes, for example, will perform 64-to-1 or 32-to-1 serialization and deserialization. Each lane of an XAUI interface, on the other hand, will only need to perform 10-to-1 serialization and deserialization. So, for applications where link latency is critical, SPI-4 type physical (PHY) layer or a XAUI interface will generally provide superior performance.
Depending on the application, other items to consider when selecting an interface include the capability to be backward compatible with legacy interfaces, packaging options and channel characteristics.
To dig deeper into the high-speed interconnect issue, let's take a look at an example of a 10-Gbit/s interface and how it could be implemented in an ASIC.
In this example, the ASIC logic transmits 32- or 64-bit parallel data. Depending on the application, this data can be 64/66B encoded or Sonet scrambled. The data is serialized and sent using an NRZ (non-return-to-zero) signal over a single differential pair at 9.95 Gbits/s to 11.1 Gbits/s.
On the receive side, the differential serial data stream is received, the clock is recovered and the data is deserialized. The data is then provided to the ASIC logic at 32 or 64 bits wide with respect to the recovered clock. The core employs on-chip termination (100 ohms, differential) and for most applications transmits with a nominal signal amplitude of 500 mV differential, peak-to-peak.
Such a 10-Gbit/s serdes core may be optimized for low power and small size by targeting applications which require up to approximately 20 cm of FR4-type pc board and a single connector. These applications would include chip-to-chip applications where both chips are on the same board, or when one chip is on a daughter card.
By designing the serdes core so that it supports the XFP MSA (10 Gigabit Small Form Factor Pluggable Multi Source Agreement), it could then also be used to interface with XFP optical modules. These modules do not perform serialization and deserialization, and they have significantly smaller form factors than modules that employ parallel interfaces.
Rather than pushing 10 Gbits/s over a single differential pair, an alternative is to use an XAUI interface. XAUI employs four serdes, or lanes, each transmitting and receiving differential 8B/10B-encoded data at 3.125 Gbits/s. The effective bandwidth after the overhead for the 8B/10B coding is removed is 10 Gigabits of data per second.
XAUI interfaces may be used to transfer data between chips, over a backplane or to an optical module such as a Xenpak or XPAK (two 10-Gbit MSAs). When the XAUI interface is integrated into an ASIC, it typically provides 10-to-1 serialization and deserialization and interfaces to an XGXS (10-Gbit Ethernet extended sublayer) block that performs 8B/10B encoding/decoding, lane alignment, character substitution, and resynchronization of the received data stream to the local clock. Another alternative for 10-Gbit/s chip-to-chip links is to use low-voltage differential signaling (LVDS) I/Os consisting of multiple data lines. SPI-4 interfaces are examples of such links. They feature 16 LVDS transmit data lines, one transmit control line, 16 receive data lines, one receive control line, and separate transmit and receive LVDS clocks which are forwarded with the data in each direction. In a typical application, each data line operates at a minimum of 622 Mbits/s, with 800 Mbits/s and above being common. The clocks run at half the baud rate — for example, 311 MHz for data-line operation of 622 Mbits/s.
There are two primary operating modes for the electrical interface in SPI-4 links. In static mode, which is for lower data rates, such as 800 Mbits/s and below, the skew between data lines and between clock and data must be carefully managed so that the clock may latch in the data on the receive side. Dynamic mode requires a more sophisticated receiver that determines the optimum timing for sampling each data line. In this mode there are no restrictions on clock-to-data skew and on-chip de-skew logic can resolve multiple bit times of skew between data lines.
Dynamic mode uses an initialization sequence to allow the receiver sampling circuitry to determine the optimal timing for latching each data line. With no restriction on clock-to-data skew, relaxed requirements for data-to-data skew and higher speed capability, dynamic mode has proved to be a popular interface. For both modes, the interface generally uses on-chip 8-to-1 serdes for each data line.
Rich Hovey (rhovey@lsil.com) is manager of the CoreWare product management group at LSI Logic Corp. (Milpitas, Calif.).
| |
All material on this site Copyright © 2005 CMP Media LLC. All rights reserved. Privacy Statement | Your California Privacy Rights | Terms of Service | |
Related Articles
New Articles
- Quantum Readiness Considerations for Suppliers and Manufacturers
- A Rad Hard ASIC Design Approach: Triple Modular Redundancy (TMR)
- Early Interactive Short Isolation for Faster SoC Verification
- The Ideal Crypto Coprocessor with Root of Trust to Support Customer Complete Full Chip Evaluation: PUFcc gained SESIP and PSA Certified™ Level 3 RoT Component Certification
- Advanced Packaging and Chiplets Can Be for Everyone
Most Popular
- System Verilog Assertions Simplified
- System Verilog Macro: A Powerful Feature for Design Verification Projects
- UPF Constraint coding for SoC - A Case Study
- Dynamic Memory Allocation and Fragmentation in C and C++
- Enhancing VLSI Design Efficiency: Tackling Congestion and Shorts with Practical Approaches and PnR Tool (ICC2)
E-mail This Article | Printer-Friendly Page |