S/PDIF (Sony/Philips Digital Interface Format) is a type of digital audio interconnect used in consumer audio equipment to output audio over reasonably short distances. The signal is transmitted over either a coaxial cable with RCA connectors or a fibre optic cable with TOSLINK connectors. S/PDIF interconnects components in home theatres and other digital high-fidelity systems.
S/PDIF is based on the AES3 interconnect standard. S/PDIF can carry two channels of uncompressed PCM audio or compressed 5.1/7.1 surround sound (such as DTS audio codec); it cannot support lossless formats (other than 2ch LPCM) such as Dolby TrueHD and DTS-HD Master Audio, that require greater bandwidth like that available with HDMI or DisplayPort.
S/PDIF is a data link layer protocol as well as a set of physical layer specifications for carrying digital audio signals between devices and components over either optical or electrical cable. The name stands for Sony/Philips Digital Interconnect Format but is also known as Sony/Philips Digital Interface. Sony and Philips were the primary designers of S/PDIF. S/PDIF is standardized in IEC 60958 as IEC 60958 type II (IEC 958 before 1998).
A common use for the S/PDIF interface is to carry compressed digital audio for surround sound as defined by the standard IEC 61937. This mode is used to connect the output of a DVD player or computer, via optical or coax, to a home theatre amplifying receiver that supports Dolby Digital or DTS. Another common use is to carry two channels of uncompressed digital audio from a CD player to an amplifying receiver.
S/PDIF was developed at the same time as the main standard, AES3, used to interconnect professional audio equipment in the professional audio field. This resulted from the desire of the various standards committees to have at least sufficient similarities between the two interfaces to allow the use of the same, or very similar, designs for interfacing ICs. S/PDIF remained nearly identical at the protocol level, but changed the physical connectors from XLR to either electrical coaxial cable (with RCA connectors) or optical fibre (TOSLINK; i.e., F05 or EIAJ optical), both of which cost less than the XLR connection. The RCA connectors are typically colour-coded orange to differentiate from other RCA connector uses such as composite video. The cable was also changed from 110 Ω balanced twisted pair to 75 Ω coaxial cable, using RCA jacks.
|AES3 balanced||AES3 unbalanced||S/PDIF|
|Cabling||110 Ω STP||75 Ω coaxial||75 Ω coaxial or optical fibre|
|Connector||3-pin XLR||BNC||RCA or TOSLINK|
|Output level||2–7 V peak to peak||1.0–1.2 V peak to peak||0.5–0.6 V peak to peak|
|Min. input level||0.2 V||0.32 V||0.2 V|
|Max. distance||100 m||1,000 m||10 m|
|Modulation||Biphase mark code||Biphase mark code||Biphase mark code|
|Subcode information||ASCII id. text||ASCII id. text||SCMS copy protection info.|
|Max. resolution||24 bits||24 bits||20 bits (24 bits optional)|
S/PDIF is used to transmit digital signals of a number of formats, the most common being the 48 kHz sample rate format (used in DAT) and the 44.1 kHz format, used in CD audio. In order to support both systems, as well as others that might be needed, the format has no defined data rate. Instead, the data is sent using biphase mark code, which has either one or two transitions for every bit, allowing the original word clock to be extracted from the signal itself.
S/PDIF is meant to be used for transmitting 20-bit audio data streams plus other related information. To transmit sources with less than 20 bits of sample accuracy, the superfluous bits will be set to zero. S/PDIF can also transport 24-bit samples by way of four extra bits; however, not all equipment supports this, and these extra bits may be ignored.
With one exception, S/PDIF protocol is identical to AES3. The channel status bit differs in S/PDIF. There is one channel status bit in each subframe, making 384 bits in each audio block. The meaning of the channel status bits is completely different between AES3 and S/PDIF. For S/PDIF, the 192-bit block for each channel is divided into 12 words of 16 bits each. The first 6 bits of the first word are a control code. The meaning of its bits are shown in the accompanying table.
|Bit||Unset (0)||Set (1)|
|0||Consumer (S/PDIF)||Professional (AES3)|
(changes meaning of control word)
|2||Copy restrict||Copy permit|
|3||2 channels||4 channels|
|6–7||Mode, defines subsequent bytes, always zero|
|8–14||Audio source category (general, CD-DA, DVD, etc.)|
|15||L-bit, original or copy (see text)|
Bits 8–14 of the control code are a 7-bit category code indicating the type of source equipment, and bit 15 is the "L-bit", which (for most category codes) indicates whether copy-restricted audio is original (may be copied once) or a copy (does not allow recording again). The L-bit is only used if bit 2 is zero, meaning copy-restricted audio. The L-bit polarity depends on the category, with recording allowed if it is 1 for DVD-R and DVR-RW, but 0 for CD-R, CD-RW, and DVD. For plain CD-DA (ordinary nonrecordable CDs), the L-bit is not defined, and recording is prevented by alternating bit 2 at a rate of 4–10 Hz.
The receiver does not control the data rate, so it must avoid bit slip by synchronizing its reception with the source clock. Many S/PDIF implementations cannot fully decouple the final signal from influence of the source or the interconnect. Specifically the process of clock recovery used to synchronize reception may produce jitter. If the DAC does not have a stable clock reference then noise will be introduced into the resulting analog signal. However, receivers can implement various strategies that limit this influence.
- Consumer S/PDIF provides for copy protection, whereas professional interfaces do not