The IBM 3850 Mass Storage System (MSS) consists of an array of data cartridges about 1.9 in. (4.8 cm) in diameter and 3.5 in. (8.9 cm) long, with a capacity of 50 million characters each. Each cartridge contains magnetic tape 2.7 in. (6.9 cm) wide and 64 ft (19.5 m) long, on which data are organized in cylinders analogous to those of a
Introduction
The IBM 3850 Mass Storage System (MSS) consists of an array of data cartridges about 1.9 in. (4.8 cm) in diameter and 3.5 in. (8.9 cm) long, with a capacity of 50 million characters each. Each cartridge contains magnetic tape 2.7 in. (6.9 cm) wide and 64 ft (19.5 m) long, on which data are organized in cylinders analogous to those of a disk file and can be transferred to the disk file one cylinder at a time. Up to 4720 cartridges are stored in hexagonal compartments in a honeycomb-like apparatus that includes mechanisms for fetching cartridges from the compartments, for the reading and writing of data on them, and for the replacement of cartridges in the compartments.
The data are recorded as coded binary sequences corresponding to the presence or absence of magnetic flux transitions in slanted, fixed-length stripes across the tape at a density of 67 stripes per inch (26.4 stripes per cm) and lineal density ranging from 3444 to 6888 flux transitions per inch (1356 to 2712 flux transitions per cm). A readwrite operation always involves the processing of whole stripes, with each data stripe containing exactly 4096 net data bytes after decoding. If the staging adaptor is destaging more than 4096 bytes, it must format the data to fit within stripe boundaries. One destage order can transfer up to 61 stripes, the equivalent of one cylinder of a disk file.
Unlike the conventional fixed head in the multitrack tape machines, the 3850 uses a rotary read-write head. The high cost of a multielement head and the need for a multiple number of read-write electronic channels are eliminated by replacing parallel multitrack recording with serial single-stripe recording. The data are recorded in short slanted stripes across the tape instead of in long tracks along the tape. In this way the jittery motion of the flexible tape over a fixed head is replaced by a smooth, controlled motion of the rotary head over a steady-state tape. The tape follows a helical path around a mandrel and is stepped in position from one slanted stripe to the next over a circular slit in the mandrel which houses the continuously moving read-write transducer element of the rotary head. These new approaches in transport design bring various improvements in the design of mechanical hardware, reducing head-tape wear and providing controlled spacing between the magnetic transducer and the recording media. However, they also demand new sophistications in data encoding techniques for purposes of waveform design and error recovery.
The read-write signal to the rotary transducer is coupled through a transformer whose primary and secondary coils are in continuous relative motion. This arrangement requires read-write signals with zero dc component. Previous papers [ 1, 21 reported a novel waveform design scheme referred to as "zero modulation" (ZM), which was created especially for the 3850 application. Zero modulation features waveforms with zero dc component and yet retains the advantages of other commonly known encoding methods, including high efficiency. Furthermore, the stringent coding constraints of ZM provide powerful checks on errors in decoding the read data. Zero modulation is used in the 3850 not only as a waveform encoding method but also as a powerful error-detection code.
This paper describes the design and implementation of the 3850 error recovery scheme which features a new error-correction code fitted into a serial, single-stripe data format. The second section begins with a discussion of the error recovery problem, which is followed by the rationale and approach for the design of the data format with resynchronizable sections, ZM encoding, and interleaved codewords of the error-correction code. In the third section, the design of the error-correction code is given, with a discussion of the salient characteristics and features of the code. This is followed by a description of the implementation of the encoding process, syndrome computation, and decoding process, providing correction of one or two erroneous bytes in a codeword.
A brief summary of zero modulation code is given in Appendix 1; proofs of theorems on the capability of the error-correction code are presented in Appendix 2; and Appendix 3 provides an example, with step-by-step results, of the encoding and decoding process of the errorcorrection codeword.
Error recovery scheme and data format Errors in magnetic tape recording are primarily caused by defects on the magnetic media or variations in head-media separation in the presence of dust particles. These errors often affect as many as 100 bits at a time, depending on the density of recording. Furthermore, long errors are often associated with loss of synchronization of the read clock, which renders subsequent data unreadable. In the standard nine-track 800-bpi magnetic tape machines the errors are identified as track errors and are corrected by means of a specially designed error-correction system [3].
As the recording density is increased, another error mode plays an important role. This is the well-known bitshift phenomenon where a magnetic flux transition is shifted from its normal position due to interference from neighboring flux transitions. The bit shift usually results in a two-bit error, where 01 is read in place of 10 or vice versa. The nine-track 6250-bpi tape machines feature an error-correction code [4] which corrects various combinations of one or two full tracks and multiple numbers of one-bit and two-bit errors.
The parallel multitrack data format is not available in the 3850 MSS. Instead, the data are organized in resynchronizable sections in order to facilitate recovery from mixed-mode errors in magnetic recording read-write processes involving one-bit errors caused by random noise, multiple two-bit errors caused by bit shift, and clusters of errors caused by defects and dust particles-including the capability to resynchronize the clock. The data format of the 3850 stripe is illustrated in Fig. 1 . The stripe is divided into 20 segments. The segments are appended to each other, forming a continuous waveform; however, each segment is a separate entity and can be decoded without reference to the data in other segments. Each segment consists of 13 data sections followed by two check sections. In a write operation, the bit values for the two check sections are computed in accordance with an error- correction code by processing the 13 data sections as they are being recorded. The computed check sections are then appended to the 13 data sections, thus completing a segment.
Each section is 129 bits long, consisting of 16 bytes of binary information with an overall odd-panty bit. This sequence of 129 bits is encoded into a 258-digit zero modulation waveform followed by a known unique synchronization signal (see Appendix 1 for details on ZM). The odd-parity bit serves the dual purpose of checking data errors and of limiting the memory requirement in the encoding process by setting the look-back parity of ZM to zero at the end of each section. The sections are appended to one another to form a continuous ZM waveform. Thus, each section is protected by the synchronization signal at both of its ends. This allows resynchronization of the decoding clock, at the beginning and at the end of each section, in the event of a long error causing loss of synchronization.
In a read operation, each section is read through the ZM decoding algorithm, which also checks for errors through stringent runlength and dc charge constraints. Error-free ZM patterns possess runlengths of at least one and at most three zeros between two ones and the dc charge value is always constrained within k 3 units. Thus, two consecutive ones or four consecutive zeros indicate an error. Acquisition of dc charge in excess of 5 3 units can be detected with an up-down counter which increments for every digit position recorded with a positive level; decrements likewise when the level is negative; and signals an error if the total exceeds t 3 at any time. The charge value must also be zero at the end of the section, excluding the synchronization pattern. These checks and 34 the odd parity at the end of each section detect most errors, including the two-bit errors caused by bit shift and drop out, and synchronization errors caused by defects and dust particles. It should be noted that the decoding errors in ZM do not propagate and that the decoding process always terminates at a section boundary. Thus, the presence of an error is usually detected by the ZM errordetection circuits in the vicinity of the error within the same section. The resynchronization signal at the beginning and at the end of each section provides or confirms the proper phase of the ZM double-frequency clock, thereby rendering each section independent in error modes.
All detected errors are reported to the decoder of the error-correction code for error recovery. Errors in up to two full sections in a segment can be recovered by means of this error-correction code, including the longest error in a worst-case situation when the defect coincides with a section boundary and affects two adjacent sections. A wide variety of shorter multiple errors is also detected and corrected by the same error-correction code.
The data format for the error-correction code is designed around the resynchronizable sections. This interleaving of the codewords facilitates correction of mixed-mode errors. When a defect or dust particle affects up to two full sections, the resultant error is recoverable by correcting the corresponding two bytes in each of the 16 codewords. On the other hand, many combinations of multiple one-bit and two-bit errors in a segment are also recoverable, since each codeword can detect and correct any one of its bytes. Any stripe with a defect length of more than 128 bits is demarked by the write operation. Every write operation is followed by a read-back check. Every demarked stripe is bypassed by the read operation.
Excluding the two error-checking sections in each segment, the 20 segments in a stripe provide a net recording space for a data stream of 4160 bytes (Fig. 3) . The first two bytes in this data stream are reserved for stripe identification. This is followed by a block of 4096 bytes of data, 60 bytes of filler zeros, and two bytes of Cyclic Redundancy Check (CRC) code. The two-byte CRC code provides an overall check for the data integrity of the stripe, including errors caused by malfunctions in the data flow hardware and error-correction process. The read operation is retried both with and without a change in the read amplifier gain setting when an uncorrectable error is encountered in any codeword or when a miscorrected error is detected by the CRC at the end of the stripe.
Aside from the main body of 20 segments, each stripe begins with one start-up section recorded with an all-ones signal and a special sync character. The leftover space at the end of the stripe is recorded with an all-zeros signal. The start-up section provides a known uniform signal for the purpose of priming the clock and the detection circuits, and the special sync character marks the beginning of the data. The all-zeros signal at the end ensures removal of any previously written data which may be present beyond the twentieth segment due to minor variations in the stripe length.
Error-correction code
As just described, the 3850 error recovery scheme is designed around the concept of resynchronizable sections. The basic building block of this scheme is a 15-byte codeword W , designated as For simplicity of notation, the word index k has been dropped from the notation for bytes; thus the byte Bj, is denoted simply by B,. In this codeword, Bo and B , are the check bytes, and the remaining 13 bytes are the data bytes. The coding rules are given in the form of the following matrix (modulo-2) equations: The code given by the coding rules (1) and (2) is derived from single-symbol-correcting Bose-Chaudhuri-Hocquenghem (BCH) codes [5] [6] [7] over symbols from the Galois field GF(2,). It is well known [4, 7-91 that the error-correcting codes for symbols from GF(2*) can be used for correction of groups of b adjacent errors. Here we present a modification to this concept where a code for symbols from GF(2*) is used for correction of rn x b adjacent errors. The modification relates to the representation of the elements of the Galois field. We use elements of a 16-element subfield of GF(28) in the parity check matrix in place of the conventional elements of GF (2,) . The idea of substituting the subfield elements is a novel contribution in code design. It makes the code applicable to eight-bit binary bytes, provides the necessary code-structure for multiplexed code capabilities, and yet retains the same decoding steps as those for the code over GF(Z4). The resultant code possesses the following two basic capabilities of two different well-known [9, 101 codes:
1. It detects all double bit-errors and corrects all single 2. It detects and corrects all single byte-errors.
bit-errors.
This multiplexed capability makes the code applicable in two different ways. More importantly, it provides an effective method of reducing the probability of miscorrections in tape-like, mixed-mode error environment. In particular, the following two assertions are proved in Appendix 2 :
1. If the code is used for correction of single byte-errors, then it will not miscorrect any combination of two one-bit errors. 2. If the code is used for correction of single bit-errors, then it will not miscorrect any combination of one byte-error with one bit-error in another byte.
In the 3850 MSS application, the code is used for correction of single byte-errors in the absence of error pointers. In this mode, the code exhibits a high level of protection against miscorrection of noise-induced bit-errors in more than one byte. In the presence of error pointers, the code corrects two erroneous bytes.
The matrix T represents a primitive element of CF@).
Thus, the matrices T, T2, T 3 , . . ., TZ5' represent distinct non-zero elements of GF(2'). The matrix T" represents a primitive element of the 16-element subfield of GF(2*), where A is a multiple of 17 and prime to 15. The choice of A = 68 was made simply because it provided a minimum number of hardware connections in the implementation of multiplication by matrix T". The matrix T6' can be com-
36
puted from T and is given by non-zero elements of the subfield, and TI5" = I where I is the 8 x 8 identity matrix. Thus the inverse T"" of the matrix Ti" is the matrix T'15-i)A. The modulo-2 sum and product operations of these matrices are closed, in the sense that the result is always one of the elements of the subfield. It will be seen later that these properties of the subfield elements facilitate implementation of the decoder and save hardware and decoding time.
0 Encoding process The code described by Eqs. (1) and (2) is a cyclic code with a generator polynomial g(x) with roots from the set of elements of a 16-element subfield of GF(28):
where a is a primitive element of GF(2') and A = 68. The roots 1 and a" are subfield elements corresponding to the matrix representations I and T", respectively. On multiplying out the factors of g(x), we have
The elements of GF(2') can also be represented by the eight-digit binary bytes. For example, the first column of the matrix Ti, for any i, is the commonly used eight-digit binary representation of the field element a'. Then, any 15-byte codeword represents a polynomial over GF(2') that is divisible by g(x). Note that although the coefficients of g(x) are restricted to the subfield elements, the coefficients of the codeword polynomial are from the complete set of elements of the field CF(2'). This allows all possible eight-bit patterns in the codeword and restricts the encoding and decoding operations within the subfield.
The encoding can be performed by a shift register network built for modulo g(x) operations. Figure 4 shows a block diagram of this shift register, which can be constructed from the conventional binary network elements.
The sum of any field elements /3, and P I , network is, in fact, an eight-channel binary shift register as shown in Fig. 5 , in which each storage element of Fig.  4 is replaced by eight binary storage elements, and the modulo-2 matrix multiplication or addition is realized by means of a set of binary modulo-2 gates (XOR circuits). Figure 6 shows separately a network of modulo-2 gates for multiplication of any eight-bit vector with the matrix TA.
The check bytes Bo and B , are computed by processing the data bytes B,, B,, B, , . ' ., B,, in the encoding shift register of Fig. 5 . Initially, the storage elements of the shift register are all set to zero. The ordered sequence of data bytes B,,, B,,, B,,, . . ., B, are entered into the shift register in 13 successive shifts, as eight-bit-parallel vector inputs. At the end of this operation, the shift register contains the check bytes B , and Bo in its high-and low-order positions, respectively. Then B, and Bo are gated out without feedback and appended to the data bytes to form a 15-byte codeword. Appendix 3 presents an example where this encoding process and also the following decoding process, including syndrome computation and correction of one and two byte-errors, are illustrated in detail.
Syndromes of error
The read data are checked for errors by means of the coding equations (1) 
and
When the indices i and j are known, the unknown error patterns e, and ej can be determined by processing the syndromes So and S,, provided all other bytes are error free. The syndrome equations (6) and (7) can be reduced in terms of these unknown error patterns by combining with the coding equations (1) and (2) and ei = So @ e,.
Equations (12) and (13) may be implemented with simple hardware. For this, the closure property and the multiplicative inverse of the subfield elements which were discussed before are used. In particular, we note that
Tp" = T -~A
where p and q depend only on the known values of i and j . The parameters p and q are precalculated for all possible values of i and j and are given in Tables 1 and 2 , respectively.
Thus, the decoding equations (12) and (13) The decoder then consists of the following four simple steps:
Step 1: Multiply S, by the matrix PA.
Step 2: Add So to the result of step 1 .
Step 3: Multiply the result of step 2 by T"'.
Step 4: Add S,, to the result of step 3.
The multiplication by PA and TyA of steps 1 and 3 can be performed by means of the shift register SR, of Fig. 7 with p and q shifting operations, respectively. The addition of So of steps 2 and 4 can be accomplished by entering the vector So into SR, at the time of the last shifting operation of the previous step. The results of steps 3 and 4 provide the correction patterns ei and e, for bytes bi and B,, respectively. When only one byte is in error, as indicated by pointer i, and the second pointer value j is undefined, the syndrome processing still determines e, and e, in which e, must result in a zero value. A non-zero value of e, in this case indicates an uncorrectable error in one or more unknown byte positions.
pointers. In this case the syndromes are processed for deer determines the index i of the erroneous byte and the corresponding error pattern ei. When all other bytes are error free, the syndrome equations (6) and (7) 
Thus, the error pattern ci is determined by the syndrome S,. Also, from Eqs. (18) and (19) we have 
Summary and comments
The error recovery scheme for the IBM 3850 MSS is designed to deal with mixed-mode errors comprised of single-bit errors caused by random noise, two-bit errors caused by the bit-shift phenomenon, and the long-burst errors caused by media defects, dirt particles, etc. The loss of clock synchronization over a media defect was an added problem, and the transformer coupling of the rotary transducer was a given constraint.
I
which cannot be corrected without ZM (or other) point-
~
In this paper, the error recovery scheme as it is implemented in the IBM 3850 system has been described. It features a new error-correction code, with dual capability, designed with a polynomial whose roots are chosen from the set of a Idelement subfield of GF(2'). This code not only possesses the necessary code capability for a mixed-mode error environment, but also provides the structure for economy in decoder hardware. The concept of multiplexed code capability can be extended for applications with other special error environments. modulation ensures the absence of dc component in the recording signal passing through the transformer coupling and also provides very reliable error-detection pointers in the read process. The error recovery scheme is designed around a data format consisting of resynchronizable sections which are rendered independent in error modes through the use of zero modulation encoding and selfcontained error-detection pointers. A segment of 15 resynchronizable sections is formed by interleaving 16 codewords. Each 15-byte codeword provides correction of one unknown or two known erroneous bytes. As a result, the scheme provides correction of various combinations of mixed-mode short and long errors-ranging from a multiple number of one-bit and two-bit errors to one or two full sections in each segment.
Appendix 1: Zero modulation
The ZM algorithm maps every data bit sequentially into two binary digits. The mapping is described in terms of a data bit to be encoded, one preceding data bit, and the two coded digits corresponding to the preceding data bit; and in terms of two parity functions that look ahead and back relative to the bit being encoded. Look-ahead parity P(A) is the count, modulo 2 , of ones in the data stream, beginning with the data bit being encoded and counting forward to the next zero bit; look-back parity P(B) is the count, modulo 2 , of all zeros in the data stream from its beginning up to the present bit. For example, in the data sequence 0101 I 1 10, with bit positions considered from left to right, P(A) = 1 at the second, fifth, and seventh bits and P(B) = 1 at the first, second, and eighth bits.
The encoding and decoding rules are expressed in the form of binary logic functions. The encoding function is
and the decoding function is
where the symbol d represents a data bit; a and h represent coded digits; and subscripts -I , 0, and 1 signify preceding, current, and succeeding bits, respectively. For convenience, the nonexistent bit preceding the first data bit is assumed to be one and its look-back parity is zero; the nonexistent bit following the last bit is zero.
Look-back parity P(B) can be obtained by updating a one-bit storage cell for every zero in the data as data bits are encoded. Look-ahead parity P(A) depends on the length of a string of ones in the succeeding data sequence. When the algorithm imposes no limit on the length of this string, the computation of P(A) requires an encoder with decoding process is delayed by only one bit perio d. The decoding errors in ZM do not propagate, and the decoding process always terminates at the section boundary.
In the coded ZM binary sequence, any two consecutive ones are separated by at least one and at the most three zeros. This sequence is converted into a waveform using a transition for a one and no transition for a zero in the binary sequence. Consequently, the narrowest pulse in the ZM waveform spans two digits in the coded sequence, thus keeping the ratio of the data density to the highest recorded transition density close to one. Similarly, the widest pulse spans four digits, thus limiting the range of different pulse widths. The ZM waveform has zero dc component, the accumulated dc charge being the difference between the numbers of positive and negative pulses from the beginning to any digit position in the waveform. In the ZM waveform, the accumulated dc charge value always remains within *3 units, and it always returns to a zero value at the section boundary.
In the IBM 3850 MSS the value off is 128. A known unique pattern is inserted at the end of each section and used as a synchronization pattern. This pattern is s = 010001001010001010001000l01001.
The waveform corresponding to S satisfies the ZM pulsewidth constraints. It has zero dc component, but does not satisfy the 2 3 charge constraint. The pattern S contains the sequence 00101000101000, which is the shortest among those that never occur in the valid ZM pattern in its original or shifted position. Thus, when the synchronization is lost, the sequence s can be still identified from the shifted data, which then re-establishes the synchronization.
The reverse of the waveform corresponding to S also makes a good sync mark. The IBM 3850 MSS uses this distinction to mark the beginning of the data in a segment by means of the reverse sync waveform in contrast with the regular sync waveform at the end of each section.
One interesting property of the ZM coded waveform that is not used in 3850 MSS and not reported in previous papers [ 1, 21 is read-backward symmetry. A properly terminated ZM waveform, when read backward, is a ZM waveform corresponding to the same data in reverse. In particular, when a parity bit is appended to the data to make P(B) equal to zero at the end, the encoding process terminates the dc charge value at zero at the end of the corresponding waveform. It can be shown that such a waveform, which has zero dc component, can be decoded forward or backward by means of the ZM decoding algorithm. where e , and e2 are error patterns with weight one, and x and y denote the locations of bytes in error.
The syndrome expressions for single byte-error are
where e is the error pattern and z denotes the location of the erroneous byte.
A miscorrection of two one-bit errors as one byte-error implies that e = e , @ e2 and
('45) Equation (A5) can be rewritten as where e2 is an error pattern with weight one and y is the error location.
A miscorrection of one as the other implies that e = e, @ e2 and As seen in Theorem 1, Eq. (A12) leads to a contradiction except when x = y and e , = e2. Thus miscorrection is not possible. 8,,, 8,,, . . ., B 2 , 8,, 8 , into shift registers SR, and SR, of Fig. 7 . Table A2 presents the contents of these shift registers after each shift. The error in one byte is detected and corrected by further processing the computed syndromes in the same shift registers. Table A3 presents the steps of this decoding process. and j = 10. From Table 1 we find p = 1 1 corresponding to i = 4. From Table 2 we find q = 7 corresponding to ( j -i ) = 6. Table A5 presents the steps in the syndrome decoding process for correction of the erroneous bytes.
Appendix 3: Example

Encoding process
