## **General Disclaimer**

## One or more of the Following Statements may affect this Document

- This document has been reproduced from the best copy furnished by the organizational source. It is being released in the interest of making available as much information as possible.
- This document may contain data, which exceeds the sheet parameters. It was furnished in this condition by the organizational source and is the best copy available.
- This document may contain tone-on-tone or color graphs, charts and/or pictures, which have been reproduced in black and white.
- This document is paginated as submitted by the original source.
- Portions of this document are not fully legible due to the historical nature of some of the material. However, it is the best reproduction available from the original submission.

Produced by the NASA Center for Aerospace Information (CASI)

# UNCLASSIFIED

SPACE VEHICLE VITERBI DECODER FINAL REPORT

ER75-4176

16 APRIL 1975

PREPARED UNDER

NASA CONTRACT NO. NAS 9-13923

(NASA-CR-141787) SPACE VEHICLE VITERBI DECODER Final Report (Raytheon Co.) 64 p HC \$4.25 CSCL 03F N75-23672

Unclas G3/19 21774



## RAYTHEON COMPANY

EQUIPMENT DIVISION

## SPACE VEHICLE VITERBI DECODER

#### FINAL REPORT

ER75-4176

ľ

16 APRIL 1975

PREPARED UNDER NASA CONTRACT NO. NAS 9-13923

FOR

NASA MANNED SPACECRAFT CENTER HOUSTON, TEXAS

RAYTHEON COMPANY EQUIPMENT DEVELOPMENT LABORATORY SUDBURY, MA 01776

#### I. INTRODUCTION

- 11 11 - 11

4

الدغرية

12

The primary objective of the Space Vehicle Viterbi Decoder project was to design and fabricate an extremely low-power, constraint-length 7, rate 1/3 Viterbi decoder brassboard capable of operating at information rates of up to 100 kb/s. The brassboard was to be partitioned to facilitate a later transition to an LSI version requiring even less power. Additional objectives included an evaluation of the effect of soft-decision thresholds, path memory lengths and output selection algorithms on the bit error rate and the comparison of a new branch synchronization algorithm with a more conventional approach. The following pages describe in detail the implementation of the decoder and its test set (including novel all-digital noise source) and present the results of the various system tests and evaluations. The report concludes with a summary of the results obtained and with recommendations for the implementation of the LSI version of this decoder.

## II. VITERBI DECODER DUSIGN AND IMPLEMENTATION

The decoder configuration chosen for the baseline design is a series parallel implementation of the Viterbi decoding algorithm, tailored to the rate 1/3, constraint length 7, Odenwalder convolutional code. This configuration represents a compromise between a fully parallel decoder with lowest speed/power requirements, and a series decoder with lower hardware requirements.

One of the key elements used to implement the decoder baseline design is a random access memory (RAM). The organization of the RAM units was a determining factor in selection of an 8 x 8 series/parallel configuration.

CMOS integrated circuits were chosen for implementing the decoder design to minimize power requirements. The same logic functions can be used in a later LSI version of the decoder and implemented with either hybrid or monolithic technology.

#### 2.1 General Description\*

Ť.

The basic algorithm implemented by the decoder can be explained with the aid of the decoding algorithm flow chart, Figure 2-1, and the simplified decoder block diagram in Figure 2-2.

The branch metric calculator receives the quantized received code bits and generates from the 64 metric increments,  $\Delta s$ .

The input data representation is assumed to be such that the natural binary progression corresponds to the progression from a "strong zero" to a "strong one" (i.e., 000 = strong zero, 001 = moderately strong zero, 010 = moderately weak zero, 011 = weak zero, 100 = weak one, 101 = moderately weak one, 110 = moderately strong one, 111 = strong one). This representation was found to result in the simplest overall implementation (if the actual demodulator output assumes some other form, a simple format translator could easily be provided).

Once the branch-metric increments are generated, the <u>add-compare-select (ACS)</u> circuit updates the metrics associated with each surviving path, compares each of

\*The algorithm described in this section is particularized to those codes which, like the one of interest here, have generators both beginning and ending with the all-ones V-tuple, with 1/V the code rate.

-2-



 1 J J  $< \eta^{-1}$ - **1** فستنبيه  $(\mathbb{C}^{\ast +})$ لالية  $|\Gamma^{\infty}_{i}\rangle$ ين ا 2.3 المستبل /~2)  $i_{\rm B}$ 1 4 r art J, 

1



Figure 2-2

render att in dit i dagen

these metrics with that of its competitor, and selects the one with the smallest metric. As indicated in Figure 2-1, the branch-metric increments  $\Delta s_1$ , and  $\Delta \overline{s_1}^*$ , for example, are added to the sums  $s_1$  and  $s_{33}$  which are stored in accumulation memories Ax-low and Ax-high (cf. Figure 2-2). (Accumulation memories Ax-low and Ay-high are four separate memories with individual address selection. Dotted lines indicate that memory utilization alternates with each cycle of decoder operation, a cycle ending with the generation of an information bit (i = 32). Memories Ax-low and Ay-low contain locations one through 32, while memories Ax-high and Ay-high contain locations 32 through 64.) A comparison is performed on the outputs of the two adders and the lesser sum is stored in memory Ay-low, location 1.

اني. توب

w.

in 🖄

d n

1

λi ie

و د این او د این

At the same time, the contents of the <u>path memory</u> are modified as follows: If  $S_1 + \Delta S_1$  is the lesser of the competitive metrics, path memory location  $B_1$ is read and the output from bits 1 through 31 are loaded into bits 2 through 32 of duplicate path memory location R1, and a logic "0" is written into bit position 1. If  $S_{33} + \Delta S_1$  is the lesser, path memory location  $B_{33}$  is read and the output from bits 1 through 31 are loaded into bits 2 through 32, and a logic "1" is written into bit 1 of the duplicate path memory location R1. (cf. Figure 2-2. As with the accumulation memories, the dotted lines here indicate that memory utilization alternates with each cycle of decoder operation.)

The next step in the ACS unit is to interchange the roles of  $\Delta s_1$  and  $\Delta \overline{s_1}$ . That is,  $s_1$  is added to  $\Delta \overline{s_1}$  and  $s_{33}$  is added to  $\Delta s_1$  with the two sums compared as before. If  $s_1 + \Delta \overline{s_1}$  is less than or equal to  $s_{33} + \Delta s_1$ , then it is stored into Ay-low, location 2. Also, the contents of path memory location Bl (bits 1-31) are stored in location R2 (bits 2-32) and bit 1 is loaded with a logic "O". If  $s_{33} + \Delta s_1$  is the lesser, then that sum is stored in Ay-low, location 2, and the contents of path memory location B33 are stored in location R2, bits 2 through 32, with a logic "1" loaded into bit 1.

\*The terms  $\Delta s_i$  and  $\Delta \overline{s_i}$  here represent the metric increments corresponding to complemented branch pattersn; i.e., if  $\Delta s$  is the metric increment when the received data are compared to the pattern xy =, the  $\Delta \overline{s_i}$  is that obtained where these same data are compared with  $\overline{x} \overline{y} \overline{z}$ .

-5-

The identical sequence takes place for the remaining values of i, using the branch-metric increments  $\Delta S_i$  and  $\Delta \overline{S_i}$  with the locations in the metric and path memories altered accordingly. Finally, the output information bit is determined by comparing the number of "ls" and "Os" in the most significant bit (position 32) of each memory location just written into, with the majority state selected for the output bit.

Although the flow chart in Figure 2-1 indicates a serial operation with the index processing sequentially from 1 to 32, the decoder actually performs the 64 operations thus represented (two operations for each value of i) in eight steps of eight operations each. This is done to achieve the desired decoding rate (up to 100,000 information bits per second) while still using low-power CMOS logic. Furthermore, in order to simplify the metric calculator, these operations are so arranged that only one pair of metric increments,  $\Delta s_i$  and  $\Delta \tilde{s}_i$ , need be determined for any one of these steps. A related modification is also made in the ACS and path memory, units where the storage locations are shuffled (relative to those indicated in Figure 2-1) to simplify the address generators needed to identify all those locations corresponding to a given metric-increment pair.

| 011 | 010 | 110 |     |     |     |  |
|-----|-----|-----|-----|-----|-----|--|
| 110 | 100 | 000 | 110 | 101 |     |  |
|     | 011 | 010 | 110 |     |     |  |
|     | 110 | 100 | 000 | 110 | 101 |  |

н ≕

. च्यु

بزاله

. . .

ų į

w.,...

011 010 110 110 100 000 110 101 011 010 110 000 011 010 110

-6-

That is,  $H\underline{v} = 0$  for all sequences generated by the convolutional encoder. This statement is true, however, only if the sequence  $\underline{v}$  begins at a node in the code tree. If  $\underline{v}$  begins with either of the two other symbols in a code tree branch, in contrast, and if the encoder input sequence is random,  $H\underline{v}$  is a random sequence of ones and zeros.

Note that rows two and three are orthogonal on two bits. This property is exploited as follows: The sign bit from each decoder input is read into a 15-bit auxiliary shift register and the modulo-two sum formed of the contents of the register's 1st, 3rd, 12th, 14th and 15th stages, a second modulo-two sum formed of the contents of its 8th, 10th and 11th stages, and a third modulo-two sum formed of its 5th and 6th stages. If the shift-register's contents are properly framed and if the polarity of its inputs is correct, all three parity checks will agree, at least in the absence of errors. (I.e., the first and third of these parity checks correspond to the second row of H and the second and third to the third row of H.) If the framing is correct but the polarity reversed, the first and second parity check will agree, again, at least in the absence of errors, and will equal the complement of the third parity check (since the latter is the only one based up an even number of bits). If the framing is incorrect, however, it is easily demonstrated that the three parity bits will be randomly related. (Note that all vectors in H are cyclic shifts of the two vectors actually used. Consequently, all code information available to the Viterbi decoder is used in the synchronizer, although, of course, much less thoroughly.)

ł.

3 6

κ. ..

1

The results of these parity-check comparisons can, therefore, be used as an indicator of correct or incorrect node synchronization. If, in particular, all three parity checks agree, an up-down counter is incremented; if the first two agree but disagree with the third, the same up-down counter is decremented; in any other situation, the counter is neither incremented nor decremented. The process is repeated when the next bit is read in to the shift-register but with a second up-down counter, and again with the third input bit and a third up-down counter. The whole sequence begins again with the fourth input so that, in general, the i<sup>th</sup> input causes the j<sup>th</sup> up-down counter to be incremented or decremented with j = i (mod 3). It is readily verified that the up-down counter associated with the properly framed code sequence is more likely to be augmented (decremented) and less likely to be decremented (augmented) when the polarity is correct (reversed) than are either of the other two counters, even in the presence of errors. Correct node synchronization is thus estimated by observing

-7-

which of the counters first reaches a predetermined positive or negative threshold T. Further, a negative threshold crossing indicates an inverted decoder input so that all node synchronization and sign ambiguities are resolved simultaneously.

#### 2.2 General Implementation

فانغ

ساق ۱

si.

d la

đ.

1

' ( ) 11 - 20 A complete block diagram of the decoder is shown in Figure 2-3. The 3-bit quantized demodulator output is received by the Metric Calculator logic where the branch metric calculations are performed. The resulting 5-bit metric increment numbers ( $\Delta$ S) are used two at a time in the add, compare, select (ACS) arithmetic section. This arithmetic section is divided into eight identical units, each of which performs eight ACS operations in one bit-time period.

The results of the 64 decisions made in the ACS unit are sent to the path memory in the output selector section. The path memory is divided functionally into eight indentical groups of 8 words by 32 bits in a one-to-one correspondence with the ACS metric memory. During one information-bit period, data are read from one memory and stored in a second duplicate memory. The memory roles are reversed during the following bit period; the second memory is read and a write performed into the first. The output bit is determined by comparing the number of "1s" with the number of "0s" in the oldest bit position for each path and selecting the one occurring more frequently.

-----

Synchronization logic receives the most significant bit of each input symbol and resolves from these the node and phase ambiguities.

In the self-test mode a fixed data pattern is presented to the metric calculator section and the resulting decoder output compared with an appropriately delayed version of the same pattern.

Finally, a timing and control section provides the sequence, sub-sequence, control functions and output clocks for the decoder.

「「「「「「「」」」」

Each of these decoder subunits is described in detail in the following sections.

-8-



ORIGINAL PAGE IS OF POOR QUALITY

ь s

ы. ...

ļ

ني ...

J

J

Č.

VITERBI DECODER BLOCK DIAGRAM Figure 2-3

## 2.2.1 Metric Calculator, Node Synchronizer and Control Logic

A block diagram of the metric calculation logic is shown in Figures 2-4 and 2-5. Detailed logic for this section is shown on drawing VDBD#1, Sheets 1 and 4.

Input symbols are received at the decoder with three DM88L12, low power TTL, level translator gates which receive the 5 volt interface signals and output them at the 10 volt CMOS logic levels. The translator outputs are fed into an and-orselect (AOS) unit which provides selection of either the normal input symbols or the internally generated self-test symbols. Each output of the AOS unit is exclusive-ored with the complement signal which is generated by the node synchronization logic. In the soft-input mode all three inputs are passed on to the storage registers; in the hard-input mode, the most significant symbol bit only is used for all three inputs. The first two symbols are loaded into buffer registers 1A and 2A, while the third symbol is loaded directly into holding register 3B. At the time symbol #3 is loaded into 3B, the contents of registers 1A and 2A are transferred into holding registers 1B and 2B, respectively. This data transfer initiates the decode operation which is completed before the third symbol of the following group is received.

Data represented by the three 3-bit numbers in the registers are added in either the true or complemented form depending on the specific branch metric increment being determined. The output of the two adder circuits then represent the metric increments  $\Delta s_i$  and  $\Delta \overline{s}_i$ .

### <u> ∆ S Control Logic</u>

The logic shown on the left-center section of Figure 2-4 generates the true/complement control signals TCl, TC2 and TC3 for the registers 1B, 2B and 3B, respectively. The binary counter is incremented at eight times the output bit rate to generate the following control pattern:



······

1460 2021 S M Strangs &

· · · · ·

INPUT LOGIC Figure 2-4

ORIGINAL PAGE IS OF POOR QUALITY

I

-11-



·····

- 1 - 1 - 1

12 is

11

85.

10000

10000

- 1. 1

J

Į,

> ORIGINAL PAGE IS OF POUR QUALITY

METRIC CALCULATION LOGIC

Figure 2-5

| 1 | - | 000 |
|---|---|-----|
| 2 | - | 111 |
| 3 | - | 101 |
| 4 | - | 010 |
| 5 | ÷ | 110 |
| 6 | - | 001 |
| 7 | - | 011 |
| 8 |   | 100 |

A logic "O" provides a "true" output, and a logic "l" provides a "complement" output. With respect the the flow chart of Figure 2-1, the control function 000 yields the metric increments  $\Delta S_1$  and  $\Delta \overline{S_1}$ . The next control 111 yields the same quantities,  $\Delta \overline{S_1}$  and  $\Delta S_1$ , but in reversed order, etc.

Logic in the lower-left of Figure 2-4 is for symbol clock select. In the decode mode, the external symbol clock is level-translated and selected for use as the symbol load clock. An internally generated symbol clock is selected in the self-test mode.

#### Three-Number Adder

(小)) || || (山心

با فا

u is

j j

1000

rs Li Logic for the three-number adder is shown in Figure 2-6. Inputs from the three holding registers are added to generate the five-bit branch-metric increment,  $\Delta s_i$ . An identical circuit receives the inverted branch-metric increment,  $\Delta \overline{s_i}$ . The small scale integrations (SSI) logic used for the adders is of silicon-on-sapphire (SOS) technology made by Inselek Co.

#### Symbol Load Control Signal Generation

Signals for loading the two-buffer registers 1A and 2A, and the three holding registers 1B, 2B and 3B are generated by the logic of Figure 2-7. From these signals are derived the signals for clocking the three up-down counters in the node synchronization logic.

-13-



-14-

 $10^{-1} \rm ke^{-1}$ 

> LOAD 1A · -- -- --------→ LOAD 2A 5 21 \$ ŝ ->LOAD B Â A в C c KR ĸ SYMBOL LOAD CLOCK RESET

l

1724°25

L.

، میں - ا - ایک ا

11-1-1 11-1-1 1-1-1-1

E There

1. . . . . . .



-15-

1

#### Decoder Output Clocks

1

4.1

i i

11

Output clocks at 0° and 180° are provided at the information bit rate. The Load 2A signal is buffered into the 9614 driver gates connected to provide the two clock phases (VDBD #1, Sheet 1). The clock duty cycle is one-third.

#### Node Synchronization Logic

Node synchronization logic is comprised of the following sections:

Node-sync shift register, node-sync up/down counters, enable sync-change logic and node sync control logic. Detailed logic is shown on Drawing VDBD #1, Sheets 2 and 3.

## Node-Sync Shift Register (Figure 2-8)

The 15-bit node-sync shift register receives each symbol MSB at the symbol clock rate. Selected register outputs are exclusive-ored to produce three signals A, B and C. From these signals are then generated the up/down count, and inhibit-count control signals.

#### Node-Sync Up/Down Counter (Figure 2-9)

Three 8-bit binary up/down counters are required for the node synchronization implementation. Each 8-bit counter consists of two CD4029 4-bit up/down counter units. The counters generate overflow signals at the count of 64 when counting up, and generate underflow signals at count of 0 when counting down. When any one counter overflows or underflows, all three are then reset to a count of 32.

#### Node-Sync Control Logic

The synchronization action to be taken is determined according to the overflow or underflow indication from a counter, as follows:

| <u>Counter #</u> | <u>Overflow</u> | Underflow                     |
|------------------|-----------------|-------------------------------|
| 1                | Sync OK         | Complement Symbol             |
| 2                | Slip 1 Symbol   | Slip 1 Symbol and Complement  |
| 3                | Slip 2 Symbols  | Slip 2 Symbols and Complement |

Logic for implementing these functions is shown in Figure 2-10. A symbol is slipped by inhibiting the symbol load clock for one or two periods, as required. An overflow or underflow from any counter following start-up will set the in-sync flip-flop shown at the bottom of Figure 2-10 and drive an In-Sync indicator in the

-16-



ł

لعاضه



-18-



J

المن ا

ب ب

<u>ي</u> ل

ي: سي تيته

15 m 1

يا يە

 $\mathbf{e}_{1}$  :

10

n Ja

مەر 11. 11.

وعاصر

برؤية

1.1.2

لمتبا

Ť

j,

いたい ちんちょう ちんちょう ちんち いちちちょう しょうしょう

Figure 2-10

test set.

i g

14

. .

لمريمة

الما يما

άw

المَرْيَ السبة

ليألفه

من **ف**يد

00

است: الله

لسندلا

The clocks for the three up/down counters are inhibited for sixteen symbol clock times following a change in sync. This allows the sync shift register to refill with symbols in the correct sequence before any counters are incremented or decremented. Logic for this function is shown in Drawing VDBD #1, Sheet 2.

#### Enable Sync-Change Logic

Figure 2-11 is a block diagram of the logic required to generate the enable sync-change signal. Detailed logic is located on Drawing VDBD #1, Sheet 3.

At decoder start-up, the Enable Sync F/F forces ENSYNC CHANGE signal to the true state until either a complement signal or an inhibit symbol load clock signal is received.

A four stage up/down counter, EE9 controls resynchronization following start-up. The counter is initialized to the count of one and counted up or down according to the node sync up/down counter outputs. If counter #1 overflows, indicating correct sync, EE9 is incremented twice. A counter #1 underflow, or a counter #2 or #3 underflow or overflow decrements EE9. When the count of sixteen is reached any additional up-counts are inhibited. If the counter is decremented to zero, the signal RESYNC is generated which in turn activates the ENSYNC CHANGE signal allowing decoder resynchronization.

A resynchronization test point which can drive a test set indicator is generated by flip/flops DB43 and DF44. The first ENSYNC CHANGE signal transition at start-up is trapped in DB43 which enables flip/flop DF44. The next ENSYNC CHANGE signal transition will be trapped in DB43 and the RESYNC test point energized.

#### Decoder Input Timing

Timing for input symbol load signals and node synchronization counter increment and control signals is shown in Figure 2-12. All time intervals in this section are derived from the symbol clock which is variable according to the input symbol rate.

#### Self-Test Symbol Generator

The decoder self-test concept requires the generation of a pseudo-random bit pattern which is encoded and presented to the decoder in 3-bit symbols. The decoded output is then compared with the output of the pattern generator shift register. A block diagram for the pattern generator and encoder is shown on

-20-





Ĵ i a ώu 19 19 ما ل 4. -Arrest 7 

ور المستقدين وروالة

Figure 2-13. Detailed logic is located on Drawing VDBD #1, Sheet 3.

The pattern generator is a 7-stage convolutional encoder shift register with the modulo-two sum of the contents of the third and fifth stages fed back to generate a pseudo-random sequence of length 31. The diagram shows two additional stages added to the shift-register serial output to provide the 33-bit delay corresponding to that through the decoder with a path memory length of 32 bits.

#### Self-Test Control Logic (Figure 2-14)

44

المسا فيسار

45

ŝ.

The transition from decode mode to self-test mode initiates a Reset 1 pulse which resets the entire decoder. An internal symbol clock is then generated at approximately 250 KHz to clock the self-test symbol generator and the symbol load control logic. The 250 KHz basic clock frequency is provided by an R/C oscillator circuit.

A bit delay interval at least equal to the path memory length plus sync acquisition delay is required before a valid or put comparison can be made. Two CD4024 7-bit counters were chosen to provide a 128 bit delay.

#### Sequence Logic (Figure 2-15)

Sequencer timing for the decode operation is derived from a 6.5 MHz local crystal oscillator which is divided by 8 to produce the 154 nsec. time pulses, T1 through T8. These pulses are in turn divided by 8 to generate sequence pulses 1 through 8, each of 1.23  $\mu$ sec duration. Logic selected for the sequencer is Inselek SOS GMOS which operates at the required frequencies without excessive propagation delays. Detailed logic is shown on Drawing VDBD #1, Sheet 3.

The sequencer is started upon receipt of the third symbol which enables FF ST1 to be set on the next positive transition of the local oscillator. The output of ST1 sets ST2, enabling the time pulse generator and allowing the counter to be advanced from T1 to T2 on the next rising clock edge. Time pulse generation continues through T8, then repeats. On the trailing edge of the T8 pulse, a carryout signal is generated which clocks the sequence generator from sequence 1 to sequence 2.

The end of sequence 8 indicates completion of the decode operation for one output bit. Flip-flop ST3 is set, ST2 is reset, and the sequencer operation is halted until the third symbol is received for the next information bit. The time required to synchronize the symbol-load clock the OSC signal before start of a new cycle is greater than was anticipated. The maximum bit rate for the decoder

-23-



SELF-TEST SYMBOL GENERATOR

ORIGINAL PAGE IS OF POOR QUALITY

T

.

لين

144

. 1 i i i

د. مذقع

. . .

 $e^{i\alpha}$  :

منفد

> دشته ا انت**بر**

I

Figure 2-13

-24-

M



1

نے ن

ب يد

ಷ ಎ

, L.,

י יי גר

Į.

-

ŋ,ª

. بدندا

sentati f

لتأنن

LORICO HEST. F.R. CMPS



4

2

8

-26-

ے ب j. 1 1 1 1 1 1 1 1 с. 1. т. в. 1. ر ب بر ب و و . .ः ! • ے ا ایک

120

н ц



@ 6.5 MHz ) 1 THE PULSE = 154 NSEC, 1 SEQ. = 8×154 - 1.232 MISE 1 CYOLE = 8×1.232 = 9.856 MSEC. : = 144 NSEC. FROM END OF SEC. & TO STI

SEQJENCER TIMING

Zigure 2-16

is therefore limited to approximately 99 KBPS with the existing 6.5 MHz oscillator. The use of a slightly higher frequency oscillation would allow the sequence to be operated at the 100 KBPS rate. Sequencer timing is shown on Figure 2-16.

A sequence flip-flop provides the odd-even sequence indication for control signal generation. The flip-flop is reset to odd sequence and clocked to even sequence when the sequence generator advances to sequence 2.

An output-bit cycle indication is required for ACS and path memory read-write control signals. The cycle flip-flop shown on Figure 2-35 provides the odd-even cycle indication which changes every #8 sequence time.

#### ACS Control Signals

ىڭ ر

**月**合 日日 Detailed logic for controlling the ACS memory address increment, and for generating the read, write and normalization signals is shown on Drawing VDBD #1, Sheet 3.

#### Path Memory Control Signals

Logic for the path memory address and read-write control signal generation is located on Drawing VDBD #1, Sheet 4.

#### 2.2.2 ACS Logic

The logic for ACS Group #1 is shown in Figure 2-17, and is typical for each of the eight ACS groups. Detailed logic for each ACS group is shown on Drawings VDBD #1, Sheets 1 through 8.

The 5-bit number  $\Delta s_1$  is presented to the upper 8-bit adder, along with the 8-bit accumulation sum,  $s_1$ , read from MS1 memory. At the same time the 5-bit number  $\Delta \overline{s_1}$  is added to the 8-bit sum,  $s_{33}$  (M9-1 through M9-8), read from the MX9 memory. The outputs of both adders are compared and the smaller sum is gated through the and/or select gates to MY1 memory location Ay1. The comparator result is also sent to path memory control for the first of eight path memory sections. This and analogous operations in each of the other seven sections complete the ACS action for the first of eight sequence intervals. During the subsequent sequence intervals, the above operations are repeated with, for example,  $\Delta \overline{s_1}$  added to the same  $s_1$  sum in the upper adder of the first section during the second interval, while  $\Delta s_1$  is added to  $s_{33}$  in the lower adder.





-29-

FOLDOUT, FRAME

The lesser of these new sums is selected and stored in Ay2. The ACS operation for one information bit is completed during the eighth sequence interval where  $\Delta s_4$  plus  $s_4$  is compared with  $\Delta \overline{s}_4$  plus  $s_{36}$  and the smaller sum stored in memory My2, location Ay8.

#### Normalization of Accumulation Sums

The need for normalization of the accumulated sums is determined by counting the number of "ls" in the most significant bit (MSB) position of all 64 accumulation cum\_ stored during one output bit interval. If all 64 positions are a "1" at the end of the interval, then during the next interval each MSB is complemented to a logic "0" as it is read from memory. Examining the MSB from all eight ACS units, each sequence interval reduces the actual count required to 8. Logic gates Cl on Figure 2-17 generate the two signals NORM 1 and NORM 2, when all MSB's are a logic "1". This occurrence increments a counter in decoder board #1. At the end of the bit interval (eight sequence intervals) if all 64 MSB's are "ls", the the count will equal 8. In this case a NORMALIZE signal is sent to X-OR gates Z30 and Z40, complementing each MSB logic "1" as it is read from memory during the following bit interval.

#### Memory Organization

d a

انت تليه

Figure 2-18 shows in part, the organization of the 32 ACS memory units. The interconnections are shown for the function of reading Ax and writing Ay. A single example of the interconnection for the opposite function is shown with the dashed lines.

The particular memory address organization is shown in Table 2-1. This organization was dictated by the  $\triangle$  S metric sequence which in turn was selected so that only two  $\triangle$  S adder circuits are requried. The savings in logic here was achieved with no additional logic in the ACS area.

As shown in the upper section of Figure 2-18, Ax locations Ax 1-4 are read from one memory, while locations Ax 33-36 are read from another memory. These two sums are added with the proper  $\Delta$  S numbers, and the smaller sums stored in Ay locations 1-8. Each memory location on the Ax side is read twice during this interval, producing two new sums which are written into the Ay side.

When the memory functions are reversed in the following bit interval, then Ay locations 4-8 and 33-36 are read at the same time, as shown by the dashed lines, and the smaller ACS result stored in Ax locations 1-8. The "OR" functions between

-30-



 $F_{\widetilde{W}}$ 1 فينبغ

\*\*\*\*\*\*

استح

en ra

146

Ľ

υü

en,

25

ىتا ئىد

ii ia

į١

ت -----

1707 -

ب کنا

en en

ı İr

ت أب

 $c_{2}c_{2}$ 

.ch 20

فأرديها

4

Ţ

## ACS MEMORY ORGANIZATION Figure 2-18



## PATH MEMORY ORGANIZATION

Figure 2-20

r⊆> i فبريشة بال : تريت 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - 100 - - 1122 **-**terrer en ې د د د د 

မြန

1723

كسرسة



## Table 2-1

ė,

ii ii

Ĩ

I

pres.

ت

ار ن<u>ا</u>

ر یہ

€ ¥

і. С.,

» در ۲۰۰۰ در ا

*b*, p . मुन्द्र च

1

10

r-3 . Links the second

## ACS MEMORY MAP

| Memory      | Physical            | Memory                  | Read                     | Write            | Input Selected From                                   |
|-------------|---------------------|-------------------------|--------------------------|------------------|-------------------------------------------------------|
| <u>Unit</u> | Location            | <u>Contents</u>         | <u>Sequence</u>          | <u>Sequence</u>  |                                                       |
| 1           | 1                   | S1                      | 1,2                      | 1                | $s_{1}+\Delta s_{1},s_{33}+\Delta s_{1}$              |
|             | 2                   | S2                      | 3,4                      | 2                | $s_{1}+\Delta s_{1},s_{33}+\Delta s_{1}$              |
|             | 3                   | S3                      | 5,6                      | 3                | $s_{2}+\Delta s_{2},s_{34}+\Delta s_{2}$              |
|             | 4                   | S4                      | 7,8                      | 4                | $s_{2}+\Delta s_{2},s_{34}+\Delta s_{2}$              |
| 2           | 5                   | s7                      | 1,2                      | 7                | $s_4 + \Delta s_4, s_{36} + \Delta s_4$               |
|             | 6                   | 58                      | 3,4                      | 8                | $s_4 + \Delta s_4, s_{36} + \Delta s_4$               |
|             | 7 °                 | S5                      | 5,6                      | 5                | $s_3 + \Delta s_3, s_{35} + \Delta s_3$               |
|             | 8                   | S <sub>6</sub>          | 7,8                      | 6                | $s_3 + \Delta s_3, s_{35} + \Delta s_3$               |
| 3           | 9<br>10<br>11<br>12 | S11<br>S12<br>S9<br>S10 | 1,2<br>3,4<br>5,6<br>7,8 | 7<br>8<br>5<br>6 | 86+                                                   |
| 4           | 13                  | S <sub>13</sub>         | 1,2                      | 1                | $s_{7}+\Delta s_{1}, s_{39}+\Delta s_{1}$             |
|             | 14                  | S <sub>14</sub>         | 3,4                      | 2                | $s_{7}+\Delta s_{1}, s_{39}+\Delta s_{1}$             |
|             | 15                  | S <sub>15</sub>         | 5,6                      | 3                | $s_{8}+\Delta s_{2}, s_{40}+\Delta s_{2}$             |
|             | 16                  | S <sub>16</sub>         | 7,8                      | 4                | $s_{8}+\Delta s_{2}, s_{40}+\Delta s_{2}$             |
| 5           | 17                  | S18                     | 1,2                      | 5                | $s_{9}+\Delta s_{3}, s_{41}+\Delta \overline{s_{3}}$  |
|             | 18                  | S17                     | 3,4                      | 6                | $s_{9}+\Delta \overline{s_{3}}, s_{41}+\Delta s_{3}$  |
|             | 19                  | S20                     | 5,6                      | 7                | $s_{10}+\Delta s_{4}, s_{42}+\Delta \overline{s_{4}}$ |
|             | 20                  | S19                     | 7,8                      | 8                | $s_{10}+\Delta \overline{s_{4}}, s_{42}+\Delta s_{4}$ |
| 6           | 21                  | S <sub>24</sub>         | 1,2                      | 3                | $s_{12} + \Delta s_2, s_{44} + \Delta \overline{s_2}$ |
|             | 22                  | S23                     | 3,4                      | 4                | $s_{12} + \Delta \overline{s_2}, s_{44} + \Delta s_2$ |
|             | 23                  | S22                     | 5,6                      | 1                | $s_{11} + \Delta s_1, s_{43} + \Delta \overline{s_1}$ |
|             | 24                  | S21                     | 7,8                      | 2                | $s_{11} + \Delta \overline{s_1}, s_{43} + \Delta s_1$ |
| 7           | 25                  | S25                     | 1,2                      | 2                | $s_{13} + \Delta \overline{s_1}, s_{45} + \Delta s_1$ |
|             | 26                  | S26                     | 3,4                      | 1                | $s_{13} + \Delta s_1, s_{45} + \Delta \overline{s_1}$ |
|             | 29                  | S27                     | 5,6                      | 4                | $s_{14} + \Delta \overline{s_2}, s_{46} + \Delta s_2$ |
|             | 28                  | S28                     | 7,8                      | 3                | $s_{14} + \Delta s_2, s_{46} + \Delta \overline{s_2}$ |
| 8           | 29                  | S <sub>31</sub>         | 1,2                      | 8                | $s_{16} + \Delta \overline{s_4}, s_{48} + \Delta s_4$ |
|             | 30                  | S <sub>32</sub>         | 3,4                      | 7                | $s_{16} + \Delta s_4, s_{48} + \Delta \overline{s_4}$ |
|             | 31                  | S <sub>29</sub>         | 5,6                      | 6                | $s_{15} + \Delta \overline{s_3}, s_{47} + \Delta s_3$ |
|             | 32                  | S <sub>30</sub>         | 7,8                      | 5                | $s_{15} + \Delta s_3, s_{47} + \Delta \overline{s_3}$ |

-33-

許

经济利益 建能 有有

Ì1

| Memory      | Physical | Memory          | Read     | Write           | Input Selected From                                                |
|-------------|----------|-----------------|----------|-----------------|--------------------------------------------------------------------|
| <u>Unit</u> | Location | <u>Contents</u> | Sequence | <u>Sequence</u> |                                                                    |
| 9           | 33       | S33             | 1,2      | 4               | $s_{17} + \Delta \overline{s}_2, s_{49} + \Delta \overline{s}_2$   |
|             | 34       | S34             | 3,4      | 3               | $s_{17} + \Delta s_2, s_{49} + \Delta \overline{s}_2$              |
|             | 35       | S35             | 5,6      | 2               | $s_{18} + \Delta \overline{s}_1, s_{50} + \Delta \underline{s}_1$  |
|             | 36       | S36             | 7,8      | 1               | $s_{18} + \Delta s_1, s_{50} + \Delta \overline{s}_1$              |
| 10          | 37       | S38             | 1,2      | 6               | $s_{20}+\Delta \overline{s_3}, s_{52}+\Delta s_3$                  |
|             | 38       | S40             | 3,4      | 5               | $s_{20}+\Delta s_3, s_{52}+\Delta \overline{s_3}$                  |
|             | 39       | S37             | 5,6      | 8               | $s_{19}+\Delta \overline{s_4}, s_{51}+\Delta s_4$                  |
|             | 40       | S38             | 7,8      | 7               | $s_{19}+\Delta s_4, s_{51}+\Delta \overline{s_4}$                  |
| 11          | 41       | S43             | 1,2      | 6               | S22+ △S3, S54+△S3                                                  |
|             | 42       | S44             | 3,4      | 5               | S22+ △S3, S54+△S3                                                  |
|             | 43       | S41             | 5,6      | 8               | S21+ △S4, S53+ △S4                                                 |
|             | 44       | S42             | 7,8      | 7               | S21+ △S4, S53+ △S4                                                 |
| 12          | 45       | 545             | 1,2      | 4               | $s_{23}+\Delta \overline{s_2}, s_{55}+\Delta s_2$                  |
|             | 46       | 546             | 3,4      | 3               | $s_{23}+\Delta s_2, s_{55}+\Delta \overline{s_2}$                  |
|             | 47       | 546             | 5,6      | 2               | $s_{24}+\Delta \overline{s_1}, s_{56}+\Delta s_1$                  |
|             | 48       | 548             | 7,8      | 1               | $s_{24}+\Delta s_1, s_{56}+\Delta \overline{s_1}$                  |
| 13          | 49       | S50             | 1,2      | 1               | $s_{25}+\Delta s_1, s_{57}+\Delta s_1$                             |
|             | 50       | S49             | 3,4      | 2               | $s_{25}+\Delta s_1, s_{57}+\Delta s_1$                             |
|             | 51       | S52             | 5,6      | 3               | $s_{26}+\Delta s_2, s_{58}+\Delta s_2$                             |
|             | 52       | S51             | 9,8      | 4               | $s_{26}+\Delta s_2, s_{58}+\Delta s_2$                             |
| 14          | 53       | S 56            | 1,2      | 7               | S <sub>28</sub> +∆S <sub>4</sub> ,S <sub>60</sub> +∆S <sub>4</sub> |
|             | 54       | S 55            | 3,4      | 8               | S <sub>28</sub> +∆S <sub>4</sub> ,S <sub>60</sub> +∆S <sub>4</sub> |
|             | 55       | S 54            | 5,6      | 5               | S <sub>27</sub> +∆S <sub>3</sub> ,S <sub>59</sub> +∆S <sub>3</sub> |
|             | 56       | S 53            | 7,8      | 6               | S <sub>27</sub> +∆S <sub>3</sub> ,S <sub>59</sub> +∆S <sub>3</sub> |
| 15          | 57       | S57             | 1,2      | 6               | $s_{29} + \Delta \overline{s_3}, s_{61} + \Delta s_3$              |
|             | 58       | S58             | 3,4      | 5               | $s_{29} + \Delta \overline{s_3}, s_{61} + \Delta \overline{s_3}$   |
|             | 59       | S59             | 5,6      | 8               | $s_{30} + \Delta \overline{s_4}, s_{62} + \Delta s_4$              |
|             | 60       | S60             | 7,8      | 7               | $s_{30} + \Delta s_4, s_{62} + \Delta \overline{s_4}$              |
| 16          | 61       | S63             | 1,2      | 4               | $s_{32} + \Delta \overline{s}_2, s_{64} + \Delta s_2$              |
|             | 62       | S64             | 3,4      | 3               | $s_{32} + \Delta \overline{s}_2, s_{64} + \Delta \overline{s}_2$   |
|             | 63       | S61             | 5,6      | 2               | $s_{31} + \Delta \overline{s}_1, s_{63} + \Delta \overline{s}_1$   |
|             | 64       | S62             | 7,8      | 1               | $s_{31} + \Delta s_1, s_{63} + \Delta \overline{s}_1$              |

## Table 2-1 (Continued)

l

1

ų.

ji

ļ

I

ran Sec.3

l

tai\_≠

د د رو

1 3 1

1) 1 1} 1 1 ∎

t - - -

ŝ,

and a second

 Ax and Ay memories is accomplished through "wire-oring" of the memory outputs.

## Memory Addressing

On the lower left side of Figure 2-18 is the MY memory address and enable logic for the entire ACS board. The corresponding logic for the MX memories is on the upper right side of Figure 2-18.

All memory addresses (cf. Table 2-1) are read sequentially during the read operation. In order to allow this sequential addressing while reading, certain units, M2, M4, M8, M9 and M11, must be addressed, when writing, in a 3, 4, 1, 2 sequence; M13 and M6 are addressed in a 2, 1, 4, 3 sequence: M5 and M14 in a 4, 3, 2, 1 sequence; and the remainder are addressed in the normal sequential manner.

The memory enable signals shown in Figure 2-18 are used as part of the address control during write operation. When the address count is below 4, the memory locations 1-4 are enabled, while locations 5-8 are enabled when the count is 4 or greater.

## ACS Timing

i i L'a

8.0

÷.

60

enan Tit The timing diagram for the ACS section is shown in Figure 2-19. All time periods are based on a maximum information bit rate of 100 K bits per second. For lower bit rates, the decoder operates at the same rate, then halts in an idle state until the next three input signals are received.

The 10  $\mu$ sec bit period is divided into eight 1.25  $\mu$ sec sequence intervals. Each sequence interval is then further divided into eight time pulses, T1-T8, of 156 nsec duration.

The memory address is changed at the start of the Tl time and the read output allowed to ripple through the adder/comparator combination. No data set-up time is required for the memory units; a maximum hold time of 40 nsec is sufficient. The CMOS memory unit is the 4 word by 8 bit, RCA 4036.

### 2.2.3 Path Memory Logic

This section consists of the path memory and the associated output bit selection logic. The path memory (reference Figure 2-20) is implemented with 128 CD4036 memory units. These units are connected to provide dual 64 word by 32 bit memory sections, Bl-B46, and Rl-R46. The 64 words are then subdivided into eight 8-word functional groups. The ACS comparison result from group number one is used to determine whether to read from Bl-B4 in MBl, or from B33-B36 in MB9. Data from

-35-

the selected memory, bits 1-31 are then written into location Rl of MRl, bits 2-32. A 1 or 0 is written into bit 1 = PML AUG., depending on the result of the comparison. Detailed logic diagrams for the path memory are on Drawings VDBD #3, #4 and #5.

## Path Memory Addressing

11

المتراسية

مأغيد

i.

enna Liis The path memory addressing organization is the same as that for the ACS memory described in section 2.2.2. Address and enable logic is shown in Figure 2-21.

### Memory Read Control

Logic for controlling the read signals to each memory is shown on Figure 2-22. The eight score signals from the ACS sections are level converted from +10 VDC logic to +8 VDC logic with buffers El4 and El5, then trapped in flip-flops E9 through E12.

When the Read MB signal is received during the odd cycle, it is directed to one memory in each group, according to the trapped scores. Identical selection gates are provided for MR memory read control during even cycles.

## Typical Path Memory Functional Group

The interconnections for one-half of a typical path memory group, in this example MBL and MB2, are shown on Figure 2-23. The remaining half of the group consists of MRL and MR2. This sheet serves as a master for all sixteen similar sections. Control signals and PM input and output numbers are assigned to comply with each location.

#### Output Bit Selection

Logic for output bit selection is shown in Figure 2-24. The output bit is selected by counting the number of logic "ls" in the oldest bit position (bit 32) of all 64 memory locations. If the count is less than or equal to 32, then the output is a logic 0; if the count is greater than 32, the output is a logic "l".

One 5-stage counter is multiplexed between two path memory groups, requiring a total of four counters. A memory in each group is loaded once per sequence interval. The write input for bit position 32 is brought out for each group (labeled MSB on the drawing) and connected to the and-or-select gates. Data being written into memory is held long enough to be strobed into the counter by enable signals EN1 and EN2.

-36-

----MIS MI CTR 4030 ----162 1. 19 192 THEATHR FOIR (1) + 10 24 (1) + 1024 -----11 41 1 1140 E 19 111. 1 1115 . 1501 118 4050 (18 5 4017 MSR) 1.14 1110 1 2 1116 FANS 2 1 12 30 FANG 16140 F. 1628 119 READ MAB FAUL SPAL 40 FAUT \*\*\*1 REALMS + C 4011 110 03 1010 1850 F 121 10 1848 F631Q6 516 - 1623 18 45 12 F21 110 F8 42 \*\*\* \*\*\* 2\*\* \_\_\_\_\_ PEAD MB Ť 40.30 101 1 21630 165 5 × + FOU 4050 EH12 0 5031 1627. 1516 . 1581 1816 ..08/1 18:1 5 - 1028 .... MSBI 1.11 Heb 7 6 2700 FF36 1 FHJLQ FIL M 11 27 2" S DE 12 1. 1. 1. 1. 1 11 30 5 115 40 FE 29 4011 PEALME! 4011 1902 d 2019 FF31 9 15 10 FF 30 1H 28 5 .... FF 27 13 15 11 EF 27 PATH MEMORY ADDRESS & ENABLE LOGIC FOLDOUT FRAME ORIGINAL PAGE IS Figure 2-21 OF POOR QUALITY







-38- FOLDOUT FRAME



FOLDOUT FRAME

ORIGINAL PAGE IS OF POOR QUALITY TYPICAL LOGIC FOR <u>A</u> PATH MEMO Figure







Following the eighth sequence interval which is the end of the bit interval, the contents of the counters are added and compared with binary 32 for the output indication. This indication is clocked into a flip-flop and sent to VDBD #1, Sheet 1, where it is level converted to a 9614 TTL output driver. The counter inputs are level converted from 8 to 10 volts and the output counters, adders and comparators are operated at 10 VDC.

## Path Memory Timing

 $\square$ 

 $\square$ 

[]

[]

[]

0

1

[]

I

I

Basic timing for the path memory is shown in Figure 2-25. The output bit is available at the end of sequence 2 of the following bit interval.

## Path Memory Packaging

Path memory logic is packaged on three Augat 54-row universal packaging panels. Each of two panels holds 48 memories, while the remaining 32 memories are mounted on the third panel, along with the addressing and output control logic.

12: W.R 112555557575122345673123456778123 3567812345673123456731255670 3/20 560 2 1 mg 360. 1 310 \$ 563 8 PATH MEMORY TIMING 5 Î \$20.3 I INPUT CYCLE = 10 MSRC WB 360.2 FOR PREN. 300. Max Rate -OW C USE 359 2 5 569.1 - SYNC CAT (AUTRE) SYMEOL CLA (SEL) ! PATH MEM. CYCLE CLK DELAY CTR OUTPUT-BIT VALID EN/INH P. MEM. (TRAP) TIME PULSES INCH ADDRESS TRAP SCORE WRITE MEM. SEQUENCES EN. OUT 2 EN OUT 1

PATH MEMORY TIMING Figure 2-25

1

[]

[]

[]

0

0

Û

-----

I

I

## ZII. TEST SET DESIGN AND IMPLEMENTATION

ي د د ا

The major function of the test set is to provide noise modulated data to the Viterbi Decoder. It perfoms this function by encoding random data (changing at a variable rate of up to 100 KBPS) with a rate 1/3 convolutional encoder. It modifies this data with the output of a random noise generator and transfers the resultant data as three parallel bits at three times the original data frequency. This data is sent to the Viterbi Decoder along with a clock. Data returned from the Viterbi Decoder is compared with the original data and the number of errors generated can be recorded by an external counter. Figure 3-1 is a general block diagram of the test set with each block containing the figure number of three more detailed block diagrams. Each block in Figures 3-1, 3-2, and 3-3 contains the sheet number of the logic diagram wherein it originates.

The Random Data Generator (Figure 3-2) consists of a 24-stage SR (shift register) wirh appropriate exclusive OR circuits feeding back various outputs into the first stage. A data SR of up to 48 stages is provided to accommodate the delay of the data in moving through the path memory in the Viterbi Decoder to insure that the returned data will be in phase with the original data at the data error comparator. The decoder clock from the Viterbi Decoder will assure that the two data signals are compared at the proper time. The convolutional encoder input consists of seven lines from the random data generator. The resultant data is sequentially presented on the encoder output at three times the data generator rate.

The Noise Encoder (Figure 3-3) combines the output of the noise comparators with the convolutional encoder output and the resultant three outputs are stored in D flip-flops. The outputs of these flip-flops are Data BITS 1, 2, and 3 for the Viterbi Decoder.

The Symbol Error Comparator (Figure 3-2) compares the encoder output with the three data bits. The symbol errors generated by the noise can be counted with an external counter.

The Noise Generator (Figure 3-3) consists of 6 ROM memories (256 x 8); 5 twenty-one bit D-registers; a random noise generator; 7 twenty-one bit comparators; a noise generator encoder; a two-stage counter and a priority encoder.

The original noise source used one 63-stage shift-register rather than the 19-, 20-, and 23-stage shift-registers now being used. The 63-stage shift-register had several deficiencies that were substantially eliminated by changing to the shorter registers. The periods of the overall sequences generated by the two

-43-



I

The state

I

The

I

-

l



-45-



methods are of the same order of magnitude (roughly  $10^4$  years and 3 x  $10^4$  years, respectively). The 63-stage register, however, takes the full 10,000 years to go through all of its states whereas each of the shorter registers goes through all of its states in roughly 1/2 second. The long register thus cycles through only an infinitesimal fraction of its possible states during any one run while each of the short registers completes a number of cycles. As a consequence, the long register can spend an entire run in a relatively atypical portion of its sequence. This defect was compounded by the fact that trinominal feedback was used to generate the sequence. If the register cells were rich in zeros at the beginning of a run (which they tended to be even without the "initialize" switch in the ON position) they would stay rich in zeros for some time. This resulted in an atypically bursty noise environment and blased the results accordingly. The effect was negligible at low signal-to-noise ratios but introduced a degradation of roughly 1/2 dB at bit error rates of 10<sup>-16</sup>. Although the two generators results in identical firstorder statistics, the second approach was felt to be more representative of the anticipated noise in that it is able to cycle through a full range of "typical" states during any one run.

ر نې

w.c

The desired noise and threshold levels are obtained by setting the threshold switch (S3) and the S/N dB switch S2. Sixty-four lines from S2 are inputs to a priority encoder whose output generates the six least significant bits of the ROM address. The two-stage counter determines the two most significant bits of the ROM address. Whenever the start button is depressed a sequence of pulses IA1, IA2, and IA3 increment the two-stage counter. The contents of the ROM's are stored in registers A, B, C, E and F as indicated in Figure 3-6. A burst of twenty-one 10 MHz pulses shift the data in the 63-bit noise shift register. The twenty-one output lines from this noise register are compared with the noise data from registers A, B, C, D, E, F, and G. The 7 comparator outputs (NA, NB,...,NG) are inputs to the noise generator encoder.

The Timing Generator (Figure 3-5) consists of a 10 MHz oscillator; 8-stage Johnson Counter; 6-stage binary counter; a 7-counter burst generator timing logic and control logic. The HALT button resets all counters and presets the noise generator shift register and data generator shift register to anone in the first stage and zeros in the remaining stages if the corresponding initialize switch S5 (Figure 2) and S9 (Figure 1) are in the up position. If S5 and S9 are down, the HALT switch has no effect; thus the register contents remain random. Depressing the

-47-

start switch will first load the A, B, C, E and F registers with the noise data from the ROM's, next the noise generator shift registers are shifted 7 times. The resultant data from the Noise Generator Encoder (Figure 3-3) is stored on the 3 bit lines, the 300 KHz clock will sample the data at the Viterbi Decoder. The 7-burst counter is preset and a second burst of 7 pulses is applied to the noise generator SR. Upon completion of this second burst the second group of data occurs as levels on the 3 bit lines. The multiplex counter has been incremented each time to select sequentially each of the three lines to the multiplexer. The sequence is repeated a third time after which the third set of data levels appears on the three bit lines. After the third 300 KHz square wave pulse a Data Shift pulse is generated moving the data in the Random Data SR one place. Now all the aforementioned sequences repeat except the memory registers contents are not changed.

لىر يەل 1-49 د. الم ) | | 2 1 1 1 1 1 J 間 

Ĵ J

|                         | 1             |               |               |                   |               |               |                   |                   |
|-------------------------|---------------|---------------|---------------|-------------------|---------------|---------------|-------------------|-------------------|
| DATA STORED IN          | Reg. A        | Reg. B        | Reg. C        | MI1, MI2, MI3 (D) | Reg. E        | Reg. F        | M21, M22, M23 (G) | M21, M22, M23 (G) |
| MEMORY                  | M11, M12, M13 | MI1, M12, M13 | MIL, MI2, MI3 | NIL, NI2, MI3     | M21, M22, M23 | M21, M22, M23 | M21, M22, M23     | M21, M22, M23     |
| MSB'S OF MEMORY<br>ADD. | 00            | TO            | TO            | п                 | 00            | 10            | IO                | 10                |
| STATE OF 2-STAGE<br>CTR | 00            | οī            | 10            | 11                | 00            | 10            | IO                | 11                |
|                         | ···           | <u></u>       |               |                   |               | <u> </u>      |                   |                   |

(D) = Memory Acts as Reg. D.

(G) = Memory Acts as Reg. G.

Figure 3-4

ż

-49-



## IV. TEST AND TRADEOFF RESULTS

## Output Selection Algorithm Tradeoffs

The error rates observed using various path lengths and output selection algorithms are plotted in Figure 4-1 as a function of the signal-to-noise ratio (i.e., ratio of signal enerty to noise spectral density). The results summarized in Figure 4-1 were obtained using a soft-decision threshold spacing of 0.6  $\sigma_n$ . The effect of varying this threshold spacing is indicated in Figure 4-2. The major conclusions to be gleaned from these results are: (1) The majority vote output selection algorithm results in a performance degradation of about 0.05 dB relative to the minimum metric algorithm. (Spot checks indicate that the arbitrary oldestbit algorithm, in contrast, results in degradation of the order of 0.3 dB). (2) The improvement obtained by increasing the path length from 32 to 39 bits in negligible using either the minimum-metric or the majority-vote algorithm. (3) The optimum soft-decision threshold spacing is between 0.5  $\sigma_n$  and 0.6  $\sigma_n$  for most signal-to-noise ratios of interest. The optimum spacing does increase slowly with signal-to-noise ration, however. On the basis of the results observed at high signal-to-noise ratios, we conjecture that a spacing of 0.7  $\sigma_n$  or greater is optimum for ratios exceeding 5 dB. In any case, the minimum-metric and the majorityvote algorithms are both relatively insensitive to the threshold spacing for any spacing in the range of 0.4  $\sigma_n$  to 0.7  $\sigma_n$ .

Since the majority-vote output selection algorithm is only slightly inferior to the minimum-metric rule and since the advantage of increasing the path length from 32 to 39 bits is negligible, the majority-vote method was implemented in the brassboard and the path memory limited to 32 bits. The increase in hardware complexity associated with either the minimum-metric rule or with the increased path length does not appear to be justified by these slight improvements in performance.

#### Branch Synchronizer Tradeoffs

4

195

The performance of the baseline branch synchronizer is depicted in Figure 4-3. The two curves represent the synchronization delay in information bits as a function of the signal-to-noise ratio for two up-down counter thresholds, T = 32, and T = 64 (cf. Section 2). The dashed lines indicate the synchronization delay to the first acceptance; the solid lines indicate the delay to the first correct acceptance. In both cases, the synchronizer was able to retain synchronization at all test set signal-to-noise ratio settings greater than -3 dB.

-51-







A more conventional branch synchronizer was breadboarded and tested in order to establish performance data against which to compare the baseline synchronizer. This synchronizer is conceptually straightforward and not difficult to implement. It functions by allowing the Viterbi Decoder to begin decoding at an arbitrary position (i.e., at any one of the three possible branch sync positions with any one of the two possible polarities) and, after a  $\mathcal{A}$ -bit delay (with  $\mathcal{A}$  the path memory length), it monitors the oldest bits in each of the 64 paths. If all of these bits are the same when an output bit is selected, it augments a "unanimity counter"; if not, it augments a "non-unanimity counter." If the unanimity counter reaches the count of U before the non-unanimity counter reaches  $\overline{U}$ , it accepts the current position (and polarity) as correct. If the non-unanimity counter first reaches  $\overline{U}$ , either the sync position is slipped or the polarity changed and the procedure begins anew. The position and polarity are alternately changed until an acceptable (i.e., presumably the correct) position and polarity are found.

د م

بنف

ú.

3,

Various values of U and  $\overline{U}$  were tested in order to find the most efficacious settings. Table 4-1 lists the lowest signal-to-noise ratio at which the resulting synchronizer could retain branch synchronization as a function of U and  $\overline{U}$ . The propensity for false-sync is also indicated in Table 4-1. This refers to the tendency for the synchronizer temporarily to accept an erroneous sync position during acquisition. In every case, however, the false position is quickly rejected and correct sync eventually found. This, of course, increases the amount of time needed for correct synchronization. Nevertheless, a small U,  $\overline{U}$  setting might result in a faster sync acquisition than a larger setting, even though the former involves a number of false attempts before it finally succeeds, since each individual decision requires less time.

Acquisition delay measurements were made with U = 32 and  $\overline{U} = 64$  and with U = 256 and  $\overline{U} = 128$  in order to estimate the efficacy of this approach. Since false syncs made direct measurement of the acquisition time difficult, the following indirect method was used: Acquisition delay measurements were taken under two conditions, one in which the synchronizer was set to begin testing the correct position immediately and one in which it was set so that it had to test all six possibilities before reaching the correct position. In the latter case, the number m of positions tested and rejected before the first acceptance (either correct or false sync) was also recorded.

-55-

| U   | ΰ   | Frequency of<br>False<br>Syncs | Lowest S/N Ratio<br>Without<br>Loosing Sync |  |  |
|-----|-----|--------------------------------|---------------------------------------------|--|--|
| 32  | 64  | High                           | 2.3 dB                                      |  |  |
| 128 | 128 | Moderate                       | 1.46 dB                                     |  |  |
| 128 | 256 | High                           | 1.46 dB                                     |  |  |
| 256 | 128 | Zero                           | 1.46 dB                                     |  |  |
| 256 | 256 | Low                            | 1.46 dB                                     |  |  |
| 512 | 256 | Zero                           | 1.46 dB                                     |  |  |
| 512 | 512 | Low                            | 0.41 dB                                     |  |  |

# TABLE 4-1

ψb

نو فيه

( \_\_\_\_\_

i la

į.

5

 Counter Settings for Alternate Branch Synchronizer

-56-

Since

. موتية

236

10

$$E(m) = \beta \{ (1 - \beta) + 2(1 - \beta)^{2} + 3(1 - \beta)^{3} + 4(1 - \beta)^{4} \} + 5(1 - \alpha)(1 - \beta)^{5}$$

with  $\beta$  the probability of false acceptance and  $\alpha$  the probability of false rejection, and since  $\alpha \equiv 0$  for all U and U sattings of concern here, the average  $\widehat{m}$  of the numbers m thus provides an estimate  $\widehat{\beta}$  of  $\beta$ . Further, since the expected number of false syncs before correct sync when  $\mathcal{L}$  false positions are tested before the correct position is tested is

$$\sum_{i=1}^{\infty} \left( \frac{\ell + \ell' - \ell}{\ell} \right) \beta^{i} (1 - \beta)^{\ell} = \frac{\ell \beta}{1 - \beta}$$

the expected search time is

$$\hat{n}_{1} + \frac{1}{6} \sum_{\ell=0}^{5} \frac{\ell \beta}{1-\beta} \hat{n}_{2} + \frac{1}{6} \sum_{\ell=0}^{5} \ell \hat{n}_{3}$$
$$= \hat{n}_{1} + \frac{5}{2} \frac{\beta}{1-\beta} \hat{n}_{2} + \frac{5}{2} \hat{n}_{3}$$

where  $n_1$  is the expected test time (in information bits) needed to accept the correct sync position,  $n_2$  the expected time to accept an incorrect sync position and  $n_3$  the expected time to reject an incorrect position. The quantities  $n_1$ ,  $n_2$ , and  $n_3$  were directly estimated by, respectively, the average number of bits needed for a decision when the correct position was the first tested, the average number needed when the first incorrect position tested was accepted, and the average of the number of bits needed to make the first acceptance minus the expected number needed for that final test  $(n_1 \text{ or } n_2)$  divided by the number of rejections prior to the first acceptance.

The results of these measurements and calculations showed an expected sync acquisition time remarkably independent of the signal-to-noise ratio. When U = 32,  $\overline{U} = 64$ , the average acquisition delay varied from 430 bits at  $^{\rm S}/N = 2.3$  dB to 400 bits at an infinite signal-to-noise ratio. Similarly, when U = 256,  $\overline{U} = 128$ , the average delay remained almost constant at 800 bits as the signal-to-noise ratio ranged from 1.46 dB to infinity. These results are also indicated on Figure 4-2.

-57-

This alternate branch synchronization technique thus appears to be reasonably effective. Since it is slower than the baseline approach, however, and since it is unable to retain synchronization at low signal-to-noise ratios, the baseline approach was implemented in the brassboard. The up-down counter threshold T was set at 32 since the Pasulting synchronizer is faster, at most signal-to-noise ratios of interest, than when T = 64. This setting is easily changed, however, should more rapid acquisition be desired at signal-to-noise ratios of less than 1 dB.

## Power Consumption

ن قد

Power consumption calculated from measured current for the decoder totaled approximately 2.6 watts, with the power breakdown per voltage as follows:

10 VDC @ 180 ma = 1.8 watts 8 VDC @ 65 ma = 0.52 watts 5 VDC @ 60 ma = 0.30 watts TOTAL = 0.62 watts

In the preliminary design it was planned to operate the synchronizer and self-test logic at 5 VDC to save power. In the final design, however, the number of level conversion gates required for the interface between the 5 and 10 VDC turned out to be fifteen. It was decided, therefore, that little if any power savings would be realized, considering the additional logic requirements.

The preliminary power estimation of 0.5 watts for the path memory was based on operation of the memory devices (CD4036) at 5 VDC. Interconnection capacity in the brassboard, however, requires a minimum path memory voltage of approximately 7.2 VDC for proper operation. Below this voltage the rise and fall times of the memory output signals become excessive. An 8 VDC source was, therefore, provided for the path memory from a regulator circuit driven by the 10 VDC power supply.

## LSI Implementation

The Viterbi Decoder brassboard was partitioned to facilitate eventual LSI implementation. The ASC and path memories, in particular, have highly regular organizations readily translatable to either monolithic or hybrid LSI. For the relatively low production volume anticipated for the Space Vehicle Viterbi Decoder, the hybrid approach merits serious consideration. It can result in packaging densities comparable to those attainable with specially developed monolithic chips

-58-

at considerably reduced development cost. For this reason, the major emphasis concerning possible LSI implementations for the Viterbi Decoder involved the hybrid approach.

The packaging technique resulting from this study is shown in Figure 4-4. The path memory entails the use of thick-film hybrids. The 128 chips comprising the two path memories are partitioned into eight identical circuit groups, each packaged as a hybrid circuit containing 16 CMOS chips. The high density of parallel connections required in each hybrid is satisfied using three, thickfilm, etched-back conductive layers. The chips are attached to the substrate through epoxy die bonding. The chip-to-substrate interconnections are ultrasonically bonded aluminum wire; the substrate-to-package interconnections are TC-bonded gold ribbon. The enclosure for this circuit is a custom-tooled flatpack with a total of 116 input-output pins. The sealed package size is approximately 1.5" x 2.0" x 0.16".

Both the ACS section and the remainder of the decoder logic (the metric calculator, node synchronizer, and control logic) could also be implemented as hybrid LSI circuits. The resulting reduction in volume is considerably less dramatic in either of these sections, however, than it is in the path memory. The ACS section and the metric-calculator-synchronizer-control section could, in fact, each be packaged on one  $4.2" \times 5.8" \times 0.39"$  module using flatpack integrated circuits (cf. Figure 4-4) without recourse to any additional hybrid or monolithic LSI development. The entire hybrid path memory could then be packaged on a third module having the dimensions  $4.2" \times 5.8" \times 0.49"$ , resulting in an overall 38 in.<sup>3</sup> decoder package.

The total circuit capacitance in the 38 cubic inch package just described is estimated to be less than one-half the capacitance in the brassboard decoder. Since the dynamic power dissipation is directly proportional to the load capacitance, the total power would be roughly one-half the predicted attainable brassboard value of 2.6 watts, or approximately 1.3 watts.

### Maximum Decoder Bit Rate

-a-

į.

برقط

The maximum data bit rate in the brassboard decoder is approximately 99 KBPS. Logic delays in the sequencer start/stop logic prevent operation at a 100 KBPS bit rate with the included 6.5 MHz crystal oscillator. The decoder was operated at approximately 105 KBPS, however, using an external pulse generator in place of the internal oscillator. This indicates that the substitution of an oscillator

-59-

of slightly higher frequency would easily allow operation at a data rate of 100 KBPS or more.

the second second

, , , , , , , ,

ij

1111-1

N. SALAN SALAN

11111

ï

1

2 2 2

-5



. ویک

i. i

1000 1000

ELECTRONIC MODULES (SRU'S)

Figure 4-4

-61-

#### V. CONCLUSIONS

The Space Vehicle Viterbi Decoder Brassboard has demonstrated that a lowpower, rate 1/3, constraint-lenght 7 Viterbi decoder capable of operation of rates up to 100 KBPS is indeed within the current state-of-the-art. Were the same design implemented using flatpacks for the ACS and control sections of the decoder and hybrid packages for the path memory, the resulting decoder would occupy only 38 cubic inches and consume an estimated 1.3 watts of power. Furthermore, this significant reduction in volume could be achieved at a modest cost since only one hybrid package need be developed; no monolithic LSI chip developments are required.

In addition to validating the design concept, the brassboard effort also accomplished a number of other objectives. It is verified, for example, that the optimum soft-decision threshold spacing is between  $0.5 \frac{\sigma}{n}$  and  $0.6 \frac{\sigma}{n}$  for most signal-to-noise ratios of interest and that the optimum ratio of the spacing to the noise standard deviation  $\frac{\sigma}{n}$  is a slowly increasing function of the signal-to-noise ratio. It established that the difference between the majority-vote output selection rule and the minimum-metric rule is indeed small (the former performing within about 0.05 dB of the latter at all signal-to-noise ratios) and that the advantage of increasing the path length from 32 bits to 39 bits is entirely negligible. (The arbitrary-oldest-path output selection rule, in contrast, was found to introduce degradations of roughly 0.3 dB.)

Finally, it verified that the stand-alone node synchronizer implemented as part of the brassboard is indeed an effective device resulting, for example, in an average node acquisition delay of less than 200 bits at a 3 dB signal-to-noise ratio and capable of acquiring and retaining nodes synchronization at signal-to-noise ratios of as low as -1 dB.

-62-