Parallel processing methods for space based power systems by Berry, F. C.
PARALLEL PROCESSINGMETHODSFOR
SPACEBASEDPOWERSYSTEMS
N94- 25352
Final Report
NASA/ASEE Summer Faculty Fellowship Program--1993
Johnson Space Center
Prepared By:
Academic Rank:
University & Department:
NASMJSC
Directorate:
Division:
Branch:
JSC Colleagues:
Data Submitted:
Contract Number:
F.C. Berry
Associate Professor
Department of Electrical Engineering
Louisiana Tech University
Ruston, Louisiana 71272
Engineering
Propulsion And Power
Power
Bob Hendrix
Tom Jeffcoat
7-30-93
NGT-44-001-800
4-,0
https://ntrs.nasa.gov/search.jsp?R=19940020870 2020-06-16T13:57:01+00:00Z
ABSTRACT
This report presents a method for doing load-flow analysis of a power system by
using a decomposition approach. The power system for the Space Shuttle is used as a
basis to build a model for the load-flow analysis. To test the decomposition method for
doing load-flow analysis, simulations were performed on power systems of 16, 25, 34,
43, 52, 61, 70, and 79 nodes. Each of the power systems was divided into subsystems
and simulated under steady-state conditions. The results from these tests have been
found to be as accurate as tests performed using a standard serial simulator. The division
of the power systems into different subsystems was done by assigning a processor to each
area. There were 13 transputers available, therefore, up to 13 different subsystems could
be simulated at the same time.
This report has preliminary results for a load-flow analysis using a decomposition
principal. The report shows that the decomposition algorithm for load-flow analysis is
well suited for parallel processing and provides increases in the speed of execution.
INTRODUCTION
A research project, Advanced Electrical Power Management Techniques for
Space Systems (ADEPTS), was started at the Johnson Space Center in 1986. The basic
goal of ADEPTS was to automate the operations of a space based power system (SBPS)
by using the technology of parallel and distributed processing.
From the ADEPTS project, three basic functions were identified which would
form the basis of a management system for a SBPS. First was the monitoring of the
power system that could be accomplished by the use of state estimation. Second was the
scheduling of the generation to meet the required load of the SBPS that could be
accomplished by unit dispatch. Third was the solution of the SBPS that could be
accomplished by the use of load-flow analysis.
Methods like state estimation, unit dispatch, load-flow analysis, etc., are well-
established tools that have been used heavily by the electric power industry since the
introduction of the digital computer. However, the use of parallel processing is still
relatively new when applied to the area of power systems [1]. In the last few years, a
significant amount of research has been done in the area of parallel processing of power
system problems. Most of this work has been in the development of algorithms. Actual
testing on multiprocessor architectures is still near the beginning stages. The largest
uncertainty today is the evolution of the hardware. For example:
. Pipelined computers in which temporal parallelism is used to calculate overlapped
computations. Early Cray- 1 and Cyber 205 are good examples of these systems.
. Array processors in which spatial parallelism is utilized through synchronized
arithmetic logical units (ALU). The Illiac-IV is a good example of this system.
. Parallel processing systems in which asynchronous parallelism is employed at the
software level with the Clay X-MP and Clay 2 as typical examples.
Shared-memory and distributed processors are two basic systems of parallel
computers that can offer increases in computational power but their acceptability and
long term viability is unpredictable [1].
Because parallel algorithm development was much further along, ADEPTS chose
to make use of the existing algorithms and focused on tailoring these algorithms to a
4-2
specific hardware design. To accomplish this, a hardware platform and software
environment had to be decided upon. The INMOS T800 transputer was used as the basic
hardware platform. The T800 transputer is a 32 bit reduced instruction set computer
(RISC) running at 25 MHz [2]. The T800 transputer can communicate with other
transputers or a host computer by a set of four bi-directional channels. By using these
channels, a variety of hardware architecture can be investigated. Also, additional
transputer modules can be easily added in the future to further enhance the system
without making obsolete the existing transputers. Because of the difficulties encountered
in developing parallel software, the Express@ "operating system" was chosen. The
Express package is best classified as a transparent parallel operating system with a set of
tools and utilities for developing parallel programs [3, 4, 11].
A preliminary parallel architecture for a management system of a SBPS (Figure 1)
was developed and tested using the power system of the Space Shuttle (Orbiter) as a
model. The electric power distribution system that is present on the Orbiter is made up of
three strings each having a fuel cell that provides DC voltage to those systems that
require DC, and the inverters that convert the DC to AC for those elements that require
AC. Figure 2 is a block diagram representation of the power system of the Orbiter.
Analog
Measurements
I
Power
ystem
Generator Control
Output
State EstimatorState Output
Estimator
_DI State Estimator
Output
iattch
Load
OW
Future Load
Requirement
Figure 1" Block Diagram Of A Power Management System.
4-3
Preliminary testsof the architecture of Figure 1, using the Orbiter power system,
showed that the load-flow program took from 20% to 50% of total processing time
depending on the size of the electrical circuit that the load-flow program was solving. To
improve the overall performance of the architecture of Figure 1, a different architecture
for the load-flow was investigated.
I Power Control IAssembly
Load Control ]Assembly
Motor Control
Assembly
DC/ACInverter
Forward
Main
Distribution
Assembly
I
Power Control [Assembly
I Power Control [Assembly
I
Load Control lAssembly
Mid Aft
Figure 2: Power Distribution System for the Space Shuttle (1 of 3 Strings).
RELATED RESEARCH
A general definition of load-flow is given: Load-flow is the steady-state solution
of equations that describe a power system. These equations can be nonlinear and
algebraic. They can be nonlinear because they express powers as a function of voltages.
They are algebraic because they describe the steady-state instead of the transient behavior
of the power system. [5]
Rafian, Sterling, and Irving [6], presented a method for load-flow analysis for
power systems by tearing the network into a number of independent subsystems. These
subsystems could then be solved in parallel resulting in a saving of time for on-line
control. This method of decomposed load-flow analysis was based on the fast decoupled
load-flow algorithm. The decomposed load-flow technique that was presented would
4-4
normallygive thegreatestreductionin time, if the system could be divided such that the
subsystems were interconnected only by a few tie lines (separating nodes).
Rafian, Sterling, and Irving [7], present a mathematical technique for the
decomposition of a set of nonlinear algebraic and differential equations which result from
a corresponding dynamic model of an electric power system. This method of load-flow
analysis for power systems again tears the network into a number of independent
subsystems. These subsystems could then be executed in parallel. This method of
decomposed load-flow analysis was based on the Newton-Raphson load-flow algorithm.
It was also shown that this decomposed load-flow algorithm was suitable for
implementation on parallel processors with a 32-bit word capability. This decomposition
technique resulted in significant savings in the simulation elapsed time.
Wang, Xiang, Wang, and Huang [8], presented two algorithms for a parallel
solution of the reduced gradient optimal power flow. The parallel solution method for
optimal power flow was accomplished by tearing the network into a number of
independent subsystems. These subsystems were then solved in parallel by using what
the authors called a two-level computer network.
Berry and Cox [9], presented a coordination decomposition method for load-flow
analysis using the Gauss-Siedel algorithm. This coordination decomposition method was
accomplished by the tearing of the electrical network into a number of independent
subsystems which could then be solved in parallel. However, this load-flow algorithm
was only simulated on a serial computer and no true parallel processing took place.
Taoka, Iyoda, Noguchi, Sato, and Nakazawa [10], presented a Gauss-Siedel
method for doing load-flow analysis on a hyper cube computer. The algorithm that was
presented consisted of solving the power flow equations in an iterative manner in order to
minimize the communication between nodes.
LOAD-FLOW
Load-flow is the name given to a network solution that shows currents, voltages,
and power flow at every bus in the system. In the load-flow problem, a relationship
between voltage and current at each bus exists and the load-flow calculation must solve
for all voltages and currents such that these relationships are satisfied. As such, the load-
flow gives the electrical response of the transmission system to a particular set of loads
and generator unit outputs. Therefore, load-flow is the solution of an electrical network
that gives the values of currents, voltages, and power flow at every bus (node) in the
electrical power system.
To start the load-flow solution process, a set of linear equations is formulated for
an electrical network. This linear set of equations is generally of the node form as
represented by equations (1).
Eli]iYllY12Yljl[il112 Y21 Y22 • E2
i Y'i 1 Yii i
(1)
Although currents entering the nodes from generators and load branches are not known,
they can be written in terms of P and E.
4-5
Ii- Pi
Ei (2)
Since currents entering the nodes are considered positive, then power flow into the node
is also considered positive, and power dissipated in the resistors would be entered as a
negative number.
Substituting equation 2 into equation 1 gives the following result.
-h-
E] EP2 I2 Y21 Y22E2 = = .
Pi Y'il
- Ei -
Yii i
(3)
hD
E1-
a2_
E2-
Ei-
YllE1 + Y12E2 + ... +YljEi
Y21E! + Y22E2 + ... +Y2jEi
YilE1 + Y12E2 + ... +YliEi
(4)
The set of equations (4) may be solved for E1 through Ei by the use of an iteration
technique•
1 P2 [Y21EI+ +Y2jEd}E2= Y22 {E2" "'"
(5)
Ei = { - [YilE! + ... +Y(i- 1)(j- DE(i - 1)]}
Note that Ei has been written in terms of itself and the other voltages. Also, one equation
has been deleted, this equation or bus is designated as the swing or slack bus. The swing
or slack bus is generally a generator that is the first to react to changes in the system•
Generally the voltage at the slack bus is specified.
DECOMPOSITION METHOD
The decomposition method for load-flow analysis was started by dividing the
electrical power system into a number of smaller systems or subsystems• These
subsystems can then be solved independently in parallel [13]. To demonstrate the
decomposition method for load-flow analysis, an example will be presented.
The electrical power system of Figure 3 was decomposed by using the admittance
matrix (Table 1). The admittance matrix was transformed so that each of the subsystems
would appear sequentially; that is, the nodes were renumbered for the electrical circuit.
This renumbering caused the network equations for each subsystem to follow the
4-6
dynamic equations for that same subsystem. Hence, for the power system of Figure 3
there were 4 different subsystems. In the admittance matrix, the diagonal elements Yii
are the summation of all admittances that surround the ith node, where Yil = Yli and is
the negative of the branch. The admittance for Figure 3 is given in Table 1.
i l
FC 1 FC 2
Bus 1 -- -- Bus 2
1.4 i R 2,4
B_ 4
R4,5 R4,6Bus 5 Bus 6
1
2
3
4
5
6
7
8
9
10
-- -- Bus 3
i R3,4
Bus 7
Figure 3: Space Shuttle Power Distribution System.
Table 1: The Admittance Matrix.
R4,7
5 6 7 8 9 10
-Y4,7
-Y5,8
Y8,8
-Y5,9 -Ys,10
Y 10,10
1 2
YI,1
Y2,2
Y3,3
-Y4,1 -Y4,2 -Y4,3
3 4
-Y 1,4
-Y2,4
-Y3,4
Y4,4
-Y5,4
-Y6,4
-Y7,4
-Y4,5 -Y4,6
Y5,5
Y6,6
-Y8,5
-Y9,5
-Y10,5
Y7,7
Y9,9
After the renumbering of Table 1 is completed the admittance matrix of Table 2 is
created. The admittance matrix from Table 2 can be expressed as the sum of two
matrices.
[Y] = [Yd + [Yc] (6)
Where [YI] are a number of block diagonal submatrices that represents a matrix for a
specific subsystem. From the [Yd matrix, it can be seen that four different sets of
equations can be produced, one set for each subsystem as shown in Figure 4 and Table 3.
4-7
1
2
3
4
5
6
7
8
9
10
1
2
3
4
5
6
7
8
9
10
Table2:
1 2 3
The Admittance Matrix Renumbered.
4 5 6 7 8
....viii -Y1A
Y2,2 -Y2A
Y3,3 -Y3A
:Y%l -Y4,2 -Y4,_ Y4,4
-Y5A
1 2
YI,1
Y2,2
-Y4.1 -Y4.2
-Y4,_
Y5_ _Ys,6 -Y5,7 -Y5,8
-Y6,5 Y6,6
-Y7,5 Y7,7
:Y8.5 Y8,8
-Y9,4
-YIoA
Table 3: The Admittance Marx [YI].
3 4 5 6 7 8
-Y1,4J
-Y2,4
Y3,3 - 3A
-Y4.3 Y4.4
YS_ -Y5,6 -Y5,7 -Y5,8 I
-Y6,5 Y6,6
- 7,5 7,
_.:Y8_.5.......... Y8.8
FC 1
slack bus-J--Bus 1
R1,4
L I
FC2 _R3US 2 FC3 TBus 3
2,4 IR3,4[ Bus 4
9 10
-Y4,9 -Y4,10
9 10
IYlo.lo
Bus4 slack bus
Bus 5_ 4'5
IR5.6 _R5,7 _R5.8
B.....us4_... slack bus
Bus 9 ,9
-_ slack bus
_ R4,10
Bus I0 [
Figure 4: Decomposed Space Shuttle Power Distribution System.
4-8
Eachof the individual subsystems shown in Figure 4 and Table 3 can be solved
independently. These subsystems require their own slack bus and one of these slack
buses will serve as the slack bus for the entire system.
The remaining elements, [Yc] of the admittance matrix are the nonzero off
diagonal elements that are the separating nodes of one subsystem with respect to the node
voltages in another subsystem. Table 4 shows the [Yc] matrix.
1
2
3
4
5
6
7
8
9
10
Table 4: The Admittance Matrix [Yc]-
1 2 3 4 5 6 7 8 9 10
-Y4,9 -Y4,10
-Y9,4
-Y 10,4
These nonzero off diagonal elements act as sending or receiving nodes for power flow
from one subsystem to another. Therefore, these nonzero off diagonal elements can be
referred to as voltage and power (EP) separating nodes. Finally, these nonzero off
diagonal elements are directly related to the number of common nodes in different
subsystems.
From the [Yc] matrix, it can be seen that at node 4 the system was divided. The
[Yc] matrix is the coordination matrix and generally this matrix will contain the
separating nodes. The function of the coordination matrix is to determine the amount of
AP and AE that the separating nodes contribute to each subsystem.
PERFORMANCE ANALYSIS
To test the decomposition method for doing load-flow analysis, simulations were
performed on power systems of 16, 25, 34, 43, 52, 61, 70, and 79 nodes. Each of the
power systems was divided into subsystems and simulated under steady-state conditions.
The results from these tests have been found to be as accurate as tests performed using a
standard serial simulator. The division of the power systems into different subsystems
was done by assigning a processor to each area. Table 5 provides the results of timing
tests for each of the power systems. The times reported in Table 5 are the times
associated with the solution of the load-flow problem (Node Time) and part of the host
I/O (Host I/O). The Node Time includes internode communication time, calculation
time, and part of the host I/O time. The Host I/O time reported in Table 5 is that part of
the total host I/O time required to print the result of the load-flow calculation to the
screen and the time to create two files, which contains the data of the timing tests that are
reported in the following tables.
4-9
1
.... L ............. ,,
Transpuler
2
T_spmer
4
Transmuter
10
.....Yranspmer
13
.__J ................... ,.....
Tran_uter
Table 5: Timin Data
16
buses
Node Time ms 56.77
Host I/E)ms 115.25
Node Time ms 33.41
Host I/_3ms 112.45
Node Time ms 33.02
Host I/O ms 102.46
Node T!mems ...... * ........... *
Host I[O ms * *
Node Time ms ...........* ..........*
Host I/O ms * *
25 34
buses buses
94.59 146.88
116.35 115.46
44.29 51.97
123.20 122.88
44.67 48.19
but:?:
2115
11%.t_
7I ._r
1 1-'-.'_
54._]
122.82 122.94 123.2:.
* 44.4_
* 25.es-
* 1lC_.<,_
The results of the timing tests that are reporte_ !-
running each program 10 times and examining the tim_-,
recorded when the program finishes [3, 4, 11]. In order t:.
like speedup and efficiency, the times reported for the si.-:.
"best case" and the times reported for the multiprocesso: ::
By reporting the "best case" times for the single processor : -
times for the multiprocessor programs would provide the
and efficiency for the multiprocessor programs.
Again from Table 5 the Node Times did not chang_ _-
these numbers are the actual times. The Host I/O numbers c:_
next. Being consistent with the idea of the "worst case" '.:._
the best "best case" time for the single processor progr_:=:
time for the multiprocessor programs. By using the best ":
processor programs and the worst "worst case" time fo-
would provide the worst "worst case" for the speec__
multiprocessor programs. This adjustment to Table 5 is giv:,-
Transputer
Table 6: Ad usted Time
16 25 34 43
buses buses buses buse,,: : "
Node Time ms • 56.77 ...._.59_ !46:88 2!!:_!!..i 71
Host I/O ms 99.84 99.84 99.84 99.ga
Node Time ms 33.41 44.29 51.97 71.6_,
............................ L
Host I/O ms 128.26 128.26 128.26 128.3_ _ '"
Node Time ms___33_02....44....67_....4..8_.I.9_.. . .._.:9.!.._.....
Host I/Ores 128.26 128.26 128.26 128.26i "
. • • 444_ , ._Node Time ms ..............................................: _:..:..........
Host I/O ms * * * 128.2_;
Node Time ms * * * 25 q'_
....................................... t_._:_-...: ....
Host I/tD ms * * * 128.2,.
Transputer
4
Transputer
10
Transputer
13
Transputer
From Table 6, Table 7 was created. Table 7 is the total "b::
processor program and the total "worst case" time for each ..':.
4-10
Oi_GINAt. PAQE I11
OF POOR QUALITY
Table 7:
16
buses
...... 1 _ Node Time.ms... 56.77
......Tran_uter Host I/O ms
Total Time ms
................_2__..................No_i._eTime ms •
.... Transputer Host I/O ms _i
Total Time ms
4 Node Time ms
99.84
156.61
33.41
128.26
161.67
33.02
"Worst Case" Total Time
25 34 43 52 61 70 79
buses buses buses buses buses buses buses
.....9,_..59 146.88 211.90 233.02 252.99 275.14 297.28
99.84 99.84 99.84 99.84 99 84.  ,84  :84
194.43 246.72 311.74 332.86 352.83 374.98 397.12
44.29 51.97 71.68 82.68 91.14 98.88 104.38
128.26 128.26 128.26 128.26 128.26i !28.26 128.26
172.55 180.23 199.94 210.94 219.40 227.14 232.64
44.67 48.19 54.91 57.92i 61.06 63.81 .... 66:2_
TotalTimems 161.28 172.93 176.45 183.17 186.18
_ !0 Node Time ms * * ...... , .... 44.42 44.99
Transputer Host I/O ms * * * 128.26 128.26
.... 4 ' '
Total Time ms * * * 172.68 173.25
....... !3. NodeTimems ...... * ...... * _. * • 25.92. 26.56
......Tran_u_r • Host I/O ms * * * 128.26 128.26
Total Time ms * * * 154.18 154.82
128.26 128.26 128.26_ 128.26 128.26L !28:2,6.!28,26j 128.26
189.32 192.07 194.50
45.63 49.02 49.54
128.26] 128.26 128.26
173.89 177.28 177.80
26.88 26.94 26.82
128.26 i28 26L 128.26
155.14 155.20 155.08
Speedup is a measure of the application software utilization of a multiple
processor system. However, the original purpose of the speedup was to compare the time
of the fastest serial program, T1, with the time of the parallel equivalent of the same
program, T(p). [ 15] This definition of speedup has a different meaning, e.g.,
T1
S(p)- T(p) (7)
The definition of the speedup from equation 7 is rarely used because of the difficulty in
measuring TI. Instead, T(1) is used as an approximation ofT1. Therefore, the speedup is
estimated from the measurement for T(1) and T(N) [15] and is def'med as follows:
T(1)
S(p)- T(p) (8)
The speedup of the different multiprocessor programs is given in Table 8. Again these
are the "worst case" values for the speedup.
Table 8: S Results
16 25 3iI 43
buses buses buses buses
1 Total Time ms 156.61 194.43 246.72 311.74
Transputer Speedup NA NA NA NA
2 Total Timems 161.67 172.55 180.23 199.94
Transputer Speedup 0.96 1.12 1.36 1.55
4 Total Timems 161.28 172.93 176.45 183.17i
Transputer r Speedup - 0.97 - iA2 1.39- 1.70i
10 Total Time ms * * * 172.68
Transputer Speedup * * * 1.80
13 Total Time ms * * * 154.18
Transputer Speedup * * * 2.02 _,
52 61 70 79
buses buses buses buses
332.86 352.83 374.98 397.12
NA NA NA NA
210.94 219.40 227.14 232.64
1.57 1.60 1.65 1.70
186.18 189.32 192.07 194.50
1.78 1,86 i.95i 2.04
173.25 173.89 177.28 177.80
1.92 2.02 2.11 2.23
154.82 155.14 155.201 155.08
2.14 2.27 2.41 2.56
4-11
Processor efficiency measures the contribution of each processor to the parallel
solution when p processors are employed. That is, E(p) equals the average efficiency per
processor when the problem is run with p parallel processors [15].
E(p) = S(p) (9)
P
Therefore an efficiency rating of 100% means the program runs in linear speedup time,
while an efficiency of 0% means the processor is of no use in solving the problem in
parallel [15].
The efficiency of the different multiprocessor programs is given in Table 9.
Again these are the "worst case" values for the efficiency.
Table 9: Final S Anc Results
16 25 34 43 52 61 70 79
buses buses buses buses buses buses buses buses
I ......Speedup . NA .... !_!A...I_.A NA NA NA NA_ NA
T_n_uter Efficiency NA NA NA NA NA NA NA NA
2 S_UP_ ......... 0.96 1.12 ....... 1.36 1.55 ..........!.,57 1.60 1.65 ....... .1_.7_0__.
Transputer Efficiency 48.0% 56.0% 68.0% 77.5% 78.5% 80.0% 82.5% 85.0%
4 . Speedup ......... 0.97 I. 12 1.39 1.70 1.78 1.86 ....... !_:95.........2:__04__
Transputer Efficiency 24.2% 28.0% 34.7% 42.5% 44.5%i 46.5% 48.7% 51.0%
I0 ........... Speedup * . * *.............1.80_ 1.92 _ 2.02 2.11 .... 2_..23__.
Transputer Efficiency * * * 18.0% 19.2% 20.2% 21.1% 22.3%
13 Speedup * * * 2.02 2.!4_ 2.27 2.41 2.56
.....Transputer Efficiency * * * 15.5% 16.4%: 17.4%1 18.5% 19.6%
FAULT-TOLERANCE
Fault tolerance is a class of methods to achieve dependable and reliable
computing systems. Dependability is that property of a computer system that allows
reliance to be justifiably placed on the service a system delivers [12, 14]. Fault tolerance
is the ability to deliver a service in spite of faults. Fault tolerance is generally obtained
from redundancy. This redundancy can be accomplished by masking (static) redundancy
or dynamic redundancy [12, 14].
Masking (static) redundancy uses extra components to obscure, or mask, the
effect of a faulty component [12, 14]. This technique is regarded as static because once
the redundant copies are connected their interconnections remain fixed. The error
resulting from faulty components are masked by the presence of other copies of those
components. An example of masking redundancy is the computer system for the Space
Shuttle. There are four different computers on the Space Shuttle. Three of these
computers execute the same program using the same data while the fourth computer runs
a different program which performs the same function as the other three. The results of
the three computers running the same program are compared or voted on, if any one of
the result from one of the computers is inconsistent with that of the others, it is masked
out.
The reason that the fourth computer has a different set of software that performs
the same function as the other three is to grade against an error in programming. This is a
frorm of dynamic redundancy. Dynamic redundancy refers to systems which can be
reconfigured in response to a fault. If the fourth computer results were different from the
results of the other three computers on the Space Shuttle, the fourth computer would take
4-12
over and mask out the others, thereby reconfiguring the computer system from a
multiprocessor to a single processor computer system.
Another fault tolerant technique is fault detection, but fault detection provides no
tolerance to faults, rather it gives warning when they occur [12, 14].
The reason for this introduction to fault tolerant system was to show how they
could be applied to this research. The goal of this research is not to design a fault tolerant
computer system but to apply parallel and distributed processing to construct an electric
power management system (EPMS) for space based power systems. In the development
of this EPMS, two different architectures for doing the decomposition method for load-
flow were tested. These results have been presented in Tables 5 through 9. They were
the 10 and 13 transputer systems and are presented in Figures 5 and 6.
Figure 5: 10 Transputer System
Figure 6:13 Transputer System
From Figures 5 and 6 it can be seen that processor 0 is the critical processor. If
processor 0 were to fail, both systems in Figure 5 and 6 would fail. However if processor
1, 2, or 3 were to fail in Figure 6, the processors which feed into any of these processors
would be removed from service. An alternative system which is given in Figure 7 would
use 12 processors. The system in Figure 7 would operate the same as the system in
Figure 5 with the exception that processors 10 and 11 would perform the same function
as processor 0. This would provide a static redundancy for processor 0 and would
eliminate the risk of a fault on the middle layer of processors in Figure 6. The system in
4-13
Figure7 was not tested so no timing data are available at this time, but the point is made
that fault tolerance can be addressed in this design.
Figure 7: Proposed Fault Tolerant System
CONCLUSIONS
This paper has presented preliminary results for a load-flow analysis using a
decomposition principal. It has been shown that the decomposition algorithm for load-
flow analysis is well suited for parallel processing and provides increases in the speed of
execution. Also two different hardware structures for doing the decomposition method
were presented (Figures 5 and 6) and they were analyzed for speedup and efficiency. A
fault tolerant design was also proposed (Figure 7) which made use of static redundancy.
Further work will be conducted on the decomposition principal by the addition of
more transputers to the present system. This will be done to see whether or not further
decomposition by the addition of more processors will be of any benefit. Also with the
addition of more transputers, a hyper cube architecture could also be investigated; this
approach was presented by Taoka, Iyoda, Noguchi, Sato, and Nakazawa [10].
REFERENCES
[I] IEEE Committee Report, "Parallel Processing in Power Systems Computation,"
IEEE Transactions On Power Systems, Vol. 7, No. 2, May 1992, pp. 629-637.
[2] A.M. Tyrrell and J.D. Nicoud, "Scheduling And Parallel Operations On The
Transputer," Microprocessing and Microprogramming, Vol 26, 1989, pp. 175-
185.
[3] Express 3.0. Introductory. Guide Macintosh, ParaSoft Corporation, 2500, E.
Foothill Blvd., Pasadena, CA 91107, 1988, 1989, 1990.
[4] Ex_m'ess 3.0. User's Guide, Version 3.0, ParaSoft Corporation, 2500, E. Foothill
Blvd., Pasadena, CA 91107, 1988, 1989, 1990.
[51 Y. Wallach, Calculations And Pro_ams For Power System Networks. Prentice-
Hail, Inc., Englewood Cliffs, New Jersey 07632, 1986.-
4-14
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
M. Rafian, M.J.H. Sterling, and M.R. Irving, "Decomposed Load-Flow
Algorithm Suitable For Parallel Processing Implementation," lEE Proceedings,
Vol. 132, Pt. C, No. 6, November 1985.
M. Rafian, M.J.H. Sterling, and M.R. Irving, "Parallel Processing Algorithm For
Power System Simulation," lEE Proceedings, Vol. 135, Pt. C, No. 6, November
1988.
L. Wang, N. Xiang, S. Wang, and M. Huang, "Parallel Reduced Gradient Optimal
Power Flow Solution," Electric Power Systems Research, Vol. 17, 1989, pp. 229-
237.
F.C. Berry and M.D. Cox, "A Technique For Load Flow Analysis On A Power
System," IECEC Proceedings, Washington, D.C., 1989.
H. Taoka, I. Iyoda, I., H. Noguchi, N. Sato, and T. Nakazawa "Real-Time Digital
Simulation For Power System Analysis On A Hyper Cube Computer," IEEE
Transactions On Power Systems, Vol. 7, No. 1, Feb. 1992, pp. 1-7.
F.C. Berry, "Express, A Tool for Teaching Parallel Processing," Computers In
Education Journal, Computers In Education Division Of ASEE, Accepted, 1992.
D.P. Siewiorek, "Architecture of Fault-Tolerant Computers: An Historical
Perspective," Proceedings Of The IEEE, Vol. 79, No. 12, Dec. 1991, pp. 1710-
1734.
H.M. Chen and F.C. Berry, "Parallel Load-Flow Algorithm Using A
Decomposition Method For Space Based Power Systems," IEEE Transactions on
Aerospace and Electronic Systems, Accepted, 1993
L.J.M. Nieuwenhuis and G.D. Blom, "Fault Tolerant Computing With
Transputers And Occam," Real-Time Systems With Transputers, H. Zedan, Ed.,
1990, IOS Press, pp. 108-118.
T.G. Lewis and H. E1-Rewini, Introduction To Parallel Computing, Prentice Hall,
Englewood Cliffs, New Jersey 07632, 1992.
F.C. Berry, R.F. Gasser Jr., and H.M. Chen, "Course Grain Parallel Processing for
the Management of a Space Based Power System," IEEE Transactions on
Aerospace and Electronic Systems, Accepted, 1993.
4-15
