ABSTRACT
INTRODUCTION
System testing is the most complex and intricate among all types of program testing. The complexity of system testing can possibly be attributed to the fact that it involves testing a fully integrated system that may be large and complex. Therefore, with continually increasing system sizes, the issue of automatic design of system test cases is assuming prime importance [26] . A properly generated test suite may not only locate the errors in a software system, but also help in reducing the high cost associated with software testing [16] . Many present day software solutions are state based. In such systems, the system behavior is determined by its state. In other words, a system can respond differently to the same event in different states. Therefore, unless a system is made to assume all its possible states and tested, it would not be possible to uncover state-based bugs. Adequate system testing of such software requires satisfactory coverage of system states and transitions. Generation of test specifications to meet these coverage criteria can be accomplished by using the state model of a system. However, it is a non-trivial task to manually construct the state model of a system. The state model of an actual system is usually extremely complex and comprises of a large number of states and transitions. Possibly for this reason, state models of complete systems are rarely constructed by system developers [26] .
Testing activities consist of designing test cases that are sequences of inputs, executing the program with test cases and examining the results produced by this execution. Testing can be carried out earlier in the development process so that the developer will be able to find the inconsistencies and ambiguities in the specification and hence will be able to improve the specification before the program is written [11] . Unified modeling language (UML) has emerged as the de facto standard for modeling software systems and has received significant attention from researchers as well as practitioners. UML models are popular not only for designing and documenting systems, the importance of UML models in designing test cases has also been well recognized. Even though UML models are intended to help reduce the complexity of a problem, with the increase in product sizes and complexities, the UML models themselves become large and complex involving thousands of interactions across hundreds of objects [15] . The important part of quality control in the software life-cycle is testing. As the complexity and size of software increase, the time and effort required to do sufficient testing grow. Manual testing is time-consuming and error prone. So, there is a pressing to automate the testing process. The testing process can be divided into three parts: test case generation, test execution, and test evaluation. The latter two parts are relatively easy to automate provided that the criteria for passing the tests are available. However, to determine which tests are required to achieve a certain level of confidence is not trivial [17] .
Model-based testing [5] has grown in importance. Models are specified to represent the relevant, desirable features of the system under consideration (SUC). These models are used as a basis for (automatically) generating test cases to be applied to the SUC. Typical models that are used for representing system behavior are unified modeling language, finite state machines, state charts etc. [3] .
With this motivation, we aim our work at deriving the test sequence from state transition diagram and maximizing state or node coverage. Also our method minimizes the size of test, time and cost, while preserving test coverage. The rest of the paper is structured as follows: A brief discussion on UML diagrams, which are relevant to our paper is described in the Section 2. Then, we discuss some testing coverage criteria in Section 3. Section 4 represents some concepts, notations and definitions of state chart diagram. In Section 5, we explain the overview of our proposed method for construction of state transition graph, generation of test sequence using state charts and how node coverage is calculated for each test case. Section 6 provides the working of our methodology with the ATM (Automatic Ticket Machine) case study. Section 7 discusses some related work. Finally, Section 7 presents the conclusion and future work of this paper.
UML DIAGRAMS
In this section, we discuss some basic concepts which will be used subsequently in our paper. UML, Unified Modeling Language is a visual language that has been developed to support the design of complex object-oriented systems. Since its introduction in the late 90s, it has undergone several revisions. The latest release being UML version 2.0, which adds several new capabilities to UML 1.x. In this section, we restrict our review to only those developments that are directly relevant to our work. A typical software procedure incorporates all the three aspects: It uses data structure (class model), it sequences operations in time (state model), and it passes data and control among objects (interaction model). The three kinds of models separate a system into different views. The different models are not completely independent but a system is more than a collection of these independent parts. UML specification defines two major kinds of UML diagram: structural diagrams and behavioral diagrams.
The elements in a structural diagram represent the meaningful concepts of a system, and may include abstract, real world and implementation concepts. Behavioral diagrams show the dynamic behavior of the objects in a system, which can be described as a series of changes to the system over time.
UML state chart diagram
In this section, we explain the few fundamentals on state chart diagram. The name of the diagram itself clarifies the purpose of the diagram and other details. It describes different states of a component in a system. The states are specific to a component or object of a system. A state chart diagram describes a state machine. Now to clarify it state machine can be defined as a machine which defines different states of an object and these states are controlled by external or internal events. State chart diagram is one of the five UML diagrams used to model dynamic nature of a system. They define different states of an object during its lifetime. And these states are changed by events. So state chart diagrams are useful to model reactive systems. Reactive systems can be defined as a system that responds to external or internal events. State chart diagram describes the flow of control from one state to another state. States are defined as a condition in which an object exists and it changes when some event is triggered. So the most important purpose of State chart diagram is to model life time of an object from creation to termination. State chart diagrams are also used for forward and reverse engineering of a system. But the main purpose is to model reactive system. State diagrams are used to give an abstract description of the behavior of a system. This behavior is analyzed and represented in series of events that could occur in one or more possible states. Hereby "each diagram usually represents objects of a single class and tracks the different states of its objects through the system". State diagrams can be used to graphically represent finite state machines. Followings are the main purposes of using state chart diagrams:
 To model dynamic aspect of a system.  To model life time of a reactive system.  To describe different states of an object during its lifetime.  To define a state machine to model states of an object. 
Transition:
Arrows, denote transitions. The name of the event (if any) causing this transition is written as the labels with the transition names or event names. A guard expression may be added before a "/" and enclosed in square-brackets (event Name [guard Expression]), denoting that this expression must be true for the transition to take place. The general format of the label of a transition is as follows:
An event:
An event is an occurrence at a point of time. Events often correspond to verbs in the past tense e.g. (power turned on, alarm set) [4] . Events include error conditions as well as normal occurrences, e.g. motor jammed, transaction aborted etc.
TEST ADEQUECY CRITERIA
Testing coverage / adequacy criterion specifies the requirement of a particular testing, and can be used as an objective measurement of the test case. In traditional software code testing, the definition of testing adequacy is defined as a measurement function. The case of UML state chart diagrams is different because it is in the form of model instead of code. Especially the coverage of state chart diagram is little bit complex. In this paper, we propose different types of coverage metrics as follows:
 State Coverage requires that all the state nodes in a state chart diagram be covered. The value of state coverage is the ratio between the covered states and all the states in the state chart diagram.
 Action Coverage:
In order to generate test cases with respect to action coverage we construct as many marked specifications as there are actions within the specification.
 Transition Coverage requires that all the completion transitions in a state chart diagram be covered. The value of transition coverage is the ratio between the checked transitions and all the transitions in the state chart diagram.
 Path Coverage requires that all the paths in a state chart diagram be covered. The value of path coverage is the ratio between the traversed paths and all the paths in the diagram.
 Condition coverage. A decision consists of conditions separated by logical operators (e.g. and, or). A single condition is covered, if it evaluates to both true and false at some point during test execution. Decision coverage has also been called branch coverage or predicate coverage. This means that 100% condition / decision coverage is achieved if all conditions evaluate to true and to false and if every decision also evaluates to true and to false during the test execution.
SOME BASIC DEFINITIONS
There are many forms of state diagrams, which differ slightly and have different semantics. For a deterministic finite state machine (DFA), nondeterministic finite state machine (NFA), generalized nondeterministic finite state machine (GNFA), Mealy machine or Moore machine, the input is denoted on each edge. For a Mealy machine, input and output are signified on each edge, separated with a slash "/": "1/0" denotes the state change upon encountering the symbol "1" causing the symbol "0" to be output. For a Moore machine the state"s output is usually written inside the state"s circle, also separated from the state"s designator with a slash "/". There are also variants that combine these two notations.
It is assumed that a state chart STc is correct in the sense that for each state s ∈ S simple there exists a sequence of transitions t 1 , t 2 ..., t k so that source(t 1 ) ∈ Si and target(t k ) = s and for each state s ∈ S simple there exists a sequence of transitions t 1 , t 2 ..., t k so that source(t 1 ) = s and
The following terms will be used to describe our technique.
Definition 1.
A state chart can be a quadruple S c = (E, S t , H, T), where E is a finite set of events and St = (S, S i , S f ) is a triple of set of states with S as a finite set of states, S i ⊆ S denoting the entries (initial states) and S f ⊆ S the exits (final states), H ⊆ S × S is a hierarchy relation, a binary relation on the set S forming a tree. For an element (s, s′ ) ∈ H holds, that a state s is an immediate sub state of state S′.
T ⊆ S × E × S is a finite set T of transitions. The set of states S is composed of disjoint sets of simple states S simple and composite states S comp .
Definition 2.
A transition pair TP = (t, t′ ) with t, t′ belongs to T legal is a sequence of a legal incoming transition to a legal outgoing transition of a (simple) state so that ∃ s ∈ S simple : t ∈ in(s) U t′ ∈ out(s).
Definition 3.
A false transition pair FP = (t, t′ ) with t ∈ T legal and t ′ ∈ T faulty is a sequence of a legal incoming transition to a faulty outgoing transition of a (simple) state so that ∃ s ∈ S simple : t ∈ in(s) U t′ ∈ out(s). , where I is the initial state of the system at which the test data is input, S is the test data input to the system and O is the expected output of the system [18] , [19] . The output produced by the execution of the software with a particular test case provides a specification of the actual software behavior. A test case is also characterized by an ordered pair of an input and an expected output of the SUC.
Definition 7:
A test suite is a set of test cases. A single test case in most cases may satisfy more than one test obligation. For instance, a test case used to cover a certain state of interest may also cover other states during its execution. This then provides for a way to reduce the size of the final test suite by choosing a subset of test cases that preserves the coverage obtained by the full test suite [9] . Based on Definitions 4 and 5 the following two coverage criteria are introduced:
Definition 8: k-transition coverage (k-TC) generates complete transition sequences that sequentially conduct all legal transition sequences of length k ∈ N.
Definition 9: Faulty transition pair coverage (FTC), generates complete faulty transition sequence for each faulty transition and faulty transition pair [13] .
Definition 8 guarantees that all possible (legal) transition sequences of length k will be tested. A test suite consisting of all transition sequences of a fixed length k does not necessarily cover a set of all sequences of length i ∈ 1, ...k-1 as there may exist sequences of length i that cannot be expanded to length k. Definition 9 guarantees that all potential malfunctions will be tested. Typically, state based test generation methods focus on some form of coverage, for instance on covering transitions [19] , [20] or on transition coverage and state identification [6] , [14] . Our approach creates all transition sequences of length k including all shorter sequences of length 1, ..., k -1 that cannot be found in longer sequences.
GeMiTefSc-OUR PROPOSED APPROACH TO GENERATE AND MINIMIZE TEST SEQUENCE FROM STATE CHART DIAGRAM
In this section, we discuss a overview of our proposed approach to generate test sequence from UML state chart diagram and then, we optimize the test node coverage while minimizing time and cost. We have named our approach, Generation and Minimization of test cases from State Charts (GeMiTefSc).
Our approach consists of 5 steps. The schematic representation of our approach is shown in fig.2 .

Step 1: Building model of the system.
Step 2: Constructing the transition graph from state chart diagram.
Step 3: Extracting the required information from the transition.
Step 4: Generating test cases.
Step 5: Minimize a set of test cases by calculating node coverage for each test sequence.
In the next section, we discuss each step in detail, by using different algorithms for each step in different sections. Before that we present some related definitions. 
Building model of the system.
State diagram is graph where nodes represent states end the directed arcs that interconnect states represent transitions. It also models dynamic behavior, and captures the different states that an object can be in, and its response to various events that may arise in each of its states. State chart diagram is one of the 13 UML diagrams used to model dynamic nature of a system. They define different states of an object during its lifetime. The notation and semantics of UML state diagrams are substantially based on State charts modified to include object-oriented features [12] . The states and the transition of a system are important to set up a state diagrams from system. Fig. 3 shows a UML state diagram for a ticket machine system. 
Constructing the transition graph from state chart diagram
Here, we convert the state diagram into state transition graph. A state transition graph TG = (V t , E d ).
Definition 10:
A transition graph TG = (V t , E d ) represents a directed graph consisting of a set of vertices (V t ), a set of directed edges (E d ). In TG, nodes represent states and edges represent transitions between states. Without any loss of generality, we assume that there is a unique node that corresponds to the initial state and that one or more nodes represent the final states. The initial state is represented as the root of the tree. States at each level of nesting are considered as a sub graph. We represent this step through an algorithm.
Algorithm-1
Input: Sc = (E, S t , H, T) Output: A transition graph TG = (V t , E d )
1.
V := {t i , t j }, A := NULL Finally, the transition graph is used as a source for the graphical traveling salesman problem and it is augmented by an additional edge (t i , t j ). This edge and the preconditions for a state chart claimed in the last section ensure that the resulting graph is strongly connected.
Extracting all required information from the graph
Here, in this subsection, we present how to extract all the information which are required to generate the test sequences.
Input: Transition graph TG = (V t , E d )
Output: A set of stages or node ST, set of input data ID, set of output data OD and set of transition TR. [23] .A test case set fulfilling k-transition coverage for k > 1 is computed by transforming the transition graph stepwise and then applying the graphical traveling salesman problem. Also by computing all complete transition sequences whose length is smaller than k and that cannot be expanded to longer sequences, a minimal test case set fulfilling k-transition coverage for all k ∈ 1, ..., n is achieved. This proceeding is described in Algorithm-2, in the next section.
Generating test cases
After, a transition graph is constructed based on a state chart Sc. If n = l, the graphical traveling salesman problem can be applied directly. If n is greater than 1 the transition graph has to be transformed k -1 times. The resulting graph represents all possible sequences of transitions of length k. Additionally, all sequences of length k-1 are computed that cannot be expanded to longer sequences. These sequences are characterized by the fact that the corresponding vertex representing that sequence is solely connected with vertices t i and t j. The functions indeg(v) := v′,(v′, v) ∈ A and outdeg(v) := v′(v, v′) ∈ A are used to compute these vertices. As these sequences are already complete, they can be added to the set TS. TS := ϕ
Algorithm-2: Generation of a test case

15.
For k := 2 to n do 16. 
Minimizing the test cases
Here, in this section , we discuss how the generated test cases are reduced while maximizing test coverage. Though Wang"s algorithm [17] is widely used, but it does not cover other critical attributes, like defect id, dependency and automated test case indicator. In order to generate an effective size of generated test cases. This step contains two subactivities, which are  calculate node coverage for each test case. Let NC(tc) = t 1 , t 2 ,...t n for NC(tc) to be a set of test cases that tc is covered by t 1 , t 2 ,...t n . Hence, if a number of set tc is zero, then tc is included in the effective set of test cases.
 select effective test cases. Now, we present this step through an algorithm. 
The resulting graph TGout = (V tout, Edout ) contains a vertex v = (t 1 , ..., tk) for each transition sequence of length k. Two vertices v = (t 1 , ..., t k ) and v′ = (t′ 1 , ..., t′ k ) are connected by a directed edge if it holds that (t 2 , ..., t k ) equals (t′ 1 , ..., t′ k-1 ) for each component. These two vertices thus represent the sequence (t 1 , ..., t k , t′ k ).
WORKING OF OUR ALGORITHM
In this section, we explain the working of our algorithm using an automatic ticket machine example as described below.
The ATM(Automatic Ticket machine)issues tickets to passengers on receiving money from them. The state chart diagram of an automatic ticket machine, ATM object for various events of interest are shown in Fig 3. The object enters into idle state, when the power switch is on. Once the user selects an ticket type button in the menu, the object enters the display destination state, where the ticket fare to different destinations are displayed. The user can select the destination, ticket type and the number of persons(n) to travel. The condition n≤6 is inserted for the event tickettypeselected, as the ticket machine is not expected to issue a ticket for more than 6 persons in one transaction. Once the ticket type and number of persons required are selected, the object enters the display money state. In this state, the object displays the amount of money (totalfair) the user has to enter into the ticket machine. Note that totalfair = ticket fare × number of persons. As the user enters money (amt) into the machine, the machine object changes its state to busy. In the busy state, it calculates how much balance or change (chng) has to be returned to the user if any, where chng = (amt -totalfair). If the change balance is less than zero, the machine object changes its state from busy to return money as the money inserted is insufficient. If the change balance is more than or equal to zero, the machine object goes to issueticket state and issues the ticket for requested number of persons. If the balance is zero, then once the ticket is issued the machine object changes its state from issueticket to idle. If the balance (chng) is more than zero, the machine object enters into the return money state, where the change balance money is returned. Once the money is returned, the machine object again enters into idle state. Here the state chart diagram of the machine is given in the figure 3.
After constructing state chart diagram, we construct transition graph using algorithm-1. The transition graph is shown in figure 4 . In next step, we extract all required information from the state transition graph as described below. cases like tc1, tc2, tc3, tc4, tc6, tc8, tc9, tc10, tc11, tc12, tc14,  tc15, tc16, tc17, tc19, tc20, tc21, tc23, tc24, tc26 should be ignored. Hence, the remaining effective set of test cases is TS = {tc5, tc7, tc13, tc18, tc22, tc25}.
Reduction
In practical situations, there is often insufficient time for thorough testing activities within industrial projects. Hence, it is reasonable to try to reduce the size of generated test suites. However, the effect of the reduction on the fault-detection ability of the test suites should be small. The techniques proposed in this paper can be used to apply reduction during test case generation. A single test case may cover more than the coverage item it has been generated for. When using a probe based technique as described in this paper it is easy to identify all items covered by a particular test case.
RELATED WORK
A lot of studies have investigated the effect of test-set reduction on the size and fault finding capability of a test set.
In an early study, Wong et al. [27 ] address the question of the effect on fault detection of reducing the size of a test set while holding coverage constant [27] , [28] . They randomly generated a large collection of test sets that achieved block and all-uses data flow coverage for each subject program. For each test set they created a minimal subset that preserved the coverage of the original set. They then compared the fault finding capability of the reduced testset to that of the original set. Their data shows that test minimization keeping coverage constant results in little or no reduction in its fault detection effectiveness. This observation leads to the conclusion that test cases that do not contribute to additional coverage are likely to be ineffective in detecting additional faults.
To confirm or refute the results in the Wong study, Rothermel et al. [24] performed a similar experiment using seven sets of C programs with manually seeded faults [24] . For their experiment they used edge-coverage [7] adequate test suites containing redundant tests and compared the fault finding of the reduced sets to the full test sets. In this experiment, they found that [2] the fault-finding capability was significantly compromised when the test-sets were reduced and [25] there was little correlation between test-set size and fault finding capability. The results of the Rothermel [24] study were also observed by Jones and Harrold in a similar experiment [10] .
Offutt and Abdurazik [19] , [20] developed a technique for generating test cases from UML state diagrams. They have highlighted several useful test coverage criteria for UML state charts such as: (1) full predicate coverage, (2) transition coverage etc.
Kansomkeat and Rivepiboon [11] have introduced a method for generating test sequences using UML state chart diagrams. They transformed the state chart diagram into a flattened structure of states called testing flow graph (TFG). From the TFG, they listed possible event sequences which they considered as test sequences. The testing criterion they used to guide the generation of test sequences is the coverage of the states and transitions of TFG.
Kim et al. [12] proposed a method for generating test cases for class testing using UML state chart diagrams. They transformed state charts to extended FSMs (EFSMs) to derive test cases. In the resulting EFSMs, the hierarchical and concurrent structure of states are flattened and broadcast communications are eliminated. Then data flow is identified by transforming the EFSMs into flow graphs, to which conventional data flow analysis techniques are applied. These different results are difficult to reconcile and the relationship between coverage criteria, test-suite size, and fault finding capability clearly needs more study. In the experiment discussed in this paper we attempt to highlight some additional issue. Our work is different in some respects. First, we are not studying testing of traditional programs, we are interested in test-case generation, test case minimization by calculating node coverage for each test case and testing of specifications. A single test-case in most cases may satisfy more than one test obligation. For instance, a test case used to cover a certain state of interest may also cover other states during its execution. It provides a way to reduce the size of the final test-suite by choosing a subset of testcases that preserves the coverage obtained by the full test-suite.
CONCLUSSION AND FUTURE WORK
In this paper, first, we build a state chart model of our system under test. Next, we derived state transition graph from state chart diagram. Then, all required information are extracted from the graph. Then, the test cases are generated by applying Wang"s algorithm. Lastly, a set of test cases are minimized by calculating node coverage for each test case and it is determined that which test case are covered by other test cases. In this way, this paper introduces efficient test generation technique to optimize test coverage by minimizing time and cost.
