Abstract
The aim of this paper is the analysis of the property of the relaxed structurally boundedness of the unobservable subnet of the Petri net which brings a condition guaranteeing the finitude of all possible sequence lengths in the context of an on-line estimation in Partially Observable Petri Nets relevant to a sliding horizon or a receding horizon starting from the initial marking. Based on specific invariants defined over the real numbers, the approach focuses on an offline structural analysis, that is, the determination of the parts of the unobservable subnet where an online estimation for any criterion can be made. The decomposition-composition technique is based on a block triangular form obtained with any technique. The composition of the substructures leads to a propagation of the relaxed structurally boundedness property through the structure. The study of a large-scale manufacturing system shows that the direct treatment of the large system system can be avoided and that the triangular form brings a sequential treatment allowing a computation based on smaller systems independently of the resolution of the complete system.
1 Introduction
In this paper, we focus on the analysis of the modelling conditions such that any possible unobservable event sequence leading to an observation presents a finite length in the estimation problem. Finite lengths are necessary in fault diagnosis Lefebvre (2014) and diagnosability problem Basile et al. (2012) where all the sequences which end in an observation or a fault event must be finite. Indeed, if the estimation considers an observation relevant to a given finite time, the length of all the possible sequences starting from the initial marking and finishing in this event must also be finite, otherwise, the modelling of the estimation problem is not sufficient to characterize the relevant unobservable sequences. This situation which can affect the estimation and the fault detection, can be produced by the lack of sensors or inadequate positions of the sensors in the Petri net. The presence of a finite length can be interpreted not as a condition of the uniqueness of the trajectory Aguirre-Salas et al. (2002); Arichi et al. (2015), but as a necessary condition contributing to the estimation. Let us develop the motivations of this paper. The context is the sequence estimation problem (Problem 1) whose main points are presented below. The problem traited in this paper is described in the sequel (Problem 2).
-
1.
The event sequences (the events considered in the sequences are only the transition firings throughout the article) are represented with the counter form which is an efficient tool developed in the field of dioid algebra (max-plus algebra, min-plus algebra,...) for originally Timed Event Graphs. Based on Libeaut (1996), the study Declerck and Bonhomme (2014) shows that for estimation in Timed Petri nets, all the unobservable sequences (for any length and rank of the events) can be described with a unique polyhedron which can be easily deduced from the structure of the Petri net. This possibility is extended to Untimed Petri nets under a weak condition in Declerck (2021) which allows to avoid the combinatorial building of a reachability graph or a similar graph.
-
2.
As a unique polyhedron which describes all the sequences can be established, a classical technique of optimization which generates optimal solutions can be applied. A natural objective is to optimize an arbitrary criterion which can present the form of a linear weighting of the transition firing numbers. The aim is not to make the time-consuming computation of all the possible event sequences but only the pertinent sequences useful for the considered problem. The well-developed theory of optimization can be exploited and standard algorithms of linear programming and integer linear programming can be used. Possibly leading to the state space explosion, the burdensome enumeration of all the numerical solutions as all possible markings reachable from the initial marking is avoided.
-
3.
A practical advantage of an estimation based on transition firings is the possibility to define different criteria. The criterion can be a minimum or maximum number of firings in the fault detection presented in Chouchane et al. (2018). But criteria outside diagnosis can be defined as a criterion can be a balance-sheet, that is a global price depending on the costs and gains provided by the tasks Declerck (2021). Another criteria is the least cost where each transition has a nonnegative cost in labeled Petri nets Yue et al. (2019); Li et al. (2022). Potentially, each criteria can lead to a practical application of the estimation as the sequences can model faulty behaviors (simple faulty event, multiple faults, intermittent faults, faulty behaviors,.... ) but also the repair of a system after the occurrence of a fault and any normal behaviors Gougam et al. (2017).
Therefore, an interesting aim is the on-line estimation of optimal current subsequences in Partially Observable Petri Net, that is, the determination of sequences of unobservable transition firings which are coherent with the observed label sequence produced by the observable transitions and are (preferably) optimal with respect to a criterion which can present the form of a linear weighting of the transition firing numbers (Problem 1).
An important point is that the estimation of optimal sequences is based on the existence of a time horizon (corresponding to a sequence length) which is sufficient to describe all the sequences and to write a finite set of relations describing the model with the counter form (System (8) in Declerck (2021)). In the timed case, the successive dates of observations are known and the time horizon is simply the difference between the dates relevant to the beginning and the end of the desired horizon Declerck and Bonhomme (2014). This technique is used in Declerck and Bonhomme (2023) which presents an application in P-time Petri nets which facilitates the schedulability of estimated sequences, that is, the determination of possible firing dates of a given sequence without token deaths. But, this way cannot be used in the untimed case where the dates of observations are unknown by assumption. A possible solution is to take an arbitrary large horizon but this technique does not guarantee that this value is sufficient for each considered estimation problem and increases the size of the treated system and consequently the execution time of the corresponding computations. So, the aim is to determine appropriate horizons which allow to express every subsequence. The paper Declerck (2021) presents Integer Linear Programming Problems and problems relaxed over the real numbers which provide temporary guaranteed horizons adapted to the current evolution.
However, to exploit the previous results, the computation of these problems must converge to a finite horizon. In that case, any possible sequence presents a finite length in the estimation problem: a finite set of relations describing the model with the counter form can be written and any criterion can be considered. As Declerck (2021) shows that the horizon is always finite when the unobservable induced subnet is Relaxed Structurally Bounded (RSB), the objective of this paper is the analysis of the Relaxed Structurally Boundedness property (RSB property: the same acronym is taken) based on an offline analysis of the specific invariants defined over the real numbers (Problem 2). Particularly, different questions emerge in the following cases.
- When the incidence matrix describing the unobservable induced subnet is non-RSB, the resolution treating the complete system is not guaranteed. However, the analysis of some examples shows that an estimation for any criterion can be made on some subsystems and an objective is the determination of the RSB parts. A corollary question is the improvement of the non-RSB parts.
- Petri nets can describe large scale systems as production systems Becha et al. (2013) and transportation systems (railway systems Defossez et al. (2008)). This case cannot be treated by the estimation approaches which suffer of the state explosion particularly when the estimation tries to determine all the numerical solutions of the state space. An efficient way to cope with the complexity in modelling large-scale systems consists in decomposing it into subsystems and refining the results Schmidt (2003); Aybar and Iftar (2009); Baccelli and Gaujal (1996); Savi and Xie (1992). Clearly, a resolution considering smaller systems improves the numerical efficiency of the chosen technique.
In this context, this paper proposes a structural approach presenting two phases.
-
1.
The decomposition of the structure into substructures leading to a triangular form where the blocks can be non-square (Tables 2 and 4 present this triangular form). Obtained with any technique, this practical structure can be obtained if the Petri net is an association of Petri subnets. This triangular form can also be deduced with the Dulmage-Mendelsohn decomposition Dulmage and Mendelsohn (1959, 1958) (DM decomposition) presented in the appendix which completes the paper.
-
2.
The composition of the algebraic substructures and the relevant analysis of the RSB property. The goal is to integrate the substructures together to determine the property of the whole system.
Related problems.
Let us describe some other problems related to Problem 2. As the definition of relaxed structurally boundedness concerns the sequences, this definition is fundamentally different from the classical definitions characterizing the marking as the (marking) boundedness, the structurally boundedness (the number of tokens in each place does not exceed a finite natural number for any marking reachable from any finite initial marking) and deadlock structurally boundedness (it leads to a deadlock state from any finite initial marking) even if the relevant algebraic expressions are close Murata (Apr. 1989). Moreover, the definition of RSB which is over the real numbers and not the integers considers the unobservable induced subnet and not the complete Petri net. More details can be found in Declerck (2021) and Example 1 below. A problem presenting a similarity with Problems 1 and 2 is the study Lefebvre (2014) which proposes the maximal size of all the elementary firing sequences by determining offline the maximal length of the paths in the induced unobservable graph which is obtained according to the reachability graph or the coverability graph. Howevever, the construction of these graphs suffers from the well-known state explosion problem related to the exponential space and time complexity which limits the application of this technique to modest-size Petri nets Ahmad et al. (2009). The approach proposed in this paper is complementary to this approach as the online computation of the guaranteed horizons provides different temporary bounds which are adapted to the current evolution. In addition, the assumption of the existence of a finite horizon is not taken as the objective of problem 2 is precisely the analysis of this existence in the context of the paper.
Another similar problem is diagnosability which is the property of a partially observed dynamical system that asserts whether it is always possible to diagnose with certainty the occurrence of an anticipated failure. This problem focuses on the detection of a given event Basile et al. (2012) or a given event pattern Gougam et al. (2017). Problem 1 presents another aim as: the objective is the determination of a current majorant which depends on the observations and must be computed online. Another problem is observability which presents different definitions in the literature as sequence estimation or marking estimation can be considered. In max-plus algebra, the structural observability expresses only a dependence between the unobservable events and the observations Baccelli et al. (1992) while different papers consider a constraining definition based on the uniqueness of the marking for any known initial marking Ru and Hadjicostis (2010) or the uniqueness of the unobservable sequence and marking Aguirre-Salas et al. (2002); Arichi et al. (2015). Contrary to these previous studies, the general aim of this paper is the determination of a possible optimal sequence in the solution space and not the detection of a unique sequence of transition firings. Observable places are not considered in this paper and the initial marking is assumed to be known.
The paper is organized as follows. The preliminary notations are given in Section 2 and Table 1 which summarizes the main notations. The beginning of Section 3 presents the principle of the computation of a guaranteed horizon, that is, a horizon which is sufficiently large such that any possible sequence is modelled in the estimation problem Declerck (2021). This section is completed by different results. A comparison of the definition of RSB, structural boundedness and deadlock structurally bounded Petri nets is made in Section 3.3. Then, the composition of substructures and the propagation of the RSB property through the triangular form is analyzed in Section 4. The elementary Example 1 illustrates the main concepts of the problem, while the other examples focus on the structural aspect. Well-known in the literature, the case study of Example 3 is a classical manufacturing system which is a large scale system. The reading of the paper is largely independent of the appendix which presents the DM decomposition yielding a triangular form as in Tables 2 and 4. As the paper focuses on the sequence length, the state estimation studied in many references is out the scope of this paper Dotoli et al. (2009); Komenda et al. (2018); Boussif et al. (2019); Chouchane and Declerck (2022); Bonhomme (2021). Examples of criteria can be found in Chouchane et al. (2018); Declerck (2021); Li et al. (2022); Yue et al. (2019).
2 Preliminary notations
2.1 General notations
The notation |Z| is the cardinality of set Z and \(A^{T}\)corresponds to the transpose of matrix A. The entry in the \(i-th\) row and \(j-th\) column of a matrix A is denoted A(i, j). The notation A(i, .) represents the row i of \(A\ \) while A(., j) expresses the column j. Symbol \( \backslash \) is the set difference, that is, \(U\setminus V\) is the set formed by the elements of set U that are not in set V. The 1-norm of vector u is equal to the sum of the absolute values of the vector elements and is denoted as \(\parallel u\parallel _{1}.\) The notation \(\lfloor x\rfloor \) represents the greatest integer less than or equal to x. A Place/Transition (P/TR) net is the structure \(N=(P,TR,W^{+},W^{-})\), where P is a set of |P| places and TR is a set of |TR| transitions. The matrices \(W^{+}\) and \(W^{-}\) are respectively the \(|P|\times |TR|\) post and pre-incidence matrices over \(\mathbb {N}\), where each row \(l\in \left\{ 1,...,|P|\right\} \) specifies the weight of the incoming and outgoing arcs of the place \(p_{l}\in P,\) respectively. The incidence matrix is \( W=W^{+}-W^{-}\). The preset and postset of the node \(v\in P\bigcup TR\) are denoted by \(^{\bullet }v\) and \(v^{\bullet }\), respectively. A source transition \(tr_{i}\in TR\) satisfies \(^{\bullet }tr_{i}=\emptyset .\) The notation \(\Omega ^{*}\) represents the set of firing sequences, denoted as \(\sigma \), consisting of transitions of the set \(\Omega \subset TR\). The vector \(\overline{\sigma }\) of dimension |TR| expresses the firing vector or count vector of the sequence \(\sigma \in TR^{*}\), where the i -th component \(\overline{\sigma _{i}}\) is the firing number of the transition \(tr_{i}\in TR\) which is fired \(\overline{\sigma _{i}}\) times in the sequence \(\sigma .\) A source transition \(tr_{i}\in TR\) satisfies \( ^{\bullet }tr_{i}=\emptyset \) and its firing count can be infinite.
The marking of the set of places P is a vector \(M\in \mathbb {N}^{|P|}\) that assigns to each place \(p_{i}\in P\) a non-negative integer number of tokens \(M_{i}\), represented by black dots. The i-th component \(M_{i}\) is also written as \(M(p_{i}).\) The marking M reached from the initial marking \(M^{0}\) by firing the sequence \(\sigma \) can be calculated by the fundamental relation: \(M=M^{0}\ +W.\overline{\sigma }\). The transition \(tr_{i}\) is enabled at M if \(M\ge W^{-}(.,tr_{i})\) and may be fired yielding the marking \(M^{\prime }=M+W(.,tr_{i})\) for a unique firing. We write \(M[\sigma \succ \) to denote that the sequence of transitions \(\sigma \) is enabled at M, and we write \( M[\sigma \succ M^{\prime }\) to denote that the firing of \(\sigma \) yields \( M^{\prime }.\) To easily describe the Petri net with the incidence matrices \( W^{+}\) and \(W^{-},\) we assume that there is at most a unique arc between a place \(p_{l}\) and each input (resp. output) transition \(x_{i}\) of this place: each weight \(\left( W\right) _{l,i}^{+}\ne 0\) ( respectively, \( \left( W\right) _{l,i}^{-}\ne 0)\) corresponds to a unique arc in the Petri net. Otherwise a simple modification of the Petri net yields the desired form.
2.2 Notations for estimation
A labeling function L : \( TR\rightarrow AL\cup \{\varepsilon \}\) assigns to each transition \(tr_{i}\in TR\) either a symbol from a given alphabet AL or the empty string \( \varepsilon \). In a partially observed Petri net, we consider that the set of transitions TR can be partitioned as \(TR=TR_{obs}\bigcup TR_{un}\), where the set \(TR_{obs}\) (respectively, \(TR_{un}\)) is the set of observable transitions associated with a label of AL (respectively, the empty string \( \varepsilon )\). In this paper, we assume that the same label of AL cannot be associated with more than one transition of \(TR_{obs}\) (named Assumption \( AS-4\) below).
The \(TR_{un}-\)induced subnet of the Petri net N is defined as the new net \( N_{un}=(P,TR_{un},{W_{un}^{+}},{W_{un}^{-}})\), where \({W_{un}^{+}}\) and \({ W_{un}^{-}}\) (respectively, \({W_{obs}^{+}}\) and \({W_{obs}^{-}}\)) are the restrictions of \(W^{+}\) and \(W^{-}\) to \(P\times TR_{un}\) (respectively, \( P\times TR_{obs}\)). This \(TR_{un}-\)induced subnet is also named unobservable induced subnet. Therefore, \(W_{un}={W_{un}^{+}}-{W_{un}^{-}}\) (respectively, \(W_{obs}={W_{obs}^{+}}-{W_{obs}^{-})}\). A reorganization of the columns with regards to \(TR_{obs}\) and \(TR_{un}\) yields \(W=\left( \begin{array}{ll} W_{obs}&W_{un} \end{array} \right) .\)
Notation \(x_{i}\) expresses a unobservable transition, belonging to \(TR_{un}\) while an observable transition belonging to \(TR_{obs}\) is denoted as \(y_{i}\) . The notation of the count vectors is taken for \(\overline{x}\) of dimension \(|TR_{un}|\) and \(\overline{y}\) of dimension \(|TR_{obs}|.\) The reorganization of the components of \(\overline{\sigma }\) yields \(\overline{\sigma }=\left( \begin{array}{ll} \overline{x}^{T}&\overline{y}^{T} \end{array} \right) ^{T}.\)
The estimation of the current unobservable sequence is based on the treatment of the data produced by the observed transitions successively in an on-line procedure. For step \(<k>,\) the subsequence \(x^{<k>}\) relevant to the unobservable transitions of\(\ TR_{un}\) separates two successive observations and precisely leads to the subsequence \(y^{<k>}\) corresponding to the observable transition of\(\ TR_{obs}.\) From \(M^{<k>},\) the subsequence \(\sigma ^{<k>}=x^{<k>}y^{<k>}\) allows the establishment of marking \( M^{<k+1>}:\) formally, \(M^{<k>}[\sigma ^{<k>}\succ M^{<k+1>}\) for \(k\ge 1\) where \(M^{<1>}\) represents the initial marking \(M^{0}.\) Moreover, \( \overline{\sigma }^{<k>}=\left( \begin{array}{ll} (\overline{x}^{<k>})^{T}&(\overline{y}^{<k>})^{T} \end{array} \right) ^{T}\) where notations \(\overline{x}^{<k>}\) and \(\overline{y}^{<k>}\) represent the count vector of \(x^{<k>}\) and \(y^{<k>}\) respectively. We assume that \(\overline{x}^{<k>}=0\) and \(\overline{y}^{<k>}=0\) for \(k\le 0.\) So, the estimation limited to one step must consider \(M^{<1>}[x^{<1>}y^{<1>} \succ M^{<2>}\) for the first step \(<1>,\) then \(M^{<2>}[x^{<2>}y^{<2>}\succ M^{<3>}\) for step \(<2>\) and so on. The generalization to a horizon composed of several steps is immediate. Note that these notations are not cumulative as we can have \(\overline{x}^{<3>}=0\) but \(\overline{x} ^{<1>}\ne 0\) and \(\overline{x}^{<2>}\ne 0:\) the condition \(\overline{x} ^{<1>}\le \overline{x}^{<2>}\le \overline{x}^{<3>}\) does not hold. The notations \(\overline{x}^{<0>\rightarrow <k>}=\underset{k^{\prime }=0,\ldots ,k}{\sum }\) \(\overline{x}^{<k^{\prime }>}\) and \(\overline{y}^{<0>\rightarrow <k>}=\underset{k^{\prime }=0,\ldots ,k}{\sum }\) \(\overline{y}^{<k^{\prime }>} \) allow to write shorter expressions.
3 Guaranteed horizons
3.1 Assumptions
In this paper, we assume the following assumptions for the different Petri nets under investigation:
-
Assumption \(\mathcal {AS-}1:\) the incidence matrices and the initial marking \(M^{0}\) are known.
-
Assumption \(\mathcal {AS-}2:\) the Petri net is live.
-
Assumption \(\mathcal {AS-}3:\) the firing number of each observable transition is finite.
-
Assumption \(AS-4:\) the observations are distinguishable, that is, the same label cannot be associated with more than one observable transition (the case of indistinguishable events is treated in Chouchane et al. (2018)).
-
Assumption \(AS-5.\) A unique firing of a transition on all transitions occurs at each time (Assumption \(AS-5\) is a facility to express the sequences under a form without concurrency).
The assumptions about cyclicity of the structure and boundedness of the marking are not considered in this article contrary to many papers in this topic.
3.2 Computation of a guaranteed horizon
Given a sequence of the observed firing events of the transitions of \( TR_{obs}\) generated by the activity of the Petri net, we desire to find a guaranteed horizon such that any possible sequence \(x^{<k>}y^{<k>}\) of the Petri net can be expressed for step \(<k>.\) The principle is to take a pessimistic point of view of the behavior of the Petri net. Precisely, we consider the worst case in terms of number of firings for the unobservable transitions, which corresponds to the greatest possible length of any unobservable sequence \(x^{<k>}\) when Assumption \(AS-5\) is taken. Therefore, for step \(<k>\) and its relevant sequence \(x^{<k>}y^{<k>}\), a guaranteed horizon denoted as \(h_{g}^{<k>}\) is given by \(h_{g}^{<k>}=\max \parallel \overline{x}^{<k>}\parallel _{1}+1=max(c.\overline{x}^{<k>})+1\) with \( \overline{x}^{<k>}\) over the integers and \(c_{1\text {x}|TR_{un}|}\) unitary where 1 corresponds to the unique observation \(y^{<k>}.\) A consequence is the possibility to treat an estimation problem for any criterion Declerck (2021). This worst case treats all the transitions even if a given criterion exploits a subset of transitions.
For step \(<k=1>,\) a majorant relevant to the unobservable transitions is given by \(\max \parallel \overline{x}^{<k>}\parallel _{1}=max(c.\overline{x} ^{<k>})\) with \(c_{1\text {x}|TR_{un}|}\) unitary for the system Declerck (2021)
with \(\overline{x}^{<1>}\ge 0\) and \(b^{<1>}=M^{<1>}-W_{obs}^{-}.\overline{y} ^{<1>}.\)
For the following steps \(k\ge 2,\ \)we can consider the maximization \(max(c. \overline{x}^{<k>})\) for a sliding horizon reduced to \(<k>\) or \(max(c. \overline{x}^{<0>\rightarrow <k>})\) for a receding horizon going from \(<0>\) to \(<k>\). A possible system Declerck (2021) is
with \(\overline{x}^{<0>\rightarrow <k>}\ge 0\) and \(b^{<k>}=M^{<1>}+W_{obs} \cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs}^{-}.\overline{y}^{<k>}.\) Naturally, more complex systems including the firing conditions can be considered but the advantage of these forms is the fixed dimensions of the matrices which only depend on the numbers of places and transitions of the Petri net (and not the number of observations) even if greater values are obtained. With that in mind, a simplified form of (2) which facilitates the analysis is studied in the sequel.
3.3 Upper boundedness of the problem
If the computed value \(max(c.\overline{x} ^{<k>})\) is finite, the problem is upper-bounded, and the horizon length presents a majorant. So, we now analyze the conditions such that the above problems of optimization using (1) or (2) converge to a finite solution. With that aim, we consider the second line of (2) which includes inequality (1) as for \(k=1,\) \(\overline{x}^{<0>\rightarrow <0>}=0\) and \(b^{<1>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow<0>}-W_{obs}^{-}.\overline{y}^{<1>}=M^{<1>}-W_{obs}^{-}.\overline{y}^{<1>}\) since \(\overline{y}^{<0>\rightarrow<0>}=\overline{y}^{<0>}=0.\) We consider the Integer Linear Programming problem (ILP problem)
with \(\overline{x}^{<0>\rightarrow <k>}\ge 0\) over \(\mathbb {Z}\) and \( b^{<k>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs}^{-}. \overline{y}^{<k>}\) for the succession of steps going from \(<0>\) to \(<k>.\)
Moreover, the ILP problem (3) is relaxed over \( \mathbb {R}\) and become a Linear Programming Problem (LP problem). The relevant maximization can converge to a greater value than the initial ILP problem as the space is not limited to the natural numbers. Formally, \( \underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\le \underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\) for \(-W_{un}.\overline{x} ^{<0>\rightarrow<k>}\le b^{<k>}.\) In addition, if the relaxed problem provides a finite solution, the same conclusion holds for ILP problem which is more restrictive.
A standard theorem of linear programming is that the dual problem of the primal problem max(c.x) subject to \(A.x\le b\) and \(x\ge 0\) is min(y.b) subject to \(y.A\ge c\) and \(y\ge 0.\) Consequently, we can deduce the LP problem dual to the ILP problem (3) relaxed over \( \mathbb {R}\) which is
Let \(\overline{x}_{opt}^{<0>\rightarrow <k>}\) and \(z_{opt}\), be the optimal solutions relevant to the ILP problem (3) relaxed over \(\mathbb {R}\) and LP problem (4).
Theorem 1
Declerck (2021) The maximization of problem (3) relaxed over \(\mathbb {R }\) is upper-bounded if there is \(z\ge 0\) over \(\mathbb {R}\) satisfying (4). Moreover, \(c.\overline{x}^{<0>\rightarrow<k>}\le z.b^{<k>}\) and \(c.\overline{x}_{opt}^{<0>\rightarrow<k>}=z_{opt}.b^{<k>}\). \(\blacksquare \)
Therefore, the ILP problem (3) relaxed over \( \mathbb {R}\) can be replaced by the problem (4 ).
3.4 New results on the computation of a guaranteed horizon
Theorem 1 is completed by the following theorems 2 and 3 which focus on the computation of a guaranteed horizon \(h_{g}^{<0>\rightarrow <k>}\) which represents a guaranteed receding horizon relevant to the succession of steps going from \(<0>\) to \(<k>.\)
Theorem 2
Assume that the space defined by \(z.W_{un}\le -c\) with c unitary and \(z\ge 0\) over \(\mathbb {R}\) is non-empty. For the succession of steps going from \(<0>\) to \(<k>,\) a guaranteed horizon is \( h_{g}^{<0>\rightarrow<k>}=\lfloor c.\overline{x}_{opt}^{<0>\rightarrow<k>}\rfloor +k=\lfloor z_{opt}.b^{<k>}\rfloor +k\) where \(\overline{x} _{opt}^{<0>\rightarrow <k>}\) and \(z_{opt}\) are the optimal solutions relevant to the ILP problem (3) relaxed over \( \mathbb {R}\) and LP problem (4).
Proof. Theorem 1 which considers the real numbers says that, if the space defined by \(z.W_{un}\le -c\) is non-empty (formally, \(\exists z\ge 0\) over \(\mathbb {R}\) satisfying \( z.W_{un}\le -c),\) then \(c.\overline{x}^{<0>\rightarrow <k>}\) for \(-W_{un}. \overline{x}^{<0>\rightarrow<k>}\le b^{<k>}\) is upper-bounded by \( z.b^{<k>}.\) The maximization of the ILP problem converges to a lower value than the relevant relaxed problem as the space of \(-W_{un}.\overline{x} ^{<0>\rightarrow<k>}\le b^{<k>}\) is restricted to the natural numbers. Formally, \(\underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\le \underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\) for \(-W_{un}.\overline{x} ^{<0>\rightarrow<k>}\le b^{<k>}.\) In addition, as c is an integer row-vector and the maximization over \(\mathbb {N}\) provides an integer solution, \(\underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\) is integer and \(\lfloor \underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\rfloor =\underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\le \lfloor \underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\rfloor \). Moreover, \(\underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})=c.\overline{x} _{opt}^{<0>\rightarrow <k>}\) by definition and Theorem 1 gives \(c.\overline{x} _{opt}^{<0>\rightarrow<k>}=z_{opt}.b^{<k>}.\) Finally, variable k, that is, the k observations \(y^{<k>}\) for the succession of steps going from \( <0>\) to \(<k>\) under Assumption \(AS-5,\) must be added to obtain \( h_{g}^{<0>\rightarrow <k>}.\) \(\blacksquare \)
Degraded horizons \(h_{g}^{<0>\rightarrow <k>}\) can be proposed if a non-optimal z is taken or if the vector \(b^{<k>}\) is modified.
Theorem 3
Assume that the space defined by \( z.W_{un}\le -c\) with c unitary and \(z\ge 0\) over \(\mathbb {R}\) is non-empty. Some possible guaranteed horizons \(h_{g}^{<0>\rightarrow<k>}=\lfloor z.b^{<k>}\rfloor +k\) with z over \(\mathbb {R}\) can be deduced if z is taken as follows:
-
1.
z is an arbitrary solution of \(z.W_{un}\le -c\)
-
2.
z is deduced from the optimization \(\min (z.d)\) for \(z.W_{un}\le -c\) where d is an arbitrary positive row-vector.
-
3.
A possible guaranteed horizon is \(\lfloor z.b^{^{\prime }<k>}\rfloor +k \) with \(b^{\prime<k>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow <k-1>}\) where z is provided by Point 1 or 2.
Proof. Point 1) The second point of Theorem 1 says that any solution z to (4) satisfies \(c.\overline{x}^{<0>\rightarrow<k>}\le z.b^{<k>}.\) So, the relevent \(z.b^{<k>}\) is a possible majorant of \( c.\overline{x}_{opt}^{<0>\rightarrow <k>}\) and Theorem 2 can be applied.
Point 2) As point 1) says that any solution z satisfying \(z.W_{un}\le -c\) can be used, we can choose a minimization of z.d with an arbitray \(d>0\) as it always converges to an arbitrary finite solution as \(z\ge 0\) and \(d>0\).
Point 3) \(b^{<k>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs}^{-}.\overline{y}^{<k>}\le \) \(M^{<1>}+W_{obs}\cdot \overline{y }^{<0>\rightarrow <k-1>}\ \)as \(W_{obs}^{-}.\overline{y}^{<k>}\ge 0\) (\( W_{obs}^{-}\ge 0\) and \(\overline{y}^{<k>}\ge 0).\) So, \(b^{<k>}\le b^{\prime <k>}\) and \(z.b^{<k>}\le z.b^{\prime <k>}\) as \(z\ge 0.\) So, \( \lfloor z.b^{<k>}\rfloor \le \lfloor z.b^{^{\prime }<k>}\rfloor \). Each vector z given by Point 1) or 2) provides a possible guaranteed horizon \( \lfloor z.b^{^{\prime }<k>}\rfloor +k\). \(\blacksquare \)
Let us consider an execution of the computation made with (4). In Point 1), a horizon can be computed without waiting the end of this computation by taking an intermediate solution z while an arbitrary z for any step \(<k>\) can be computed with a simple criterion in Point 2). As \(b^{\prime <k>}\) does not contain \( \overline{y}^{<k>}\) in Point 3), a degraded horizon can be computed at the end of step \(<k-1>\) without waiting the occurrence of observation \(y^{<k>}\) and the knowledge of \(\overline{y}^{<k>}.\) Knowing \(\overline{y}^{<k>},\) a better result is obtained with \(z.b^{<k>}.\)
3.5 RSB property
The following definition introduced in Declerck (2021) modifies the condition in Theorem 1 by replacing \(z.W_{un}\le -c\) by \(z.W_{un}<0\).
Definition 1
The unobservable induced subnet is Relaxed Structurally Bounded (RSB) when there exists a non-negative vector z over \(\mathbb {R}\) such that \( z.W_{un}<0.\) Formally,
Example 1
Let us consider the Petri net of Fig. 1 which is an elementary event graph containing a self-loop, a source transition \(y_{1}\) and a sinks transition \(y_{2}\). We have: \(P=\{p_{1},p_{2},p_{3}\};\) \( TR=TR_{obs}\bigcup TR_{un}\) with \(TR_{obs}=\{y_{1},y_{2}\}\) and \( TR_{un}=\{x_{1}\}.\) So, \(W_{obs}=\left( \begin{array}{rr} 1 &{} 0 \\ 0 &{} 0 \\ 0 &{} -1 \end{array} \right) \) and \(W_{un}=\left( \begin{array}{r} -1 \\ 0 \\ 1 \end{array} \right) .\) The unobservable induced subnet is RSB as \(\exists z=\left( \begin{array}{ccc} 2&1&1 \end{array} \right) \ge 0\) such that \(z.W_{un}=-1<0\) with z over \(\mathbb {R}\). Note that the Petri net is neither structurally bounded (\(\exists z\in \) \(\mathbb { N}^{3}\) with \(z>0\) and \(z.W\le 0\)), nor deadlock structurally bounded (\( \exists z\in \) \(\mathbb {N}^{3}\) with \(z>0\) and \(z.W<0\)) as the first column of W is non-negative. Remember that these definitions concern the marking and not the sequences. Now if the first element of the first column becomes \( -1,\) the unobservable induced subnet remains RSB but the Petri net becomes structurally bounded and deadlock structurally bounded (for \(z=\left( \begin{array}{rrr} 2&1&1 \end{array} \right) ,\) \(z.W=\left( \begin{array}{rrr} -2&-1&-1 \end{array} \right) <0).\) Also note that, if we assume \(W_{obs}=0\) in a general Petri net and add the conditions z over \(\mathbb {N}\) and \(z>0\) to a RSB unobservable induced subnet, the Petri net becomes structurally bounded and deadlock structurally bounded. \(\blacksquare \)
The following theorem makes the connection between the primal problem and the RSB property.
Theorem 4
Declerck (2021) The maximization \( max(c.\overline{x}^{<0>\rightarrow <k>})\) with \(c>0\) unitary over \(\mathbb {R}\) for system (3) is upper-bounded if the unobservable induced subnet is RSB.
Example 1 continued.
a) Let us analyze the upper-boundedness of the count number \(\overline{x} _{1} \) of the unobservable transition \(x_{1}.\) Note that the analysis of the Petri net in Fig. 1 shows that, for \(p_{1},\) if \( \overline{y}_{1}\) is upper-bounded, then \(\overline{x}_{1}\) is also upper-bounded. The resolution of (1) for the observation sequence \(y_{1}y_{2}\) gives \(\overline{x}_{1}^{<1>}=0\) for step \( <k=1>\) (no firing of \(x_{1}\) before observation \(y_{1}\)) and the resolution of (2) yields \(\overline{x}_{1}^{<1>}+ \overline{x}_{1}^{<2>}=1\) for step \(<k=2>\) (firing of \(x_{1}\) at step \(<1>\) or \(<2>\) before observation \(y_{2}\)).
b) Let us consider the dual problem (4). Theorem 4 implies that \(\overline{x}_{1}\) is upper-bounded as the unobservable induced subnet is RSB. These results which are consistent are obtained despite that the self-loop is not represented in the incidence matrices \(W_{obs}\) and \(W_{un}.\) In fact, the same computation can be made without place \(p_{2}\) and its initial token. \(\blacksquare \)
4 Composition of substructures and propagation of the RSB property
4.1 Introduction
Considering the primal problem (3) relaxed over \(\mathbb {R}\) where the objective is \(max(c.\overline{x}^{<k>})\) with c unitary, a necessary condition to define a guaranteed horizon is that each component \(\overline{x}_{i}\) must be upper-bounded. Indeed, if \( max(c_{i}.\overline{x}_{i})=+\infty \) with \(c_{i}\ne 0,\) the variable is not upper-bounded and a relevant guaranteed horizon cannot be computed. However, a possibility is to consider not the complete set of unobservable transitions but a subset. Therefore, the objective of Section 4 is to determine the set of variables which are upper-bounded. An interesting phenomenon which may be studied is a possible propagation of the upper-boundedness: if the event numbers of all input transitions of a place are upper-bounded, the event numbers of all the output transitions of this place are also upper-bounded.
In the rest of the paper, the objective is not to analyze the primal problem (3) relaxed over \(\mathbb {R}\) but the dual problem (4) and particularly the propagation of the RSB property through the composition of substructures in \(W_{un}.\)
4.2 Preliminary results
Let us analyze the columns of \(W_{un}.\)
Theorem 5
A sufficient condition such that \(W_{un}\) is non-RSB is the existence of a non-RSB column at least.
Proof. Consider a matrix \(A=W_{un}\ \)containing a non-RSB column denoted \(A_{2}.\) If \(A_{2}\) is non-RSB then \(\not \exists z_{2}\ge 0\) such that \(z_{2}.A_{2}<0.\) The consideration of the other columns of A which increases the constraints cannot improve this impossibility. \(\blacksquare \)
This result does not depend on the number of the non-RSB columns and their positions which can be separate. A practical way to reduce this difficulty is to add sensors such that the relevant unobservable transitions become observable.
Theorem 6
Let \(A=W_{un}=\left( \begin{array}{c} A_{1} \\ B \end{array} \right) .\) If the substructure \(A_{1}\) is RSB, then A is also RSB.
Proof If the substructure \(A_{1}\) is RSB, there is \(z_{1}\ge 0\) such that \(z_{1}.A_{1}<0.\) Let us build a new vector \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) \ge 0\) such that \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) .\left( \begin{array}{c} A_{1} \\ B \end{array} \right) \) is negative. If \(z_{2}=0,\) the product is equal to \(z_{1}.A_{1}\) and the vector \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) \ge 0\) is obtained. \(\blacksquare \)
In other words, a structure remains RSB if some rows are added to it.
4.3 Triangular structure
Let us consider more complex structures. We now assume that a reorganization of the rows and the columns of \(W_{un}\) has established a specific triangular structure where the blocks can be non-square and the top right corner contains null elements only. For each block, a rank can be numbered from the upper left corner to the lower right corner of the table.
We below consider the RSB characteristic of each block of the structure, that is, the blocks of the main diagonal and also the substructures making the connection between the blocks.
Before considering the general case, the block-triangular form is composed of two blocks below. Let
where the substructures \(A_{1},\) B and \(A_{2}\) are rectangular in general. Matrix B makes the connection with the two substructures \(A_{1}\) and \( A_{2} \).
Considering the primal problem, the resolution in the following example illustrates the propagation of the upper-boundedness of \(\overline{x} ^{<0>\rightarrow <k>}\) inside the structure. The propagation of the upper-boundedness on the count variables is represented by the propagation of the RSB characteristic in the blocks. This propagation follows the increasing order of the blocks.
Example 2
Let \(A_{1}=\left( \begin{array}{cc} -1 &{} 1 \\ -1 &{} 3 \\ 1 &{} -3 \end{array} \right) ,\) \(A_{2}=\left( \begin{array}{cc} -1 &{} 1 \\ 1 &{} -4 \end{array} \right) \) and \(B=\left( \begin{array}{cc} 1 &{} 2 \\ -1 &{} 2 \end{array} \right) \) in (5). Let us consider \( -W_{un}.\overline{x}^{<0>\rightarrow<k>}\le b^{<k>}\) in the primal problem (3). To facilitate the writing of the expressions, we take \(\overline{x}=\overline{x}^{<0>\rightarrow <k>}\) and \( b=b^{<k>}=\left( \begin{array}{ccccc} b_{1}&b_{2}&b_{3}&b_{4}&b_{5} \end{array} \right) ^{T}.\) For \(-A_{1}.\left( \begin{array}{rr} \overline{x}_{1}&\overline{x}_{2} \end{array} \right) ^{T}\le \left( \begin{array}{ccc} b_{1}&b_{2}&b_{3} \end{array} \right) ^{T},\) \(\overline{x}_{1}\) and \(\overline{x}_{2}\) are upper-bounded as some obtained relations are \(\overline{x}_{2}\le (b_{1}+b_{3})/2\) and \( \overline{x}_{1}\le b_{2}+3.\overline{x}_{2}\) (also, \(\overline{x}_{1}\le b_{1}+\overline{x}_{2}).\)
For \(-A_{2}.\left( \begin{array}{rr} \overline{x}_{3}&\overline{x}_{4} \end{array} \right) ^{T}\le \left( \begin{array}{cc} b_{4}&b_{5} \end{array} \right) ^{T},\) \(\overline{x}_{3}\) and \(\overline{x}_{4}\) are upper-bounded as a resolution yields \(\overline{x}_{4}\le (b_{4}+b_{5})/3\) and \(\overline{ x}_{3}\le b_{4}+\overline{x}_{4}.\)
Now, for the complete rows \(-\left( \begin{array}{rr} B&A_{2} \end{array} \right) .\overline{x}\le b,\) a similar resolution gives \(\overline{x} _{4}\le (b_{4}+b_{5}+4.\overline{x}_{2})/3\) and \(\overline{x}_{3}\le b_{4}+ \overline{x}_{1}+2.\overline{x}_{2}+\overline{x}_{4}.\) So, \(\overline{x}_{3}\) and \(\overline{x}_{4}\) are upper-bounded as \(\overline{x}_{1}\) and \( \overline{x}_{2}\) are upper-bounded. \(\blacksquare \)
Considering the structure of A, a first analysis shows that \(A_{1}\) can be treated without considering the rows of B and \(A_{2}.\) The dependence between \(A_{1}\) and \(A_{2}\) in (5) is expressed by each non-null component of B. When \(B=0\), there is no dependence with \(A_{1}\) and a resolution can treat the subsystem \(A_{2}\) independently of the treatment of \(A_{1}\) which can be RSB or not. So, \( A_{2} \) is independent when \(B=0.\) Note that a transition depending of an unbounded part can be upper-bounded as a dependence with another part which is upper-bounded is possible.
Theorem 7
If the substructures \(A_{1}\) and \(A_{2}\) are RSB in (5), then A is RSB.
Proof Let us consider the left columns of A. If the substructure \( A_{1}\) is RSB, there is \(z_{1}\ge 0\) such that \(z_{1}.A_{1}<0.\) Let us build a new vector \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) \ge 0\) such that \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) .\left( \begin{array}{c} A_{1} \\ B \end{array} \right) =z_{1}.A_{1}+z_{2}.B\) is negative.
We now show that for a given \(z_{1}\) such that \(z_{1}.A_{1}<0\), we can make a product by \(\lambda >0\) such that \(\lambda .z_{1}.A_{1}\le -u\) where u is a unitary row-vector with the adapted dimension. Indeed, if \( -1<z_{1}.(A_{1})_{.,i}<0\) for the column \((A_{1})_{.,i},\) then we can always multiply \(z_{1}\) by \(\lambda _{i}=1/|z_{1}.(A_{1})_{.,i}|\) with \( z_{1}.(A_{1})_{.,i}\ne 0\) which implies \((\lambda _{i}.z_{1}).(A_{1})_{.,i}=-1.\) The result is kept if we take a greater \( \lambda _{i}:\) \((\lambda _{i}^{\prime }.z_{1}).(A_{1})_{.,i}<-1\) for \( \lambda _{i}^{\prime }>\lambda _{i}.\) The reasoning can be repeated for all columns satisfying \(-1<z_{1}.(A_{1})_{.,i}<0.\) So, we can make a multiplication of \(z_{1}\) by taking the maximum \(\lambda _{\max }\) on all the obtained values \(\lambda _{i}\) and the desired result is obtained. With \( z_{1}^{\prime }=\lambda _{\max }.z_{1}\), we have \(z_{1}^{\prime }.A_{1}+z_{2}.B\le -u+z_{2}.B\) and we can take the new objective \( -u+z_{2}.B<0\) which implies \(z_{1}^{\prime }.A_{1}+z_{2}.B<0.\) So, for the left columns of A, we must have \(z_{2}.B<u\) with \(z_{2}\ge 0\ \)and also, \( z_{2}.A_{2}<0\) for the right columns of A. So, to prove that A is RSB, the problem is equivalent to search \(z_{2}\ge 0\) such that
As \(A_{2}\) is RSB, there is an arbitrary vector \(z_{2}\ge 0\) such that \( z_{2}.A_{2}<0\) which is modified below. In general, the product \( z_{2}.B_{.,i}\) for each column \(B_{.,i}\) can be in the intervals \([1,+\infty [\) (case 1), [0, 1[ (case 2) or \(]-\infty ,[0\) (case 3).
Case 1. If the product satisfies \(z_{2}.B_{.,i}\ge 1,\) then the scalar \(z_{2}.B_{.,i}\) is positive, and we can always divide \( z_{2}\) by a positive scalar \(\mu _{i}>z_{2}.B_{.,i}\) such that \((z_{2}/\mu _{i}).B_{.,i}<1.\) Note that \(\mu _{i}>1.\) The same reasoning holds for all the columns \(B_{.,i}\) satisfying the case 1 and we can make a division of \( z_{2}\) by taking the maximum \(\mu _{\max }\) on all the obtained values \(\mu _{i}>1\). This implies that the relations \((z_{2}/\mu _{\max }).B_{.,i}<1\) are kept for the columns \(B_{.,i}\) in the case 1. Let us show that the columns presenting the cases 2 and 3 satisfy \((z_{2}/\mu _{\max }).B_{.,i}<1\) with this new vector \(z_{2}/\mu _{\max }\).
Case 2. If a column \(B_{.,i}\) satisfies \(0\le z_{2}.B_{.,i}<1,\) then \(0\le (z_{2}/\mu _{\max }).B_{.,i}<z_{2}.B_{.,i}<1\) as \(\mu _{\max }\ge 1.\)
Case 3. If a column \(B_{.,i}\) satisfies \(z_{2}.B_{.,i}<0,\) then the sign of \((z_{2}/\mu _{\max }).B_{.,i}\) remains negative as \(\mu _{\max }>0.\)
Finally, we have build a desired vector \(z_{2}^{\prime }=z_{2}/\mu _{\max }\ge 0\) such \(z_{2}^{\prime }.B<u\) and consequently, \(\left( \begin{array}{cc} z_{1}^{\prime }&z_{2}^{\prime } \end{array} \right) \ge 0\) with \(z_{1}^{\prime }=\lambda _{\max }.z_{1}\) such that \( \left( \begin{array}{cc} z_{1}^{\prime }&z_{2}^{\prime } \end{array} \right) .\left( \begin{array}{c} A_{1} \\ B \end{array} \right) \) is negative and \(\left( \begin{array}{c} A_{1} \\ B \end{array} \right) \) is RSB. In addition, \(z_{2}.A_{2}<0\) implies \(z_{2}^{\prime }.A_{2}<0\) as \(\mu _{\max }>0\) and the complete structure A is RSB. \( \blacksquare \)
Example 2 continued.
The two substructures \(A_{1}\) and \(A_{2}\) are RSB: \(z_{1}.A_{1}=\left( \begin{array}{cc} -1&-1 \end{array} \right) \) with \(z_{1}=\left( \begin{array}{ccc} 2&2&3 \end{array} \right) \) and \(z_{2}.A_{2}=\left( \begin{array}{cc} -1/2&-1/4 \end{array} \right) \) with \(z_{2}=\left( \begin{array}{cc} 3/4&1/4 \end{array} \right) .\) We have \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) .W_{un}=\left( \begin{array}{cccc} -1/2&+1&-1/2&-1/4 \end{array} \right) \nless 0\) but the new vector \(\left( \begin{array}{cc} z_{1}&z_{2}/\mu \end{array} \right) \) with \(\mu =4\) gives \(\left( \begin{array}{cc} z_{1}&z_{2}/\mu \end{array} \right) .W_{un}=\left( \begin{array}{cccc} -7/8&-1/2&-1/8&-1/16 \end{array} \right) <0\). So, \(W_{un}\) is RSB and Theorem 4 implies that all the components of the vector \(\overline{x}^{<0>\rightarrow <k>}\) are upper-bounded. This result is coherent with the direct analysis made above. \(\blacksquare \)
Theorem 7 is now generalized by considering a triangular form where the main diagonal contains more than two blocks.
Let RSB(A) be the matrix expressing the RSB property of each submatrix of A which is denoted by Y if RSB, N if non-RSB, − if indifferent and 0 if the submatrix is null.
Theorem 8
If the substructures of the main diagonal are RSB, then A is RSB.
Proof. Let us consider a structure A where the main diagonal contains three RSB blocks: \(RSB(A)=\left( \begin{array}{rrr} Y &{} 0 &{} 0 \\ - &{} Y &{} 0 \\ - &{} - &{} Y \end{array} \right) .\) Matrix A can be rewritten as \(A=\left( \begin{array}{rr} C_{11} &{} C_{12} \\ C_{21} &{} C_{22} \end{array} \right) \) with \(RSB(C_{11})=\left( \begin{array}{rr} Y &{} 0 \\ - &{} Y \end{array} \right) ,\) \(RSB(C_{12})=\left( \begin{array}{r} 0 \\ 0 \end{array} \right) ,\) \(RSB(C_{21})=\left( \begin{array}{rr} -&- \end{array} \right) \) and \(RSB(C_{22})=\left( Y\right) .\) The application of Theorem 7 implies that \(C_{11}\) is RSB. Moreover, the same theorem shows that A is RSB. The reiteration of this reasoning for greater dimensions gives the desired result. \(\blacksquare \)
5 Case study: a manufacturing system (example 3)
This example is a classical automated manufacturing system which has been presented in Section VII.A of Mahulea et al. (2012) and is recognized to be significant in the literature since slight variations of it have already been considered by different authors. The plant produces two different types of products from two types of raw materials. It consists of five machines (MA1 to MA5), four robots (R01 to R04), a finite capacity buffer (BU ), two inputs of raw parts and two automated guided vehicle systems ( AGV1 and AGV2). Presented in Mahulea et al. (2012), the description of the plant and its layout is out the scope of the paper.
The Petri net Fig. 2 has 38 places and 26 transitions and corresponds to the Petri net Fig. 3 page 979 in Section VII.A of Mahulea et al. (2012) with the same initial marking. Remember that observable transitions are \(y_{1},\ldots ,y_{12}\) while \(x_{13},\ldots ,x_{26}\) correspond to unobservable transitions. Despite a complex aspect, the structural approach proposed in this paper shows that this Petri net can be represented under a more approachable form in a context of estimation: in fact, the incidence matrix of the unobservable induced subnet is large (38\( \times \)14) but sparse (by lake of space, this complex matrix is not given), and a judicious reorganization of the rows and columns can provide a clearer view under a block-triangular form presented below.
Corresponding to places \(p_{1},\) \(p_{14},\) \(p_{18},\) \(p_{19},\) \(p_{20},\) \( p_{21},\) \(p_{28},\) \(p_{32}\) and \(p_{37}\) the null rows of the unobservable incidence matrix cannot affect the sequence estimation and can be disregarded. As shown by Table 2, the reorganization of the rows and columns shows that the unobservable incidence matrix presents two disconnected parts denoted as \(S_{A}\) and \(S_{B}\) (named simply connected structures in graph theory, these disconnected parts can be found by a standard search of paths between the vertices). System \(S_{A}\) decribes the connections between 19 places and 11 unobservable transitions while system \(S_{B}\) in the bottom right corner is about 7 places and 3 unobservable transitions: \(S_{A}=( \{p_{33},p_{36},p_{38},p_{26},p_{11},p_{33},p_{9},p_{10},p_{5},p_{24},p_{3},p_{2},p_{7},p_{31},p_{8},p_{6},p_{27},p_{16},p_{15},p_{17},\) \(p_{4},p_{29},p_{25}\}\times \) \( \{x_{19},x_{20},x_{15},x_{17},x_{13},x_{14},x_{18},x_{16},x_{23},x_{22},x_{25}\}) \) and \(S_{B}=(\{p_{34},p_{30},\) \(p_{22},p_{12},p_{23},p_{13},p_{35}\}\times \{x_{21},x_{26},x_{24}\}).\) The unobservable induced subnet presents some circuits (\(x_{16}p_{6}x_{18}p_{25}x_{16}\), \( x_{15}p_{5}x_{17}p_{24}x_{15},x_{13}p_{3}x_{15}p_{29}x_{13})\) but system \( S_{B}\) has no circuit. This first decomposition suggests that the consideration of these systems can reduce the size of the matrices and vectors used in the estimation.
Presented in the appendix, the DM decomposition is a possible approach leading to a triangular form. Let consider an application of this technique presented in Table 2 and analyze the obtained subsystems.
System \(\textbf{S}_{A}\)
System \(\textbf{S}_{A}\) is composed of \(S_{1},\) \(S_{2},\) \(S_{3}\) and \(S_{4}\) defined as follows:
\(S_{1}=\{p_{33},p_{36},p_{38},p_{26},p_{11},p_{33},p_{9},p_{10}\}\times \{x_{19},x_{20}\},S_{2}=\{p_{5},p_{24},p_{3},p_{2},p_{7}\}\times \{x_{15},x_{17},x_{13},x_{14}\},\) \(S_{3}=\{p_{31},p_{8}\}\times \{x_{18}\}\) and \(S_{4}=\{p_{6},p_{27},p_{16},p_{15},p_{17},p_{4},p_{29},p_{25}\}\times \{x_{16},x_{23},x_{22},x_{25}\}\)
Highlighted in grey in the table, the four incidence matrices \(W_{un,1}\) for \(S_{1},\) \(W_{un,2}\) for \(S_{2},\) \(W_{un,3}\) for \(S_{3}\) and \(W_{un,4}\) for \( S_{4}\) can easily be expressed and are RSB. Indeed, the relevant possible row-vector \(z_{i}\) satisfying \(z_{i}.W_{un,i}<0\) are \(z_{1}=\left( \begin{array}{rrrrrrrr} 0&0&0&0&0&0&2&1 \end{array} \right) ,\) \(z_{2}=\left( \begin{array}{rrrrr} 3&1&3&4&1 \end{array} \right) ,\) \(z_{3}=\left( \begin{array}{rr} 2&1 \end{array} \right) \) and \(z_{4}=\left( \begin{array}{rrrrrrr} 1&2&2&3&1&2&0 \end{array} \right) .\) So, we can deduce that \(\textbf{S}_{A}\) is RSB (Theorem 8) and that all count variables of structure \(\textbf{S}_{A}\) are upper bounded (Theorem 4).
System \(S_{B}\)
The unique substructure of \(S_{B}\) is \(S_{5}= \{p_{34},p_{30},p_{22},p_{12},p_{23},p_{13},p_{35}\}\times \{x_{21},x_{26},x_{24}\}.\)
The incidence matrix \(W_{un,5}\) for \(S_{5}\) is RSB as a possible \(z_{5}\) satisfying \(z_{5}.W_{un}<0\) is \(z_{5}=\left( \begin{array}{rrrrrrr} 1&0&2&3&0&0&1 \end{array} \right) .\)
Finally, the analysis of \(\textbf{S}_{A}\) and \(S_{B}\) shows the that the Petri net has good properties as the unobservable induced incidence matrix \( W_{un}\) is completely RSB and all count variables of \(\textbf{S}_{A}\) and \( S_{B}\) are upper bounded.
We now show that the computation of a guaranteed horizon can be applied on a subsystem.
System \(S_{1}\)
Let us consider the subsystem \(S_{1}\) at step \(<1>\) and the relevant estimation of the sequence length for transitions \(x_{19}\) and \(x_{20}\). In inequality (1), we can consider the rows relevant to the places of the set \( \{p_{33},p_{36},p_{38},p_{26},p_{11},p_{33},p_{9},p_{10}\},\) the relevant marking \(M_{R}=\left( \begin{array}{rrrrrrrr} M_{33}&M_{36}&M_{38}&M_{26}&M_{11}&M_{33}&M_{9}&M_{10} \end{array} \right) ^{T}\) (R as Reduced, remember that \(M_{i}=M(p_{i}))\) and the relevant rows of the incidence matrices: \(W_{un,R}=\left( \begin{array}{c} \begin{array}{r} W_{un}(33,.) \\ W_{un}(36,.) \\ W_{un}(38,.) \\ W_{un}(26,.) \\ W_{un}(11,.) \\ W_{un}(33,.) \end{array} \\ W_{un}(9,.) \\ W_{un}(10,.) \end{array} \right) \) and \(W_{obs,R}^{-}=\left( \begin{array}{c} \begin{array}{r} W_{obs}^{-}(33,.) \\ W_{obs}^{-}(36,.) \\ W_{obs}^{-}(38,.) \\ W_{obs}^{-}(26,.) \\ W_{obs}^{-}(11,.) \\ W_{obs}^{-}(33,.) \end{array} \\ W_{obs}^{-}(9,.) \\ W_{obs}^{-}(10,.) \end{array} \right) .\) Moreover, we can consider the columns of \(\{x_{19},x_{20}\}\) and remove the null columns of \(W_{un,R}\) which becomes \(W_{un,1}\) already introduced. We obtain
with \(\overline{x}_{R}=\left( \begin{array}{rr} \overline{x}_{19}&\overline{x}_{20} \end{array} \right) ^{T}\) and \(b_{R}^{<1>}=M_{R}^{<1>}-W_{obs,R}^{-}.\overline{y}^{<1>}.\) For the following steps \(<k>,\) the same reasoning implies that relation (3) becomes
with \(b_{R}^{<k>}=M_{R}^{<1>}+W_{obs,R}\cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs,R}^{-}.\overline{y}^{<k>}\)
The size of vectors \(\overline{x}_{R}^{<0>\rightarrow <k>},\) \(M_{R}^{<1>},\) \( \overline{y}^{<0>\rightarrow <k-1>}\)and \(\overline{y}^{<k>}\) are \((2\times 1),\) \((8\times 1),\) \((12\times 1)\) and \((12\times 1)\) respectively. The size of matrices \(W_{un,1},W_{obs,R}\) and \(W_{obs,R}^{-}\) are \((8\times 2),\) \((8\times 12),\) and \((8\times 12)\) respectively. Except \(W_{obs,R}\) which is given below, all the matrices can easily be deduced.
In the case of a receding horizon going from \(<0>\) to \(<k>\), the application of Theorem 2 allows to compute the guaranteed horizon with the following linear programming problem: \(h_{g}^{<0> \rightarrow<k>}=\lfloor \max (c.\overline{x}_{R}^{<0>\rightarrow <k>})\rfloor +k\) for (6) relaxed over \(\mathbb {R }\) where \(\overline{x}_{R}^{<0>\rightarrow <k>}\) and (6) replace \(\overline{x}^{<0>\rightarrow <k>}\) and \( -W_{un}.\overline{x}^{<0>\rightarrow<k>}\le b^{<k>}\) in (3), respectively. Taking the initial marking and the first observed words of Table I in Mahulea et al. (2012), the results are presented in Table 3. The execution time of each computation based on function glpk() of GNU Octave is negligible.
Table 3 already gives a rough estimate of the sequences as \(\overline{x}_{19}+\overline{x}_{20}\le \lfloor \max (c.\overline{x}_{R}^{<0>\rightarrow <k>})\rfloor \). When \( \lfloor \max (c.\overline{x}_{R}^{<0>\rightarrow <k>})\rfloor =0,\) the firings of \(x_{19}\) and \(x_{20}\) are not possible. Knowing the guaranteed horizon for the subsystem \(S_{1}\) and each step, the sequence estimation presented in Section 3.2 Declerck (2021) can be applied: using the counter form, the finite set of relations can be established and the timed sequences relevant to transitions \(x_{19}\) and \(x_{20}\) can be estimated with any criterion.
The same remarks hold for subsystems \(S_{2}\) and \(S_{5}\) which are independent. When the majorants of the count variables relevant to \(S_{1}\) and \(S_{2}\) are obtained, the structure suggests that the majorant of the count variable \( \overline{x}_{18}\) of \(S_{3}\) which depends on \(S_{1}\) and \(S_{2}\) can be estimated in a sequential way. The same remark holds for the count variable\(\{ \overline{x}_{16},\overline{x}_{23},\overline{x}_{22},\overline{x}_{25}\}\) of \(S_{4}.\) Finally, the sequential treatment of each reduced system with smaller dimensions facilitates the computation and brings local results independently of the resolution of the complete system.
6 Conclusion
In Section 3, we have introduced new results as Theorems 2 and 3 which focus on the computation of a guaranteed horizon \(h_{g}^{<0>\rightarrow <k>}\) that is, a guaranteed receding horizon relevant to the succession of steps going from \(<0>\) to \(<k>.\) We have analyzed the structure of the incidence matrix of the unobservable induced subnet in Section 4 and have shown that a structure presenting at least a non-RSB column is non-RSB. The composition of the RSB blocks shows a propagation of the RSB property through the structure proving that a part of the transitions relevant to the RSB blocks is upper bounded even if the complete system is non-RSB. The algebraic analysis can be applied to any triangular form which can be obtained with any technique as the Dulmage-Mendelsohn decomposition or a variant: a double application of this decomposition has been made in Section 5. The case study shows that the direct treatment of the large system system can be avoided and that the triangular form brings a sequential treatment allowing a computation based on smaller systems independently of the resolution of the complete system. The adaptation to any known estimation approach is clearly a perspective. As this proposed paper shows that the consideration of the triangular form of the incidence matrix is an useful way to analyze a Petri net, another perspective is the adaptation of this approach to analyze the properties of structurally boundedness, and deadlock structurally boundedness.
References
L. Aguirre-Salas, O. Begovich, and A. Ramirez-Trevino, Observability in interpreted Petri nets using sequence invariants. In Proc. of the 41st IEEE Conf. on Decision and Control, volume 4, pp. 3602–3607, 2002
Aybar A, Iftar A (2009) Decentralized Structural Controller Design for Large-scale Discrete-Event Systems Modelled by Petri Nets. Kybernetika 45(1):3–14
F. Arichi, M. Petreczky, M. Djemai, and B. Cherki, Observability and Observer Design of Partially Observed Petri Nets, IFAC-PapersOnLine 48 (27), pp. 27-32, 4, 2015
F. Baccelli, and B. Gaujal, Liveness in Free-Choice Petri Nets- An Algebraic Approach, research report, Inria, 1996
Basile F, Chiacchio P, De Tommasi G (2012) On K-diagnosability of Petri nets via integer linear programming. Automatica 48(9):2047–2058
T. Becha, R. Kara, S. Collart-Dutilleul, and J. J. Loiseau, Modelling, Analysis and Control of Electroplating Line Modelled by P-Time Event Graphs, in Proc. 6th International Conference on Management and Control of Production and Logistics, Fortaleza, Brazil, pp. 311-316, Sep. 11-13, 2013
Bonhomme P (2021) Decentralized state estimation and diagnosis of P-time labeled Petri nets systems. Journal of Discrete Event Dynamic Systems 31(1):137–162
A. Boussif, B. Liu, and M. Ghazel, Diagnosability Analysis of Intermittent Faults in Discrete Event Systems, International Journal of Control, Automation and Systems 17(X), pp. 1-14, 2019
P. Bunus, and P. Fritzson, Methods for structural analysis and debugging of modelica models, in Proc. \(2^{nd}\) International Modelical Conference, pp.157-165, 2002
Cane J-M, Kubicki A, Michelucci D, Barki H, Foufou S (2014) Re-paramétrisation et réduction des systèmes irré ductibles. Revue électronique Francophone d’Informatique Graphique 8(2):7991
A. Chouchane, P. Declerck, A. Khedher, and A. Kamoun, Diagnostic based on estimation using linear programming for partially observable Petri nets with indistinguishable events, International Journal of Systems Science: Operations & Logistics, pp. 1-14, Taylor & Francis, 2018
Chouchane A, Declerck P (2022) Diagnosis on a sliding horizon for partially observable Petri nets, Kybernetika, International journal of Institute of Information Theory and Automation of The Czech Academy of. Science 58(4):479–497
F. Baccelli, G. Cohen, G.J. Olsder, and J.P. Quadrat Synchronization and Linearity. An Algebra for Discrete Event Systems, Available from http://maxplus.org, New York, 1992, Wiley
P. Declerck and P. Bonhomme, State Estimation of Timed Labeled Petri Nets with Unobservable Transitions, IEEE Transactions on Automation Science and Engineering, Special Issue on Discrete Event Systems for Automation, Vol. 11, No. 1, ITASC9, pp. 103-110, January 2014
P. Declerck, Counter approach for the estimation of optimal sequences in Partially Observable Untimed Petri Nets, Journal of Discrete Event Dynamic Systems, 2021
Declerck P, Bonhomme P (2023) Two-staged Approach for Estimation of sequences in Partially Observable P-time Petri Nets on a sliding horizon with schedulability analysis. International Journal of Control. https://doi.org/10.1080/00207179.2023.2204977
P. Declerck, and M. Staroswiecki, Identification of structurally solvable sub-systems for the design of Fault Detection and Isolation Schemes, using the Embedding Procedure, 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation, pp. 230-235, Vol.1, Budapest, Hungary, July 8-12, 1991
P. Declerck, and M. Staroswiecki, Characterization of the Canonical Components of a Structural Graph for Fault Detection in Large Scale Industrial Plants, First European Control Conference, ECC’91, pp. 298-303, Vol.1, Grenoble, France, July 2-5, 1991
F. Defossez, S. Collart-Dutilleul, and P. Bon, Temporal requirements checking in a safety analysis of railway systems, FORMS/FORMAT 2008, Symposium on Formal Methods for Automation and Safety in Railway and Automotive Systems, TU Braunschweig and Budapest University of Technology and Economics, October, 2008
Dotoli M, Fanti MP, Mangini AM (2009) Fault detection of discrete event systems by Petri nets and integer linear programming. Automatica 45(11):2665–2672
Dulmage AL, Mendelsohn NS (1958) Covering of bipartite graphs. Canadian J. Math 10:517–534
A. L. Dulmage, and N. S. Mendelsohn, A structure theory of bipartite graphs of finite exterior dimension, Trans. of Royal Soc. Canada, Section III, 53, pp. 1-13, 1959
F. Ahmad, H. Huang, and X. Wang, A Technique for Reachability Graph Generation for the Petri Net Models of Parallel Processes, International Journal of Computer and Information Engineering, Vol: 3, No: 3, 2009
Frisk E, Bregon A, Aslund J, Krysander M, Pulido B, Biswas G (2012) Diagnosability Analysis Considering Causal Interpretations for Differential Constraints, IEEE transactions on systems, man and cybernetics. Part A. Systems and humans 42(5):1216–1229
Gougam HE, Pencolé Y, Subias A (2017) Diagnosability analysis of patterns on bounded labeled prioritized Petri nets. Discrete Event Dynamic Systems 27:143–180
J. Komenda, S. Lahaye, J-L. Boimond, and T. van den Boom, Max-plus algebra in the history of discrete event systems, Annual Reviews in Control. 45. 10.1016/j.arcontrol.2018.04.004, 2018
Lefebvre D (2014) On-Line Fault Diagnosis With Partially Observed Petri Nets. IEEE Transactions on Automatic Control 59(7):1919–1924
L.Li, Y. Li, B. Liu, and W. Wu. Least-Cost Transition Sequence Estimation in Labeled Time Petri Net Systems with Unobservable Transitions, International Journal of Control, pp 1-14, 10.1080/00207179.2022.2121764, 2022
Libeaut L (1996) Sur l’utilisation des dioides pour la commande des systèmes à événements discrets. Ph.D, Nantes
Mahulea C, Seatzu C, Cabasino M, Silva M (2012) Fault Diagnosis of Discrete-Event Systems Using Continuous Petri Nets. IEEE Transactions on Systems, Man, and Cybernetics 42:970–984
Murata T (Apr. 1989) Petri Nets: Properties. Analysis and Applications, in PROCEEDINGS OF THE IEEE 77(4):541–580
K. Murota, Systems Analysis by Graphs and Matroids. Structural solvability and controllability, Algorithms and Combinatorics 3, Springer-Verlag, 1987
Pothen A, Fan C-J (December 1990) Computing the Block Triangular Form of a Sparse Matrix. ACM Transactions on Mathematical Software 16(4):303–324
Ru Y, Hadjicostis CN (2010) Sensor selection for structural observability indiscrete event systems modeled by Petri nets. IEEE Trans. Autom. Control 55(8):1751–1764
K. Schmidt, Using Petri Net Invariants in State Space Construction, Proc. of Tools and Algorithms for the Construction and Analysis of Systems TACAS, LNCS, vol. 2619, Springer, pp. 473-488, 2003
V. M. Savi, and X. Xie, Liveness and Boundedness Analysis for Petri Nets with Event Graph Modules, Lecture Notes in Computer Science book series (LNCS,volume 616), International Conference on Application and Theory of Petri Nets, pp 328–347, 1992
H. Yue, S. Xu, G. Zhou, H. Hu, Y. Guo, and J. Zhang, Estimation of least-cost transition firing sequences in labeled Petri nets by using basis reachability graph, IEEE Access, pp. 1-1. 10.1109/ACCESS.2019.295205, 2019
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors have no conflict of interest to declare that are relevant to this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
This appendix presents the main lines of the Dulmage-Mendelsohn decomposition Dulmage and Mendelsohn (1958) Dulmage and Mendelsohn (1959) which brings a partition of the transitions and places in notable sub-structures. Generalizing the classical triangular form where the substructures are square, this structural analysis is based on a canonical decomposition of any bipartite graph (and its relevant table which is rectangular).
The DM decomposition presents a large scope of applications as it has been exploited in many fields such that: the resolution of large scale systems Pothen and Fan (December 1990); Murota (1987); Cane et al. (2014); the simulation of continuous systems where the debugging of software based on modeling languages is necessary Bunus and Fritzson (2002); the fault detection in continuous systems Declerck and Staroswiecki (1991a, 1991b). Similar to bond graphs, the study Frisk et al. (2012) improves this structural approach by taking into account integral and differential causal interpretations for differential constraints.
The classical algorithms of maximum matching as the classical "Hungarian method" (developed by H. Kuhn) and the algorithm of permutation of the rows and columns of a matrix Murota (1987); Pothen and Fan (December 1990); Declerck and Staroswiecki (1991a, 1991b) leading to its DM decomposition are out the scope of the appendix.
Let us define the initial table and adapt the DM decomposition to the Petri nets.
As the resolution focuses on the unknown variables, we separate the transitions of \(TR\ \)and the relevant columns in two sets:
-
The set of observable transitions \(TR_{obs}\) which correspond to the known variables \(\overline{y}\).
-
The set of unobservable transitions \(TR_{un}\) which are relevant to unknown variables \(\overline{x}\).
Therefore, a permutation of the columns allows to establish the incidence matrix \(W=\left( \begin{array}{ll} W_{obs}&W_{un} \end{array} \right) .\) Moreover, a structural point of view of the unobservable induced subnet is taken. Each (oriented) arc of the Petri net is replaced by a (non-oriented) edge which is non-valuated, that is, the valuations and the orientations of the arcs of the Petri net are neglected: only the existence of a connection (which is often represented by the symbol \(\times \) or 1 in the literature) is considered in this appendix. We now show that the analysis of this non-oriented bipartite graph of the unobservable induced subnet allows to determine notable sub-systems which are the support of a possible resolution and an algebraic interpretation.
The matching between the relations P and the unobservable transitions \(TR_{un}\) is defined as follows:
Definition 2
A matching C is a set of pairs \((p_{i},x_{j})\) where:
-
Each place \(p_{i}\) is associated with a transition \(x_{j}\) at the most.
-
Each transition \(x_{j}\in TR_{un}\) is associated with a place \(p_{i}\) at the most.
So, in a matching C, a unique transition is associated to each place and a unique place is associated to each transition at the most. In the bipartite graph, the matching is represented by a set of edges without common vertices.
As the non-oriented bipartite graph of the unobservable induced subnet is considered, a possible pair can be composed of a place and one of its input or output transitions. Moreover, we focus on maximum matchings where the number of its pairs is maximum. Different maximum matchings can be obtained but all of them have the same cardinality. In the Tables 2 and 4, each pair of the matching is expressed by a symbol in bold.
In this context, the maximum matching is the support of the canonical decomposition developed by A. L. Dulmage and N. S. Mendelsohn Dulmage and Mendelsohn (1958, 1959) in graph theory where a structural decomposition of the table leads to a diagonal of specific block substructures. Three distinct canonical structures named Just-, Over- and Under-structures are highlighted and the block substructures of the Just-structure are square. If the matching C is maximum, there is a unique partition of rows of P and columns of \(TR_{un}\) denoted as X such that: \( P=P^{>}\cup P^{=}\cup P^{<}\) and \(X=X^{>}\cup X^{=}\cup X^{<}\) with empty intersections. This partition highlights three important sub-structures: the Over-structure \(S^{>}=(P^{>},X^{>})\), the Just-structure \( S^{=}=(P^{=},X^{=}) \) and the Under-structure \(S^{<}=(P^{<},X^{<}).\) Moreover, we have \(|C|=|C^{>}|+\left| C^{=}\right| +|C^{<}|\) (expression \(\left| .\right| \) denotes the number of pairs in the matching) where \(C^{>},C^{=}\) and \(C^{<}\) satisfy the following points.
-
For the Over-structure, the maximum matching \(C^{>}\) satisfies \( |C^{>}|=\left| X^{\prime }\right| <\left| P^{\prime }\right| .\) All elements of \(X^{\prime }\) are matched but there is at least a non-matched element in \(P^{\prime }.\)
-
For the Just-structure, the maximum matching \(C^{=}\) satisfies \( \left| C^{=}\right| =\left| P^{\prime }\right| =\left| X^{\prime }\right| .\) All elements of \(P^{\prime }\) and \(X^{\prime }\) are matched in the case of a Just-structure which can be decomposed in square blocks.
-
For the Under-structure, the maximum matching \(C^{<}\) satisfies \( |C^{<}|=\left| P^{\prime }\right| <\left| X^{\prime }\right| .\) All elements of \(P^{\prime }\) are matched but there is at least a non-matched element in \(X^{\prime }.\)
Given a matching, an alternating path is a path whose edges belong alternatively to the matching and not to the matching. Using these notions, the following theorem transposed from Dulmage and Mendelsohn (1958, 1959); Declerck and Staroswiecki (1991a, 1991b) allows to determine different notable substructures. To facilitate the presentation of the results, a direction is added to the edges of the non-oriented bipartite graph. Each pair \((p_{i},x_{j})\) of the maximum matching C is oriented from \(p_{i}\) to \(x_{j}\) (graphically, \(p_{i} \overset{C}{\longrightarrow }x_{j})\) and in the opposite direction when \( (p_{i},x_{j})\notin C\) (graphically, \(p_{i}\leftarrow x_{j}\) ).
Theorem 9
Let us assume that the matching is maximum.
-
The places and transitions of an alternating path belong to the Over-structure \(S^{>}=(P^{>},X^{>})\) when this path starts from a matched place and finishes in a non-matched place.
-
The places and transitions of an alternating path belong to the Under-structure \(S^{<}=(P^{<},X^{<})\) when this path starts from a non-matched transition and finishes in a matched transition.
-
The Just-structure is defined by \(S^{=}=(P^{=},X^{=})\) with \( P^{=}=P\backslash (P^{>}\cup P^{<})\) and \(X^{=}=TR_{un}\backslash (X^{>}\cup X^{<}).\) \(\blacksquare \)
Example 4
In the Petri net of example 4, the sets of observable and unobservable transitions are {\(y_{1},y_{2}\)} and{\(x_{1},x_{2},\ldots ,x_{8}\)} respectively (Fig. 3). Remember that the firing number of \(y_{1},y_{2}\) is finite (Assumption \(\mathcal {AS-}3\)). Table 4 presents the non-oriented form of the incidence matrix \(W=\left( \begin{array}{ll} W_{obs}&W_{un} \end{array} \right) \) for the Petri net in Fig. 3: The values 1 and -1 are replaced by 1 which represents the presence of a connection between the relevant place and the unobservable transition. For clarity, the labels of the places and transitions in Petri nets Fig. 3 have been chosen such that they illustrates the DM decomposition without making a reorganization of the columns and rows of the table. A possible maximum matching C whose cardinality is 8 is represented in bold (this matching is suggested by the orientation of the Petri net): \(C= \{(p_{1},x_{1}),(p_{2},x_{2}),(p_{4},x_{3}),(p_{5},x_{4}),(p_{6},x_{5}),(p_{7},x_{6}),(p_{8},x_{8}),(p_{9},x_{9})\}. \) Other maximum matchings are available as \(\{(p_{2},x_{1}),(p_{3},x_{2}),(p_{4},x_{3}),\!(p_{5},x_{5}),\!(p_{6},x_{4}),\!(p_{7},x_{6}),\) \((p_{8},x_{7}),(p_{9},x_{8})\} \).
As C is maximum, we can apply the DM decomposition (Theorem 9). The edges of the non-oriented bipartite graph are now oriented with the direction presented above and the oriented path can be interpreted. In this simple example, the orientation deduced from the chosen maximum matching corresponds to the orientation of the Petri net. For \(S^{>}\), an alternating path starts from \(p_{1}\) (matched) and finishes in \(p_{3}\) (non-matched): \(p_{1}\overset{C}{\longrightarrow } x_{1}\longrightarrow p_{2}\overset{C}{\longrightarrow }x_{2}\longrightarrow p_{3}.\) For \(S^{<}\), an alternating path starts from \(x_{7}\) (non-matched) and finishes in \(x_{9}\) (matched): \(x_{7}\longrightarrow p_{8}\overset{C}{ \longrightarrow }x_{8}\longrightarrow p_{9}\overset{C}{\longrightarrow } x_{9} \). For \(S^{=}\), an alternating path starts from \(p_{4}\) (matched) and finishes in \(x_{5}\) (matched): \(p_{4}\overset{C}{\longrightarrow } x_{3}\longrightarrow p_{5}\overset{C}{\longrightarrow }x_{4}\longrightarrow p_{6}\overset{C}{\longrightarrow }x_{5}\). Another one starts from \(p_{7}\) (matched) and finishes in \(x_{6}\) (matched): \(p_{7}\overset{C}{ \longrightarrow }x_{6}\). \(\blacksquare \)
Let us consider the Just-structure \(S^{=}.\) Let us assume that a fictitious self-loop of null length connecting \(x_{i}\) to \(x_{i}\) is added to each vertex \(x_{i}\in X^{=}.\) For any pair of matched vertexes (\(x_{i},\) \(x_{j})\) with \(x_{j},x_{i}\in X^{=},\) we can focus on the case where there is a path from \(x_{i}\) to \(x_{j}\) and a path from \(x_{j}\) to \(x_{i}\) (mutual dependence of \(x_{i}\) and \(x_{j}\)). This case defines a pair of dependent transitions corresponding to a circuit and we can define a substructure composed of transitions where each transition is connected to any transition of the substructure with a circuit composed of places. Formally, the substructure contains a directed path from to \(x_{i}\) to \(x_{j}\) and a directed path from \(x_{j}\) to \(x_{i}\) for every pair of vertices \(x_{i},\) \( x_{j}.\) These substructures are usually named strongly connected substructure and, irreducible substructure for the corresponding representation in the table. As this type of substructure is remarkable, the approach is based on the determination of all these substructures which leads to a partition of the Just-structure \(S^{=}.\)
Example 4 continued.
Represented in grey in the Just-structure \(S^{=}\) of Table 4, substructures \((\{p_{4}\},\{x_{3}\})\), \((\{p_{5},p_{6}\},\{x_{4},x_{5}\})\) and \((\{p_{7}\},\{x_{6}\})\) are irreducible. \(\blacksquare \)
Note that each DM decomposition is common to a set of Petri nets. In fact, each filling of the lower-left corner of \(W_{un}\) defines a new Petri net with the same DM decomposition.
Example 4 continued.
The Petri net in Fig. 4 is a variant of the Petri net in Fig. 3 which presents the same DM decomposition. The difference is the addition of components in the lower-left corner of \( W_{un}\): \(\left( W_{un}\right) _{4,1}=1,\) \(\left( W_{un}\right) _{4,2}=1,\) \( \left( W_{un}\right) _{7,3}=1,\) \(\left( W_{un}\right) _{7,4}=1,\) \(\left( W_{un}\right) _{8,1}=1,\) \(\left( W_{un}\right) _{8,3}=1,\) \(\left( W_{un}\right) _{8,6}=-1.\) \(\blacksquare \)
Example 3 continued (case study).
The determination of a maximum matching gives a matching where the non-matched places are \( p_{33},p_{36},p_{38},p_{26},p_{11},p_{33},p_{7},p_{8},\!p_{17},\!p_{4},\!p_{29},\) \(p_{25}, \) \(p_{34},\) \(p_{30},\) \(p_{13},p_{35}\) (in Table 2, the matched components are in bold). The application of the DM decomposition of \(W_{un}\) is a priori unsuccessful as it leads to a unique under-determined structure which does not generate smaller subsystems. Keeping only the rows of the matched places as \(p_{9},\) \( p_{10},\) \(p_{5},\)..., a second DM decomposition leads to a just-structure which is a triangular form (shaded in darker grey) suggesting an order of resolution. The size of all the blocks in the main diagonal except the substructure \(\{p_{5},p_{24}\}\times \{x_{15},x_{17}\}\) is \(1\times 1\). The addition of the rows of the non-matched places proposes the subsystems in darker grey and light grey. Note that, as the DM decomposition has not been applied in the classical form, this decomposition of subsystems is not unique a priori. \(\blacksquare \)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Declerck, P. Offline analysis of the relaxed upper boundedness for online estimation ofoptimal event sequences in Partially Observable Petri Nets. Discrete Event Dyn Syst (2024). https://doi.org/10.1007/s10626-024-00396-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10626-024-00396-0