1 Introduction

In this paper, we focus on the analysis of the modelling conditions such that any possible unobservable event sequence leading to an observation presents a finite length in the estimation problem. Finite lengths are necessary in fault diagnosis Lefebvre (2014) and diagnosability problem Basile et al. (2012) where all the sequences which end in an observation or a fault event must be finite. Indeed, if the estimation considers an observation relevant to a given finite time, the length of all the possible sequences starting from the initial marking and finishing in this event must also be finite, otherwise, the modelling of the estimation problem is not sufficient to characterize the relevant unobservable sequences. This situation which can affect the estimation and the fault detection, can be produced by the lack of sensors or inadequate positions of the sensors in the Petri net. The presence of a finite length can be interpreted not as a condition of the uniqueness of the trajectory Aguirre-Salas et al. (2002); Arichi et al. (2015), but as a necessary condition contributing to the estimation. Let us develop the motivations of this paper. The context is the sequence estimation problem (Problem 1) whose main points are presented below. The problem traited in this paper is described in the sequel (Problem 2).

  1. 1.

    The event sequences (the events considered in the sequences are only the transition firings throughout the article) are represented with the counter form which is an efficient tool developed in the field of dioid algebra (max-plus algebra, min-plus algebra,...) for originally Timed Event Graphs. Based on Libeaut (1996), the study Declerck and Bonhomme (2014) shows that for estimation in Timed Petri nets, all the unobservable sequences (for any length and rank of the events) can be described with a unique polyhedron which can be easily deduced from the structure of the Petri net. This possibility is extended to Untimed Petri nets under a weak condition in Declerck (2021) which allows to avoid the combinatorial building of a reachability graph or a similar graph.

  2. 2.

    As a unique polyhedron which describes all the sequences can be established, a classical technique of optimization which generates optimal solutions can be applied. A natural objective is to optimize an arbitrary criterion which can present the form of a linear weighting of the transition firing numbers. The aim is not to make the time-consuming computation of all the possible event sequences but only the pertinent sequences useful for the considered problem. The well-developed theory of optimization can be exploited and standard algorithms of linear programming and integer linear programming can be used. Possibly leading to the state space explosion, the burdensome enumeration of all the numerical solutions as all possible markings reachable from the initial marking is avoided.

  3. 3.

    A practical advantage of an estimation based on transition firings is the possibility to define different criteria. The criterion can be a minimum or maximum number of firings in the fault detection presented in Chouchane et al. (2018). But criteria outside diagnosis can be defined as a criterion can be a balance-sheet, that is a global price depending on the costs and gains provided by the tasks Declerck (2021). Another criteria is the least cost where each transition has a nonnegative cost in labeled Petri nets Yue et al. (2019); Li et al. (2022). Potentially, each criteria can lead to a practical application of the estimation as the sequences can model faulty behaviors (simple faulty event, multiple faults, intermittent faults, faulty behaviors,.... ) but also the repair of a system after the occurrence of a fault and any normal behaviors Gougam et al. (2017).

Therefore, an interesting aim is the on-line estimation of optimal current subsequences in Partially Observable Petri Net, that is, the determination of sequences of unobservable transition firings which are coherent with the observed label sequence produced by the observable transitions and are (preferably) optimal with respect to a criterion which can present the form of a linear weighting of the transition firing numbers (Problem 1).

An important point is that the estimation of optimal sequences is based on the existence of a time horizon (corresponding to a sequence length) which is sufficient to describe all the sequences and to write a finite set of relations describing the model with the counter form (System (8) in  Declerck (2021)). In the timed case, the successive dates of observations are known and the time horizon is simply the difference between the dates relevant to the beginning and the end of the desired horizon Declerck and Bonhomme (2014). This technique is used in Declerck and Bonhomme (2023) which presents an application in P-time Petri nets which facilitates the schedulability of estimated sequences, that is, the determination of possible firing dates of a given sequence without token deaths. But, this way cannot be used in the untimed case where the dates of observations are unknown by assumption. A possible solution is to take an arbitrary large horizon but this technique does not guarantee that this value is sufficient for each considered estimation problem and increases the size of the treated system and consequently the execution time of the corresponding computations. So, the aim is to determine appropriate horizons which allow to express every subsequence. The paper Declerck (2021) presents Integer Linear Programming Problems and problems relaxed over the real numbers which provide temporary guaranteed horizons adapted to the current evolution.

However, to exploit the previous results, the computation of these problems must converge to a finite horizon. In that case, any possible sequence presents a finite length in the estimation problem: a finite set of relations describing the model with the counter form can be written and any criterion can be considered. As Declerck (2021) shows that the horizon is always finite when the unobservable induced subnet is Relaxed Structurally Bounded (RSB), the objective of this paper is the analysis of the Relaxed Structurally Boundedness property (RSB property: the same acronym is taken) based on an offline analysis of the specific invariants defined over the real numbers (Problem 2). Particularly, different questions emerge in the following cases.

- When the incidence matrix describing the unobservable induced subnet is non-RSB, the resolution treating the complete system is not guaranteed. However, the analysis of some examples shows that an estimation for any criterion can be made on some subsystems and an objective is the determination of the RSB parts. A corollary question is the improvement of the non-RSB parts.

- Petri nets can describe large scale systems as production systems Becha et al. (2013) and transportation systems (railway systems Defossez et al. (2008)). This case cannot be treated by the estimation approaches which suffer of the state explosion particularly when the estimation tries to determine all the numerical solutions of the state space. An efficient way to cope with the complexity in modelling large-scale systems consists in decomposing it into subsystems and refining the results Schmidt (2003); Aybar and Iftar (2009); Baccelli and Gaujal (1996); Savi and Xie (1992). Clearly, a resolution considering smaller systems improves the numerical efficiency of the chosen technique.

In this context, this paper proposes a structural approach presenting two phases.

  1. 1.

    The decomposition of the structure into substructures leading to a triangular form where the blocks can be non-square (Tables 2 and 4 present this triangular form). Obtained with any technique, this practical structure can be obtained if the Petri net is an association of Petri subnets. This triangular form can also be deduced with the Dulmage-Mendelsohn decomposition Dulmage and Mendelsohn (1959, 1958) (DM decomposition) presented in the appendix which completes the paper.

  2. 2.

    The composition of the algebraic substructures and the relevant analysis of the RSB property. The goal is to integrate the substructures together to determine the property of the whole system.

Related problems.

Let us describe some other problems related to Problem 2. As the definition of relaxed structurally boundedness concerns the sequences, this definition is fundamentally different from the classical definitions characterizing the marking as the (marking) boundedness, the structurally boundedness (the number of tokens in each place does not exceed a finite natural number for any marking reachable from any finite initial marking) and deadlock structurally boundedness (it leads to a deadlock state from any finite initial marking) even if the relevant algebraic expressions are close Murata (Apr. 1989). Moreover, the definition of RSB which is over the real numbers and not the integers considers the unobservable induced subnet and not the complete Petri net. More details can be found in Declerck (2021) and Example 1 below. A problem presenting a similarity with Problems 1 and 2 is the study Lefebvre (2014) which proposes the maximal size of all the elementary firing sequences by determining offline the maximal length of the paths in the induced unobservable graph which is obtained according to the reachability graph or the coverability graph. Howevever, the construction of these graphs suffers from the well-known state explosion problem related to the exponential space and time complexity which limits the application of this technique to modest-size Petri nets Ahmad et al. (2009). The approach proposed in this paper is complementary to this approach as the online computation of the guaranteed horizons provides different temporary bounds which are adapted to the current evolution. In addition, the assumption of the existence of a finite horizon is not taken as the objective of problem 2 is precisely the analysis of this existence in the context of the paper.

Another similar problem is diagnosability which is the property of a partially observed dynamical system that asserts whether it is always possible to diagnose with certainty the occurrence of an anticipated failure. This problem focuses on the detection of a given event Basile et al. (2012) or a given event pattern Gougam et al. (2017). Problem 1 presents another aim as: the objective is the determination of a current majorant which depends on the observations and must be computed online. Another problem is observability which presents different definitions in the literature as sequence estimation or marking estimation can be considered. In max-plus algebra, the structural observability expresses only a dependence between the unobservable events and the observations Baccelli et al. (1992) while different papers consider a constraining definition based on the uniqueness of the marking for any known initial marking Ru and Hadjicostis (2010) or the uniqueness of the unobservable sequence and marking Aguirre-Salas et al. (2002); Arichi et al. (2015). Contrary to these previous studies, the general aim of this paper is the determination of a possible optimal sequence in the solution space and not the detection of a unique sequence of transition firings. Observable places are not considered in this paper and the initial marking is assumed to be known.

The paper is organized as follows. The preliminary notations are given in Section 2 and Table 1 which summarizes the main notations. The beginning of Section 3 presents the principle of the computation of a guaranteed horizon, that is, a horizon which is sufficiently large such that any possible sequence is modelled in the estimation problem Declerck (2021). This section is completed by different results. A comparison of the definition of RSB, structural boundedness and deadlock structurally bounded Petri nets is made in Section 3.3. Then, the composition of substructures and the propagation of the RSB property through the triangular form is analyzed in Section 4. The elementary Example 1 illustrates the main concepts of the problem, while the other examples focus on the structural aspect. Well-known in the literature, the case study of Example 3 is a classical manufacturing system which is a large scale system. The reading of the paper is largely independent of the appendix which presents the DM decomposition yielding a triangular form as in Tables 2 and 4. As the paper focuses on the sequence length, the state estimation studied in many references is out the scope of this paper Dotoli et al. (2009); Komenda et al. (2018); Boussif et al. (2019); Chouchane and Declerck (2022); Bonhomme (2021). Examples of criteria can be found in Chouchane et al. (2018); Declerck (2021); Li et al. (2022); Yue et al. (2019).

Table 1 Main notations

2 Preliminary notations

2.1 General notations

The notation |Z| is the cardinality of set Z and \(A^{T}\)corresponds to the transpose of matrix A. The entry in the \(i-th\) row and \(j-th\) column of a matrix A is denoted A(ij). The notation A(i, .) represents the row i of \(A\ \) while A(., j) expresses the column j. Symbol \( \backslash \) is the set difference, that is, \(U\setminus V\) is the set formed by the elements of set U that are not in set V. The 1-norm of vector u is equal to the sum of the absolute values of the vector elements and is denoted as \(\parallel u\parallel _{1}.\) The notation \(\lfloor x\rfloor \) represents the greatest integer less than or equal to x. A Place/Transition (P/TR) net is the structure \(N=(P,TR,W^{+},W^{-})\), where P is a set of |P| places and TR is a set of |TR| transitions. The matrices \(W^{+}\) and \(W^{-}\) are respectively the \(|P|\times |TR|\) post and pre-incidence matrices over \(\mathbb {N}\), where each row \(l\in \left\{ 1,...,|P|\right\} \) specifies the weight of the incoming and outgoing arcs of the place \(p_{l}\in P,\) respectively. The incidence matrix is \( W=W^{+}-W^{-}\). The preset and postset of the node \(v\in P\bigcup TR\) are denoted by \(^{\bullet }v\) and \(v^{\bullet }\), respectively. A source transition \(tr_{i}\in TR\) satisfies \(^{\bullet }tr_{i}=\emptyset .\) The notation \(\Omega ^{*}\) represents the set of firing sequences, denoted as \(\sigma \), consisting of transitions of the set \(\Omega \subset TR\). The vector \(\overline{\sigma }\) of dimension |TR| expresses the firing vector or count vector of the sequence \(\sigma \in TR^{*}\), where the i -th component \(\overline{\sigma _{i}}\) is the firing number of the transition \(tr_{i}\in TR\) which is fired \(\overline{\sigma _{i}}\) times in the sequence \(\sigma .\) A source transition \(tr_{i}\in TR\) satisfies \( ^{\bullet }tr_{i}=\emptyset \) and its firing count can be infinite.

The marking of the set of places P is a vector \(M\in \mathbb {N}^{|P|}\) that assigns to each place \(p_{i}\in P\) a non-negative integer number of tokens \(M_{i}\), represented by black dots. The i-th component \(M_{i}\) is also written as \(M(p_{i}).\) The marking M reached from the initial marking \(M^{0}\) by firing the sequence \(\sigma \) can be calculated by the fundamental relation: \(M=M^{0}\ +W.\overline{\sigma }\). The transition \(tr_{i}\) is enabled at M if \(M\ge W^{-}(.,tr_{i})\) and may be fired yielding the marking \(M^{\prime }=M+W(.,tr_{i})\) for a unique firing. We write \(M[\sigma \succ \) to denote that the sequence of transitions \(\sigma \) is enabled at M, and we write \( M[\sigma \succ M^{\prime }\) to denote that the firing of \(\sigma \) yields \( M^{\prime }.\) To easily describe the Petri net with the incidence matrices \( W^{+}\) and \(W^{-},\) we assume that there is at most a unique arc between a place \(p_{l}\) and each input (resp. output) transition \(x_{i}\) of this place: each weight \(\left( W\right) _{l,i}^{+}\ne 0\) ( respectively, \( \left( W\right) _{l,i}^{-}\ne 0)\) corresponds to a unique arc in the Petri net. Otherwise a simple modification of the Petri net yields the desired form.

2.2 Notations for estimation

A labeling function L : \( TR\rightarrow AL\cup \{\varepsilon \}\) assigns to each transition \(tr_{i}\in TR\) either a symbol from a given alphabet AL or the empty string \( \varepsilon \). In a partially observed Petri net, we consider that the set of transitions TR can be partitioned as \(TR=TR_{obs}\bigcup TR_{un}\), where the set \(TR_{obs}\) (respectively, \(TR_{un}\)) is the set of observable transitions associated with a label of AL (respectively, the empty string \( \varepsilon )\). In this paper, we assume that the same label of AL cannot be associated with more than one transition of \(TR_{obs}\) (named Assumption \( AS-4\) below).

The \(TR_{un}-\)induced subnet of the Petri net N is defined as the new net \( N_{un}=(P,TR_{un},{W_{un}^{+}},{W_{un}^{-}})\), where \({W_{un}^{+}}\) and \({ W_{un}^{-}}\) (respectively, \({W_{obs}^{+}}\) and \({W_{obs}^{-}}\)) are the restrictions of \(W^{+}\) and \(W^{-}\) to \(P\times TR_{un}\) (respectively, \( P\times TR_{obs}\)). This \(TR_{un}-\)induced subnet is also named unobservable induced subnet. Therefore, \(W_{un}={W_{un}^{+}}-{W_{un}^{-}}\) (respectively, \(W_{obs}={W_{obs}^{+}}-{W_{obs}^{-})}\). A reorganization of the columns with regards to \(TR_{obs}\) and \(TR_{un}\) yields \(W=\left( \begin{array}{ll} W_{obs}&W_{un} \end{array} \right) .\)

Notation \(x_{i}\) expresses a unobservable transition, belonging to \(TR_{un}\) while an observable transition belonging to \(TR_{obs}\) is denoted as \(y_{i}\) . The notation of the count vectors is taken for \(\overline{x}\) of dimension \(|TR_{un}|\) and \(\overline{y}\) of dimension \(|TR_{obs}|.\) The reorganization of the components of \(\overline{\sigma }\) yields \(\overline{\sigma }=\left( \begin{array}{ll} \overline{x}^{T}&\overline{y}^{T} \end{array} \right) ^{T}.\)

The estimation of the current unobservable sequence is based on the treatment of the data produced by the observed transitions successively in an on-line procedure. For step \(<k>,\) the subsequence \(x^{<k>}\) relevant to the unobservable transitions of\(\ TR_{un}\) separates two successive observations and precisely leads to the subsequence \(y^{<k>}\) corresponding to the observable transition of\(\ TR_{obs}.\) From \(M^{<k>},\) the subsequence \(\sigma ^{<k>}=x^{<k>}y^{<k>}\) allows the establishment of marking \( M^{<k+1>}:\) formally, \(M^{<k>}[\sigma ^{<k>}\succ M^{<k+1>}\) for \(k\ge 1\) where \(M^{<1>}\) represents the initial marking \(M^{0}.\) Moreover, \( \overline{\sigma }^{<k>}=\left( \begin{array}{ll} (\overline{x}^{<k>})^{T}&(\overline{y}^{<k>})^{T} \end{array} \right) ^{T}\) where notations \(\overline{x}^{<k>}\) and \(\overline{y}^{<k>}\) represent the count vector of \(x^{<k>}\) and \(y^{<k>}\) respectively. We assume that \(\overline{x}^{<k>}=0\) and \(\overline{y}^{<k>}=0\) for \(k\le 0.\) So, the estimation limited to one step must consider \(M^{<1>}[x^{<1>}y^{<1>} \succ M^{<2>}\) for the first step \(<1>,\) then \(M^{<2>}[x^{<2>}y^{<2>}\succ M^{<3>}\) for step \(<2>\) and so on. The generalization to a horizon composed of several steps is immediate. Note that these notations are not cumulative as we can have \(\overline{x}^{<3>}=0\) but \(\overline{x} ^{<1>}\ne 0\) and \(\overline{x}^{<2>}\ne 0:\) the condition \(\overline{x} ^{<1>}\le \overline{x}^{<2>}\le \overline{x}^{<3>}\) does not hold. The notations \(\overline{x}^{<0>\rightarrow <k>}=\underset{k^{\prime }=0,\ldots ,k}{\sum }\) \(\overline{x}^{<k^{\prime }>}\) and \(\overline{y}^{<0>\rightarrow <k>}=\underset{k^{\prime }=0,\ldots ,k}{\sum }\) \(\overline{y}^{<k^{\prime }>} \) allow to write shorter expressions.

3 Guaranteed horizons

3.1 Assumptions

In this paper, we assume the following assumptions for the different Petri nets under investigation:

  • Assumption \(\mathcal {AS-}1:\) the incidence matrices and the initial marking \(M^{0}\) are known.

  • Assumption \(\mathcal {AS-}2:\) the Petri net is live.

  • Assumption \(\mathcal {AS-}3:\) the firing number of each observable transition is finite.

  • Assumption \(AS-4:\) the observations are distinguishable, that is, the same label cannot be associated with more than one observable transition (the case of indistinguishable events is treated in Chouchane et al. (2018)).

  • Assumption \(AS-5.\) A unique firing of a transition on all transitions occurs at each time (Assumption \(AS-5\) is a facility to express the sequences under a form without concurrency).

The assumptions about cyclicity of the structure and boundedness of the marking are not considered in this article contrary to many papers in this topic.

3.2 Computation of a guaranteed horizon

Given a sequence of the observed firing events of the transitions of \( TR_{obs}\) generated by the activity of the Petri net, we desire to find a guaranteed horizon such that any possible sequence \(x^{<k>}y^{<k>}\) of the Petri net can be expressed for step \(<k>.\) The principle is to take a pessimistic point of view of the behavior of the Petri net. Precisely, we consider the worst case in terms of number of firings for the unobservable transitions, which corresponds to the greatest possible length of any unobservable sequence \(x^{<k>}\) when Assumption \(AS-5\) is taken. Therefore, for step \(<k>\) and its relevant sequence \(x^{<k>}y^{<k>}\), a guaranteed horizon denoted as \(h_{g}^{<k>}\) is given by \(h_{g}^{<k>}=\max \parallel \overline{x}^{<k>}\parallel _{1}+1=max(c.\overline{x}^{<k>})+1\) with \( \overline{x}^{<k>}\) over the integers and \(c_{1\text {x}|TR_{un}|}\) unitary where 1 corresponds to the unique observation \(y^{<k>}.\) A consequence is the possibility to treat an estimation problem for any criterion Declerck (2021). This worst case treats all the transitions even if a given criterion exploits a subset of transitions.

For step \(<k=1>,\) a majorant relevant to the unobservable transitions is given by \(\max \parallel \overline{x}^{<k>}\parallel _{1}=max(c.\overline{x} ^{<k>})\) with \(c_{1\text {x}|TR_{un}|}\) unitary for the system Declerck (2021)

$$\begin{aligned} -W_{un}.\overline{x}^{<1>}\le b^{<1>} \end{aligned}$$
(1)

with \(\overline{x}^{<1>}\ge 0\) and \(b^{<1>}=M^{<1>}-W_{obs}^{-}.\overline{y} ^{<1>}.\)

For the following steps \(k\ge 2,\ \)we can consider the maximization \(max(c. \overline{x}^{<k>})\) for a sliding horizon reduced to \(<k>\) or \(max(c. \overline{x}^{<0>\rightarrow <k>})\) for a receding horizon going from \(<0>\) to \(<k>\). A possible system Declerck (2021) is

$$\begin{aligned} \begin{array}{c} \left( \begin{array}{cc} -W_{un} &{} 0 \\ -W_{un} &{} -W_{un} \end{array} \right) .\left( \begin{array}{c} \overline{x}^{<0>\rightarrow<k-1>} \\ \overline{x}^{<k>} \end{array} \right) \le \\ \left( \begin{array}{c} M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow<k-1>} \\ b^{<k>} \end{array} \right) \end{array} \end{aligned}$$
(2)

with \(\overline{x}^{<0>\rightarrow <k>}\ge 0\) and \(b^{<k>}=M^{<1>}+W_{obs} \cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs}^{-}.\overline{y}^{<k>}.\) Naturally, more complex systems including the firing conditions can be considered but the advantage of these forms is the fixed dimensions of the matrices which only depend on the numbers of places and transitions of the Petri net (and not the number of observations) even if greater values are obtained. With that in mind, a simplified form of (2) which facilitates the analysis is studied in the sequel.

3.3 Upper boundedness of the problem

If the computed value \(max(c.\overline{x} ^{<k>})\) is finite, the problem is upper-bounded, and the horizon length presents a majorant. So, we now analyze the conditions such that the above problems of optimization using (1) or (2) converge to a finite solution. With that aim, we consider the second line of (2) which includes inequality (1) as for \(k=1,\) \(\overline{x}^{<0>\rightarrow <0>}=0\) and \(b^{<1>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow<0>}-W_{obs}^{-}.\overline{y}^{<1>}=M^{<1>}-W_{obs}^{-}.\overline{y}^{<1>}\) since \(\overline{y}^{<0>\rightarrow<0>}=\overline{y}^{<0>}=0.\) We consider the Integer Linear Programming problem (ILP problem)

$$\begin{aligned} max(c.\overline{x}^{<0>\rightarrow<k>})\text { such that}-W_{un}.\overline{x} ^{<0>\rightarrow<k>}\le b^{<k>} \end{aligned}$$
(3)

with \(\overline{x}^{<0>\rightarrow <k>}\ge 0\) over \(\mathbb {Z}\) and \( b^{<k>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs}^{-}. \overline{y}^{<k>}\) for the succession of steps going from \(<0>\) to \(<k>.\)

Moreover, the ILP problem (3) is relaxed over \( \mathbb {R}\) and become a Linear Programming Problem (LP problem). The relevant maximization can converge to a greater value than the initial ILP problem as the space is not limited to the natural numbers. Formally, \( \underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\le \underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\) for \(-W_{un}.\overline{x} ^{<0>\rightarrow<k>}\le b^{<k>}.\) In addition, if the relaxed problem provides a finite solution, the same conclusion holds for ILP problem which is more restrictive.

A standard theorem of linear programming is that the dual problem of the primal problem max(c.x) subject to \(A.x\le b\) and \(x\ge 0\) is min(y.b) subject to \(y.A\ge c\) and \(y\ge 0.\) Consequently, we can deduce the LP problem dual to the ILP problem (3) relaxed over \( \mathbb {R}\) which is

$$\begin{aligned} \min (z.b^{<k>})\text { such that }z.W_{un}\le -c\text { with }c\text { unitary and }z\ge 0\text { over }\mathbb {R} \end{aligned}$$
(4)

Let \(\overline{x}_{opt}^{<0>\rightarrow <k>}\) and \(z_{opt}\), be the optimal solutions relevant to the ILP problem (3) relaxed over \(\mathbb {R}\) and LP problem (4).

Theorem 1

Declerck (2021) The maximization of problem (3) relaxed over \(\mathbb {R }\) is upper-bounded if there is \(z\ge 0\) over \(\mathbb {R}\) satisfying (4). Moreover, \(c.\overline{x}^{<0>\rightarrow<k>}\le z.b^{<k>}\) and \(c.\overline{x}_{opt}^{<0>\rightarrow<k>}=z_{opt}.b^{<k>}\). \(\blacksquare \)

Therefore, the ILP problem (3) relaxed over \( \mathbb {R}\) can be replaced by the problem (4 ).

3.4 New results on the computation of a guaranteed horizon

Theorem 1 is completed by the following theorems 2 and 3 which focus on the computation of a guaranteed horizon \(h_{g}^{<0>\rightarrow <k>}\) which represents a guaranteed receding horizon relevant to the succession of steps going from \(<0>\) to \(<k>.\)

Theorem 2

Assume that the space defined by \(z.W_{un}\le -c\) with c unitary and \(z\ge 0\) over \(\mathbb {R}\) is non-empty. For the succession of steps going from \(<0>\) to \(<k>,\) a guaranteed horizon is \( h_{g}^{<0>\rightarrow<k>}=\lfloor c.\overline{x}_{opt}^{<0>\rightarrow<k>}\rfloor +k=\lfloor z_{opt}.b^{<k>}\rfloor +k\) where \(\overline{x} _{opt}^{<0>\rightarrow <k>}\) and \(z_{opt}\) are the optimal solutions relevant to the ILP problem (3) relaxed over \( \mathbb {R}\) and LP problem (4).

Proof. Theorem 1 which considers the real numbers says that, if the space defined by \(z.W_{un}\le -c\) is non-empty (formally, \(\exists z\ge 0\) over \(\mathbb {R}\) satisfying \( z.W_{un}\le -c),\) then \(c.\overline{x}^{<0>\rightarrow <k>}\) for \(-W_{un}. \overline{x}^{<0>\rightarrow<k>}\le b^{<k>}\) is upper-bounded by \( z.b^{<k>}.\) The maximization of the ILP problem converges to a lower value than the relevant relaxed problem as the space of \(-W_{un}.\overline{x} ^{<0>\rightarrow<k>}\le b^{<k>}\) is restricted to the natural numbers. Formally, \(\underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\le \underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\) for \(-W_{un}.\overline{x} ^{<0>\rightarrow<k>}\le b^{<k>}.\) In addition, as c is an integer row-vector and the maximization over \(\mathbb {N}\) provides an integer solution, \(\underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\) is integer and \(\lfloor \underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\rfloor =\underset{ \mathbb {N} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})\le \lfloor \underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow <k>})\rfloor \). Moreover, \(\underset{ \mathbb {R} }{\max }(c.\overline{x}^{<0>\rightarrow<k>})=c.\overline{x} _{opt}^{<0>\rightarrow <k>}\) by definition and Theorem 1 gives \(c.\overline{x} _{opt}^{<0>\rightarrow<k>}=z_{opt}.b^{<k>}.\) Finally, variable k,  that is, the k observations \(y^{<k>}\) for the succession of steps going from \( <0>\) to \(<k>\) under Assumption \(AS-5,\) must be added to obtain \( h_{g}^{<0>\rightarrow <k>}.\) \(\blacksquare \)

Degraded horizons \(h_{g}^{<0>\rightarrow <k>}\) can be proposed if a non-optimal z is taken or if the vector \(b^{<k>}\) is modified.

Theorem 3

Assume that the space defined by \( z.W_{un}\le -c\) with c unitary and \(z\ge 0\) over \(\mathbb {R}\) is non-empty. Some possible guaranteed horizons \(h_{g}^{<0>\rightarrow<k>}=\lfloor z.b^{<k>}\rfloor +k\) with z over \(\mathbb {R}\) can be deduced if z is taken as follows:

  1. 1.

    z is an arbitrary solution of \(z.W_{un}\le -c\)

  2. 2.

    z is deduced from the optimization \(\min (z.d)\) for \(z.W_{un}\le -c\) where d is an arbitrary positive row-vector.

  3. 3.

    A possible guaranteed horizon is \(\lfloor z.b^{^{\prime }<k>}\rfloor +k \) with \(b^{\prime<k>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow <k-1>}\) where z is provided by Point 1 or 2.

Proof. Point 1) The second point of Theorem 1 says that any solution z to (4) satisfies \(c.\overline{x}^{<0>\rightarrow<k>}\le z.b^{<k>}.\) So, the relevent \(z.b^{<k>}\) is a possible majorant of \( c.\overline{x}_{opt}^{<0>\rightarrow <k>}\) and Theorem 2 can be applied.

Point 2) As point 1) says that any solution z satisfying \(z.W_{un}\le -c\) can be used, we can choose a minimization of z.d with an arbitray \(d>0\) as it always converges to an arbitrary finite solution as \(z\ge 0\) and \(d>0\).

Point 3) \(b^{<k>}=M^{<1>}+W_{obs}\cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs}^{-}.\overline{y}^{<k>}\le \) \(M^{<1>}+W_{obs}\cdot \overline{y }^{<0>\rightarrow <k-1>}\ \)as \(W_{obs}^{-}.\overline{y}^{<k>}\ge 0\) (\( W_{obs}^{-}\ge 0\) and \(\overline{y}^{<k>}\ge 0).\) So, \(b^{<k>}\le b^{\prime <k>}\) and \(z.b^{<k>}\le z.b^{\prime <k>}\) as \(z\ge 0.\) So, \( \lfloor z.b^{<k>}\rfloor \le \lfloor z.b^{^{\prime }<k>}\rfloor \). Each vector z given by Point 1) or 2) provides a possible guaranteed horizon \( \lfloor z.b^{^{\prime }<k>}\rfloor +k\). \(\blacksquare \)

Let us consider an execution of the computation made with (4). In Point 1), a horizon can be computed without waiting the end of this computation by taking an intermediate solution z while an arbitrary z for any step \(<k>\) can be computed with a simple criterion in Point 2). As \(b^{\prime <k>}\) does not contain \( \overline{y}^{<k>}\) in Point 3),  a degraded horizon can be computed at the end of step \(<k-1>\) without waiting the occurrence of observation \(y^{<k>}\) and the knowledge of \(\overline{y}^{<k>}.\) Knowing \(\overline{y}^{<k>},\) a better result is obtained with \(z.b^{<k>}.\)

3.5 RSB property

The following definition introduced in Declerck (2021) modifies the condition in Theorem 1 by replacing \(z.W_{un}\le -c\) by \(z.W_{un}<0\).

Definition 1

The unobservable induced subnet is Relaxed Structurally Bounded (RSB) when there exists a non-negative vector z over \(\mathbb {R}\) such that \( z.W_{un}<0.\) Formally,

$$\begin{aligned} \exists z\ge 0\text { such that }z.W_{un}<0\text { with }z\text { over } \mathbb {R} \end{aligned}$$
Fig. 1
figure 1

Elementary Petri net of example 1

Example 1

Let us consider the Petri net of Fig. 1 which is an elementary event graph containing a self-loop, a source transition \(y_{1}\) and a sinks transition \(y_{2}\). We have: \(P=\{p_{1},p_{2},p_{3}\};\) \( TR=TR_{obs}\bigcup TR_{un}\) with \(TR_{obs}=\{y_{1},y_{2}\}\) and \( TR_{un}=\{x_{1}\}.\) So, \(W_{obs}=\left( \begin{array}{rr} 1 &{} 0 \\ 0 &{} 0 \\ 0 &{} -1 \end{array} \right) \) and \(W_{un}=\left( \begin{array}{r} -1 \\ 0 \\ 1 \end{array} \right) .\) The unobservable induced subnet is RSB as \(\exists z=\left( \begin{array}{ccc} 2&1&1 \end{array} \right) \ge 0\) such that \(z.W_{un}=-1<0\) with z over \(\mathbb {R}\). Note that the Petri net is neither structurally bounded (\(\exists z\in \) \(\mathbb { N}^{3}\) with \(z>0\) and \(z.W\le 0\)), nor deadlock structurally bounded (\( \exists z\in \) \(\mathbb {N}^{3}\) with \(z>0\) and \(z.W<0\)) as the first column of W is non-negative. Remember that these definitions concern the marking and not the sequences. Now if the first element of the first column becomes \( -1,\) the unobservable induced subnet remains RSB but the Petri net becomes structurally bounded and deadlock structurally bounded (for \(z=\left( \begin{array}{rrr} 2&1&1 \end{array} \right) ,\) \(z.W=\left( \begin{array}{rrr} -2&-1&-1 \end{array} \right) <0).\) Also note that, if we assume \(W_{obs}=0\) in a general Petri net and add the conditions z over \(\mathbb {N}\) and \(z>0\) to a RSB unobservable induced subnet, the Petri net becomes structurally bounded and deadlock structurally bounded. \(\blacksquare \)

The following theorem makes the connection between the primal problem and the RSB property.

Theorem 4

Declerck (2021) The maximization \( max(c.\overline{x}^{<0>\rightarrow <k>})\) with \(c>0\) unitary over \(\mathbb {R}\) for system (3) is upper-bounded if the unobservable induced subnet is RSB.

Example 1 continued.

a) Let us analyze the upper-boundedness of the count number \(\overline{x} _{1} \) of the unobservable transition \(x_{1}.\) Note that the analysis of the Petri net in Fig. 1 shows that, for \(p_{1},\) if \( \overline{y}_{1}\) is upper-bounded, then \(\overline{x}_{1}\) is also upper-bounded. The resolution of (1) for the observation sequence \(y_{1}y_{2}\) gives \(\overline{x}_{1}^{<1>}=0\) for step \( <k=1>\) (no firing of \(x_{1}\) before observation \(y_{1}\)) and the resolution of (2) yields \(\overline{x}_{1}^{<1>}+ \overline{x}_{1}^{<2>}=1\) for step \(<k=2>\) (firing of \(x_{1}\) at step \(<1>\) or \(<2>\) before observation \(y_{2}\)).

b) Let us consider the dual problem (4). Theorem 4 implies that \(\overline{x}_{1}\) is upper-bounded as the unobservable induced subnet is RSB. These results which are consistent are obtained despite that the self-loop is not represented in the incidence matrices \(W_{obs}\) and \(W_{un}.\) In fact, the same computation can be made without place \(p_{2}\) and its initial token. \(\blacksquare \)

4 Composition of substructures and propagation of the RSB property

4.1 Introduction

Considering the primal problem (3) relaxed over \(\mathbb {R}\) where the objective is \(max(c.\overline{x}^{<k>})\) with c unitary, a necessary condition to define a guaranteed horizon is that each component \(\overline{x}_{i}\) must be upper-bounded. Indeed, if \( max(c_{i}.\overline{x}_{i})=+\infty \) with \(c_{i}\ne 0,\) the variable is not upper-bounded and a relevant guaranteed horizon cannot be computed. However, a possibility is to consider not the complete set of unobservable transitions but a subset. Therefore, the objective of Section 4 is to determine the set of variables which are upper-bounded. An interesting phenomenon which may be studied is a possible propagation of the upper-boundedness: if the event numbers of all input transitions of a place are upper-bounded, the event numbers of all the output transitions of this place are also upper-bounded.

In the rest of the paper, the objective is not to analyze the primal problem (3) relaxed over \(\mathbb {R}\) but the dual problem (4) and particularly the propagation of the RSB property through the composition of substructures in \(W_{un}.\)

4.2 Preliminary results

Let us analyze the columns of \(W_{un}.\)

Theorem 5

A sufficient condition such that \(W_{un}\) is non-RSB is the existence of a non-RSB column at least.

Proof. Consider a matrix \(A=W_{un}\ \)containing a non-RSB column denoted \(A_{2}.\) If \(A_{2}\) is non-RSB then \(\not \exists z_{2}\ge 0\) such that \(z_{2}.A_{2}<0.\) The consideration of the other columns of A which increases the constraints cannot improve this impossibility. \(\blacksquare \)

This result does not depend on the number of the non-RSB columns and their positions which can be separate. A practical way to reduce this difficulty is to add sensors such that the relevant unobservable transitions become observable.

Theorem 6

Let \(A=W_{un}=\left( \begin{array}{c} A_{1} \\ B \end{array} \right) .\) If the substructure \(A_{1}\) is RSB, then A is also RSB.

Proof If the substructure \(A_{1}\) is RSB, there is \(z_{1}\ge 0\) such that \(z_{1}.A_{1}<0.\) Let us build a new vector \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) \ge 0\) such that \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) .\left( \begin{array}{c} A_{1} \\ B \end{array} \right) \) is negative. If \(z_{2}=0,\) the product is equal to \(z_{1}.A_{1}\) and the vector \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) \ge 0\) is obtained. \(\blacksquare \)

In other words, a structure remains RSB if some rows are added to it.

4.3 Triangular structure

Let us consider more complex structures. We now assume that a reorganization of the rows and the columns of \(W_{un}\) has established a specific triangular structure where the blocks can be non-square and the top right corner contains null elements only. For each block, a rank can be numbered from the upper left corner to the lower right corner of the table.

We below consider the RSB characteristic of each block of the structure, that is, the blocks of the main diagonal and also the substructures making the connection between the blocks.

Before considering the general case, the block-triangular form is composed of two blocks below. Let

$$\begin{aligned} A=W_{un}=\left( \begin{array}{cc} A_{1} &{} 0 \\ B &{} A_{2} \end{array} \right) . \end{aligned}$$
(5)

where the substructures \(A_{1},\) B and \(A_{2}\) are rectangular in general. Matrix B makes the connection with the two substructures \(A_{1}\) and \( A_{2} \).

Considering the primal problem, the resolution in the following example illustrates the propagation of the upper-boundedness of \(\overline{x} ^{<0>\rightarrow <k>}\) inside the structure. The propagation of the upper-boundedness on the count variables is represented by the propagation of the RSB characteristic in the blocks. This propagation follows the increasing order of the blocks.

Example 2

Let \(A_{1}=\left( \begin{array}{cc} -1 &{} 1 \\ -1 &{} 3 \\ 1 &{} -3 \end{array} \right) ,\) \(A_{2}=\left( \begin{array}{cc} -1 &{} 1 \\ 1 &{} -4 \end{array} \right) \) and \(B=\left( \begin{array}{cc} 1 &{} 2 \\ -1 &{} 2 \end{array} \right) \) in (5). Let us consider \( -W_{un}.\overline{x}^{<0>\rightarrow<k>}\le b^{<k>}\) in the primal problem (3). To facilitate the writing of the expressions, we take \(\overline{x}=\overline{x}^{<0>\rightarrow <k>}\) and \( b=b^{<k>}=\left( \begin{array}{ccccc} b_{1}&b_{2}&b_{3}&b_{4}&b_{5} \end{array} \right) ^{T}.\) For \(-A_{1}.\left( \begin{array}{rr} \overline{x}_{1}&\overline{x}_{2} \end{array} \right) ^{T}\le \left( \begin{array}{ccc} b_{1}&b_{2}&b_{3} \end{array} \right) ^{T},\) \(\overline{x}_{1}\) and \(\overline{x}_{2}\) are upper-bounded as some obtained relations are \(\overline{x}_{2}\le (b_{1}+b_{3})/2\) and \( \overline{x}_{1}\le b_{2}+3.\overline{x}_{2}\) (also, \(\overline{x}_{1}\le b_{1}+\overline{x}_{2}).\)

For \(-A_{2}.\left( \begin{array}{rr} \overline{x}_{3}&\overline{x}_{4} \end{array} \right) ^{T}\le \left( \begin{array}{cc} b_{4}&b_{5} \end{array} \right) ^{T},\) \(\overline{x}_{3}\) and \(\overline{x}_{4}\) are upper-bounded as a resolution yields \(\overline{x}_{4}\le (b_{4}+b_{5})/3\) and \(\overline{ x}_{3}\le b_{4}+\overline{x}_{4}.\)

Now, for the complete rows \(-\left( \begin{array}{rr} B&A_{2} \end{array} \right) .\overline{x}\le b,\) a similar resolution gives \(\overline{x} _{4}\le (b_{4}+b_{5}+4.\overline{x}_{2})/3\) and \(\overline{x}_{3}\le b_{4}+ \overline{x}_{1}+2.\overline{x}_{2}+\overline{x}_{4}.\) So, \(\overline{x}_{3}\) and \(\overline{x}_{4}\) are upper-bounded as \(\overline{x}_{1}\) and \( \overline{x}_{2}\) are upper-bounded. \(\blacksquare \)

Considering the structure of A,  a first analysis shows that \(A_{1}\) can be treated without considering the rows of B and \(A_{2}.\) The dependence between \(A_{1}\) and \(A_{2}\) in (5) is expressed by each non-null component of B. When \(B=0\), there is no dependence with \(A_{1}\) and a resolution can treat the subsystem \(A_{2}\) independently of the treatment of \(A_{1}\) which can be RSB or not. So, \( A_{2} \) is independent when \(B=0.\) Note that a transition depending of an unbounded part can be upper-bounded as a dependence with another part which is upper-bounded is possible.

Theorem 7

If the substructures \(A_{1}\) and \(A_{2}\) are RSB in (5), then A is RSB.

Proof Let us consider the left columns of A. If the substructure \( A_{1}\) is RSB, there is \(z_{1}\ge 0\) such that \(z_{1}.A_{1}<0.\) Let us build a new vector \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) \ge 0\) such that \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) .\left( \begin{array}{c} A_{1} \\ B \end{array} \right) =z_{1}.A_{1}+z_{2}.B\) is negative.

We now show that for a given \(z_{1}\) such that \(z_{1}.A_{1}<0\), we can make a product by \(\lambda >0\) such that \(\lambda .z_{1}.A_{1}\le -u\) where u is a unitary row-vector with the adapted dimension. Indeed, if \( -1<z_{1}.(A_{1})_{.,i}<0\) for the column \((A_{1})_{.,i},\) then we can always multiply \(z_{1}\) by \(\lambda _{i}=1/|z_{1}.(A_{1})_{.,i}|\) with \( z_{1}.(A_{1})_{.,i}\ne 0\) which implies \((\lambda _{i}.z_{1}).(A_{1})_{.,i}=-1.\) The result is kept if we take a greater \( \lambda _{i}:\) \((\lambda _{i}^{\prime }.z_{1}).(A_{1})_{.,i}<-1\) for \( \lambda _{i}^{\prime }>\lambda _{i}.\) The reasoning can be repeated for all columns satisfying \(-1<z_{1}.(A_{1})_{.,i}<0.\) So, we can make a multiplication of \(z_{1}\) by taking the maximum \(\lambda _{\max }\) on all the obtained values \(\lambda _{i}\) and the desired result is obtained. With \( z_{1}^{\prime }=\lambda _{\max }.z_{1}\), we have \(z_{1}^{\prime }.A_{1}+z_{2}.B\le -u+z_{2}.B\) and we can take the new objective \( -u+z_{2}.B<0\) which implies \(z_{1}^{\prime }.A_{1}+z_{2}.B<0.\) So, for the left columns of A, we must have \(z_{2}.B<u\) with \(z_{2}\ge 0\ \)and also, \( z_{2}.A_{2}<0\) for the right columns of A. So, to prove that A is RSB, the problem is equivalent to search \(z_{2}\ge 0\) such that

$$\begin{aligned} \left\{ \begin{array}{c} z_{2}.A_{2}<0 \\ z_{2}.B<u \end{array} \right. . \end{aligned}$$

As \(A_{2}\) is RSB, there is an arbitrary vector \(z_{2}\ge 0\) such that \( z_{2}.A_{2}<0\) which is modified below. In general, the product \( z_{2}.B_{.,i}\) for each column \(B_{.,i}\) can be in the intervals \([1,+\infty [\) (case 1), [0, 1[ (case 2) or \(]-\infty ,[0\) (case 3).

Case 1. If the product satisfies \(z_{2}.B_{.,i}\ge 1,\) then the scalar \(z_{2}.B_{.,i}\) is positive,  and we can always divide \( z_{2}\) by a positive scalar \(\mu _{i}>z_{2}.B_{.,i}\) such that \((z_{2}/\mu _{i}).B_{.,i}<1.\) Note that \(\mu _{i}>1.\) The same reasoning holds for all the columns \(B_{.,i}\) satisfying the case 1 and we can make a division of \( z_{2}\) by taking the maximum \(\mu _{\max }\) on all the obtained values \(\mu _{i}>1\). This implies that the relations \((z_{2}/\mu _{\max }).B_{.,i}<1\) are kept for the columns \(B_{.,i}\) in the case 1. Let us show that the columns presenting the cases 2 and 3 satisfy \((z_{2}/\mu _{\max }).B_{.,i}<1\) with this new vector \(z_{2}/\mu _{\max }\).

Case 2. If a column \(B_{.,i}\) satisfies \(0\le z_{2}.B_{.,i}<1,\) then \(0\le (z_{2}/\mu _{\max }).B_{.,i}<z_{2}.B_{.,i}<1\) as \(\mu _{\max }\ge 1.\)

Case 3. If a column \(B_{.,i}\) satisfies \(z_{2}.B_{.,i}<0,\) then the sign of \((z_{2}/\mu _{\max }).B_{.,i}\) remains negative as \(\mu _{\max }>0.\)

Finally, we have build a desired vector \(z_{2}^{\prime }=z_{2}/\mu _{\max }\ge 0\) such \(z_{2}^{\prime }.B<u\) and consequently, \(\left( \begin{array}{cc} z_{1}^{\prime }&z_{2}^{\prime } \end{array} \right) \ge 0\) with \(z_{1}^{\prime }=\lambda _{\max }.z_{1}\) such that \( \left( \begin{array}{cc} z_{1}^{\prime }&z_{2}^{\prime } \end{array} \right) .\left( \begin{array}{c} A_{1} \\ B \end{array} \right) \) is negative and \(\left( \begin{array}{c} A_{1} \\ B \end{array} \right) \) is RSB. In addition, \(z_{2}.A_{2}<0\) implies \(z_{2}^{\prime }.A_{2}<0\) as \(\mu _{\max }>0\) and the complete structure A is RSB. \( \blacksquare \)

Example 2 continued.

The two substructures \(A_{1}\) and \(A_{2}\) are RSB: \(z_{1}.A_{1}=\left( \begin{array}{cc} -1&-1 \end{array} \right) \) with \(z_{1}=\left( \begin{array}{ccc} 2&2&3 \end{array} \right) \) and \(z_{2}.A_{2}=\left( \begin{array}{cc} -1/2&-1/4 \end{array} \right) \) with \(z_{2}=\left( \begin{array}{cc} 3/4&1/4 \end{array} \right) .\) We have \(\left( \begin{array}{cc} z_{1}&z_{2} \end{array} \right) .W_{un}=\left( \begin{array}{cccc} -1/2&+1&-1/2&-1/4 \end{array} \right) \nless 0\) but the new vector \(\left( \begin{array}{cc} z_{1}&z_{2}/\mu \end{array} \right) \) with \(\mu =4\) gives \(\left( \begin{array}{cc} z_{1}&z_{2}/\mu \end{array} \right) .W_{un}=\left( \begin{array}{cccc} -7/8&-1/2&-1/8&-1/16 \end{array} \right) <0\). So, \(W_{un}\) is RSB and Theorem 4 implies that all the components of the vector \(\overline{x}^{<0>\rightarrow <k>}\) are upper-bounded. This result is coherent with the direct analysis made above. \(\blacksquare \)

Theorem 7 is now generalized by considering a triangular form where the main diagonal contains more than two blocks.

Let RSB(A) be the matrix expressing the RSB property of each submatrix of A which is denoted by Y if RSB, N if non-RSB, − if indifferent and 0 if the submatrix is null.

Theorem 8

If the substructures of the main diagonal are RSB, then A is RSB.

Proof. Let us consider a structure A where the main diagonal contains three RSB blocks: \(RSB(A)=\left( \begin{array}{rrr} Y &{} 0 &{} 0 \\ - &{} Y &{} 0 \\ - &{} - &{} Y \end{array} \right) .\) Matrix A can be rewritten as \(A=\left( \begin{array}{rr} C_{11} &{} C_{12} \\ C_{21} &{} C_{22} \end{array} \right) \) with \(RSB(C_{11})=\left( \begin{array}{rr} Y &{} 0 \\ - &{} Y \end{array} \right) ,\) \(RSB(C_{12})=\left( \begin{array}{r} 0 \\ 0 \end{array} \right) ,\) \(RSB(C_{21})=\left( \begin{array}{rr} -&- \end{array} \right) \) and \(RSB(C_{22})=\left( Y\right) .\) The application of Theorem 7 implies that \(C_{11}\) is RSB. Moreover, the same theorem shows that A is RSB. The reiteration of this reasoning for greater dimensions gives the desired result. \(\blacksquare \)

5 Case study: a manufacturing system (example 3)

This example is a classical automated manufacturing system which has been presented in Section VII.A of Mahulea et al. (2012) and is recognized to be significant in the literature since slight variations of it have already been considered by different authors. The plant produces two different types of products from two types of raw materials. It consists of five machines (MA1 to MA5), four robots (R01 to R04), a finite capacity buffer (BU ), two inputs of raw parts and two automated guided vehicle systems ( AGV1 and AGV2). Presented in Mahulea et al. (2012), the description of the plant and its layout is out the scope of the paper.

Fig. 2
figure 2

Petri net of a manufacturing system (Example 3) Mahulea et al. (2012). The number in a place represents its initial marking

The Petri net Fig. 2 has 38 places and 26 transitions and corresponds to the Petri net Fig. 3 page 979 in Section VII.A of Mahulea et al. (2012) with the same initial marking. Remember that observable transitions are \(y_{1},\ldots ,y_{12}\) while \(x_{13},\ldots ,x_{26}\) correspond to unobservable transitions. Despite a complex aspect, the structural approach proposed in this paper shows that this Petri net can be represented under a more approachable form in a context of estimation: in fact, the incidence matrix of the unobservable induced subnet is large (38\( \times \)14) but sparse (by lake of space, this complex matrix is not given), and a judicious reorganization of the rows and columns can provide a clearer view under a block-triangular form presented below.

Corresponding to places \(p_{1},\) \(p_{14},\) \(p_{18},\) \(p_{19},\) \(p_{20},\) \( p_{21},\) \(p_{28},\) \(p_{32}\) and \(p_{37}\) the null rows of the unobservable incidence matrix cannot affect the sequence estimation and can be disregarded. As shown by Table 2, the reorganization of the rows and columns shows that the unobservable incidence matrix presents two disconnected parts denoted as \(S_{A}\) and \(S_{B}\) (named simply connected structures in graph theory, these disconnected parts can be found by a standard search of paths between the vertices). System \(S_{A}\) decribes the connections between 19 places and 11 unobservable transitions while system \(S_{B}\) in the bottom right corner is about 7 places and 3 unobservable transitions: \(S_{A}=( \{p_{33},p_{36},p_{38},p_{26},p_{11},p_{33},p_{9},p_{10},p_{5},p_{24},p_{3},p_{2},p_{7},p_{31},p_{8},p_{6},p_{27},p_{16},p_{15},p_{17},\) \(p_{4},p_{29},p_{25}\}\times \) \( \{x_{19},x_{20},x_{15},x_{17},x_{13},x_{14},x_{18},x_{16},x_{23},x_{22},x_{25}\}) \) and \(S_{B}=(\{p_{34},p_{30},\) \(p_{22},p_{12},p_{23},p_{13},p_{35}\}\times \{x_{21},x_{26},x_{24}\}).\) The unobservable induced subnet presents some circuits (\(x_{16}p_{6}x_{18}p_{25}x_{16}\), \( x_{15}p_{5}x_{17}p_{24}x_{15},x_{13}p_{3}x_{15}p_{29}x_{13})\) but system \( S_{B}\) has no circuit. This first decomposition suggests that the consideration of these systems can reduce the size of the matrices and vectors used in the estimation.

Presented in the appendix, the DM decomposition is a possible approach leading to a triangular form. Let consider an application of this technique presented in Table 2 and analyze the obtained subsystems.

Table 2 Incidence matrix of the unobservable induced Petri net in Fig. 2 (example 3) after reorganization of the rows and columns

System \(\textbf{S}_{A}\)

System \(\textbf{S}_{A}\) is composed of \(S_{1},\) \(S_{2},\) \(S_{3}\) and \(S_{4}\) defined as follows:

\(S_{1}=\{p_{33},p_{36},p_{38},p_{26},p_{11},p_{33},p_{9},p_{10}\}\times \{x_{19},x_{20}\},S_{2}=\{p_{5},p_{24},p_{3},p_{2},p_{7}\}\times \{x_{15},x_{17},x_{13},x_{14}\},\) \(S_{3}=\{p_{31},p_{8}\}\times \{x_{18}\}\) and \(S_{4}=\{p_{6},p_{27},p_{16},p_{15},p_{17},p_{4},p_{29},p_{25}\}\times \{x_{16},x_{23},x_{22},x_{25}\}\)

Highlighted in grey in the table, the four incidence matrices \(W_{un,1}\) for \(S_{1},\) \(W_{un,2}\) for \(S_{2},\) \(W_{un,3}\) for \(S_{3}\) and \(W_{un,4}\) for \( S_{4}\) can easily be expressed and are RSB. Indeed, the relevant possible row-vector \(z_{i}\) satisfying \(z_{i}.W_{un,i}<0\) are \(z_{1}=\left( \begin{array}{rrrrrrrr} 0&0&0&0&0&0&2&1 \end{array} \right) ,\) \(z_{2}=\left( \begin{array}{rrrrr} 3&1&3&4&1 \end{array} \right) ,\) \(z_{3}=\left( \begin{array}{rr} 2&1 \end{array} \right) \) and \(z_{4}=\left( \begin{array}{rrrrrrr} 1&2&2&3&1&2&0 \end{array} \right) .\) So, we can deduce that \(\textbf{S}_{A}\) is RSB (Theorem 8) and that all count variables of structure \(\textbf{S}_{A}\) are upper bounded (Theorem 4).

System \(S_{B}\)

The unique substructure of \(S_{B}\) is \(S_{5}= \{p_{34},p_{30},p_{22},p_{12},p_{23},p_{13},p_{35}\}\times \{x_{21},x_{26},x_{24}\}.\)

The incidence matrix \(W_{un,5}\) for \(S_{5}\) is RSB as a possible \(z_{5}\) satisfying \(z_{5}.W_{un}<0\) is \(z_{5}=\left( \begin{array}{rrrrrrr} 1&0&2&3&0&0&1 \end{array} \right) .\)

Finally, the analysis of \(\textbf{S}_{A}\) and \(S_{B}\) shows the that the Petri net has good properties as the unobservable induced incidence matrix \( W_{un}\) is completely RSB and all count variables of \(\textbf{S}_{A}\) and \( S_{B}\) are upper bounded.

We now show that the computation of a guaranteed horizon can be applied on a subsystem.

System \(S_{1}\)

Let us consider the subsystem \(S_{1}\) at step \(<1>\) and the relevant estimation of the sequence length for transitions \(x_{19}\) and \(x_{20}\). In inequality (1), we can consider the rows relevant to the places of the set \( \{p_{33},p_{36},p_{38},p_{26},p_{11},p_{33},p_{9},p_{10}\},\) the relevant marking \(M_{R}=\left( \begin{array}{rrrrrrrr} M_{33}&M_{36}&M_{38}&M_{26}&M_{11}&M_{33}&M_{9}&M_{10} \end{array} \right) ^{T}\) (R as Reduced, remember that \(M_{i}=M(p_{i}))\) and the relevant rows of the incidence matrices: \(W_{un,R}=\left( \begin{array}{c} \begin{array}{r} W_{un}(33,.) \\ W_{un}(36,.) \\ W_{un}(38,.) \\ W_{un}(26,.) \\ W_{un}(11,.) \\ W_{un}(33,.) \end{array} \\ W_{un}(9,.) \\ W_{un}(10,.) \end{array} \right) \) and \(W_{obs,R}^{-}=\left( \begin{array}{c} \begin{array}{r} W_{obs}^{-}(33,.) \\ W_{obs}^{-}(36,.) \\ W_{obs}^{-}(38,.) \\ W_{obs}^{-}(26,.) \\ W_{obs}^{-}(11,.) \\ W_{obs}^{-}(33,.) \end{array} \\ W_{obs}^{-}(9,.) \\ W_{obs}^{-}(10,.) \end{array} \right) .\) Moreover, we can consider the columns of \(\{x_{19},x_{20}\}\) and remove the null columns of \(W_{un,R}\) which becomes \(W_{un,1}\) already introduced. We obtain

$$\begin{aligned} -W_{un,1}\cdot \overline{x}_{R}^{<1>}\le b_{R}^{<1>} \end{aligned}$$

with \(\overline{x}_{R}=\left( \begin{array}{rr} \overline{x}_{19}&\overline{x}_{20} \end{array} \right) ^{T}\) and \(b_{R}^{<1>}=M_{R}^{<1>}-W_{obs,R}^{-}.\overline{y}^{<1>}.\) For the following steps \(<k>,\) the same reasoning implies that relation (3) becomes

$$\begin{aligned} -W_{un,1}.\overline{x}_{R}^{<0>\rightarrow<k>}\le b_{R}^{<k>} \end{aligned}$$
(6)

with \(b_{R}^{<k>}=M_{R}^{<1>}+W_{obs,R}\cdot \overline{y}^{<0>\rightarrow<k-1>}-W_{obs,R}^{-}.\overline{y}^{<k>}\)

The size of vectors \(\overline{x}_{R}^{<0>\rightarrow <k>},\) \(M_{R}^{<1>},\) \( \overline{y}^{<0>\rightarrow <k-1>}\)and \(\overline{y}^{<k>}\) are \((2\times 1),\) \((8\times 1),\) \((12\times 1)\) and \((12\times 1)\) respectively. The size of matrices \(W_{un,1},W_{obs,R}\) and \(W_{obs,R}^{-}\) are \((8\times 2),\) \((8\times 12),\) and \((8\times 12)\) respectively. Except \(W_{obs,R}\) which is given below, all the matrices can easily be deduced.

$$\begin{aligned} W_{obs,R}=\left( \begin{array}{rrrrrrrrrrrr} 0 &{} -1 &{} -1 &{} 0 &{} 0 &{} 0 &{} -1 &{} 1 &{} 0 &{} 0 &{} 0 &{} 0 \\ -1 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 \\ -1 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 1 &{} 0 &{} 0 &{} 0 &{} -1 \\ 0 &{} 0 &{} 0 &{} 1 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 0 &{} -1 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} -1 &{} -1 &{} 0 &{} 0 &{} -1 &{} 1 &{} 0 &{} 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 1 &{} 1 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 &{} 0 \end{array} \right) \end{aligned}$$

In the case of a receding horizon going from \(<0>\) to \(<k>\), the application of Theorem 2 allows to compute the guaranteed horizon with the following linear programming problem: \(h_{g}^{<0> \rightarrow<k>}=\lfloor \max (c.\overline{x}_{R}^{<0>\rightarrow <k>})\rfloor +k\) for (6) relaxed over \(\mathbb {R }\) where \(\overline{x}_{R}^{<0>\rightarrow <k>}\) and (6) replace \(\overline{x}^{<0>\rightarrow <k>}\) and \( -W_{un}.\overline{x}^{<0>\rightarrow<k>}\le b^{<k>}\) in (3), respectively. Taking the initial marking and the first observed words of Table I in Mahulea et al. (2012), the results are presented in Table 3. The execution time of each computation based on function glpk() of GNU Octave is negligible.

Table 3 Guaranteed horizon for subsystem \(S_{1}\) (example 3)

Table 3 already gives a rough estimate of the sequences as \(\overline{x}_{19}+\overline{x}_{20}\le \lfloor \max (c.\overline{x}_{R}^{<0>\rightarrow <k>})\rfloor \). When \( \lfloor \max (c.\overline{x}_{R}^{<0>\rightarrow <k>})\rfloor =0,\) the firings of \(x_{19}\) and \(x_{20}\) are not possible. Knowing the guaranteed horizon for the subsystem \(S_{1}\) and each step, the sequence estimation presented in Section 3.2 Declerck (2021) can be applied: using the counter form, the finite set of relations can be established and the timed sequences relevant to transitions \(x_{19}\) and \(x_{20}\) can be estimated with any criterion.

The same remarks hold for subsystems \(S_{2}\) and \(S_{5}\) which are independent. When the majorants of the count variables relevant to \(S_{1}\) and \(S_{2}\) are obtained, the structure suggests that the majorant of the count variable \( \overline{x}_{18}\) of \(S_{3}\) which depends on \(S_{1}\) and \(S_{2}\) can be estimated in a sequential way. The same remark holds for the count variable\(\{ \overline{x}_{16},\overline{x}_{23},\overline{x}_{22},\overline{x}_{25}\}\) of \(S_{4}.\) Finally, the sequential treatment of each reduced system with smaller dimensions facilitates the computation and brings local results independently of the resolution of the complete system.

6 Conclusion

In Section 3, we have introduced new results as Theorems 2 and 3 which focus on the computation of a guaranteed horizon \(h_{g}^{<0>\rightarrow <k>}\) that is, a guaranteed receding horizon relevant to the succession of steps going from \(<0>\) to \(<k>.\) We have analyzed the structure of the incidence matrix of the unobservable induced subnet in Section 4 and have shown that a structure presenting at least a non-RSB column is non-RSB. The composition of the RSB blocks shows a propagation of the RSB property through the structure proving that a part of the transitions relevant to the RSB blocks is upper bounded even if the complete system is non-RSB. The algebraic analysis can be applied to any triangular form which can be obtained with any technique as the Dulmage-Mendelsohn decomposition or a variant: a double application of this decomposition has been made in Section 5. The case study shows that the direct treatment of the large system system can be avoided and that the triangular form brings a sequential treatment allowing a computation based on smaller systems independently of the resolution of the complete system. The adaptation to any known estimation approach is clearly a perspective. As this proposed paper shows that the consideration of the triangular form of the incidence matrix is an useful way to analyze a Petri net, another perspective is the adaptation of this approach to analyze the properties of structurally boundedness, and deadlock structurally boundedness.