1 Introduction

In recent years, with the acceleration of urbanization in China, problems such as the rapid growth of urban populations, the growing demand for resources and environment, the urgent need to improve the unified management of cities and the unbalanced development of urban areas have become increasingly serious. As a punctual, fast, environmentally friendly, large-capacity and land-saving transportation mode, rail transit has gradually become a focus to achieve urban and regional development goals and spatial development strategies.

By the end of 2019, 208 urban rail transit lines had been opened in 40 cities across China, with a total length of 6736.2 km, including 5180.6 km of metros, accounting for 76.9%. A total of 3982 metro stations have been put into operation (transfer stations are not counted repeatedly), and a total of 317 car depots and parking lots have been put into operation [1].

According to incomplete statistics, Beijing handled 3.96 billion passenger trips in 2019, and Shanghai handled 3.88 billion passenger trips. The average daily passenger traffic in the two cities exceeded 10 million passengers. The four largest megacities, Beijing, Shanghai, Guangzhou and Shenzhen, contribute more than half of the total passenger traffic in China.

At present, electric traction is used in metros all over the world, and the supply and transmission of electrical energy is important to guarantee the safe and reliable operation of metros. In addition to supplying power to metro vehicles, electrical energy also supplies power to metro electromechanical equipment, including tunnel fans, station air conditioners, station escalators, automatic fare collection equipment, screen doors, drainage pumps, sewage pumps, platform doors, communication systems, signals, integrated monitoring systems, automatic fire alarm equipment, environmental and equipment monitoring equipment and various lighting facilities.

As the main energy source for metro systems, electrical energy consumption has continued to increase with the developing transportation network, and its magnitude cannot be ignored. In 2019, according to the statistical calculation of data reported by the operating units, the total power consumption of urban rail transit was 15.26 billion kWh, and year-on-year growth was 15.5%. Electricity charges account for 10.2% of operating costs [1]. With the continuous expansion of the urban rail transit network, its energy consumption will further increase, which will impose a heavy burden on urban energy supply, and at the same time, metro operators will face rising cost pressure.

In 2017, China issued the Comprehensive Work Plan for Energy Conservation and Emission Reduction in the Thirteenth Five-Year Plan, which requires vigorous development of public transportation, and clearly requires the share of public transportation in large cities to reach 30% by 2020.

On September 22, 2020, at the general debate of the 75th UN General Assembly, President Xi Jinping announced that China will increase its national independent contribution, adopt more powerful policies and measures, and strive to achieve the goal of “Peak carbon emissions by 2030 and achieving carbon neutrality by 2060” [2].

Against the background of energy saving and emission reduction, energy-saving technologies of metro systems have been applied to metro design, and energy-saving awareness has been implemented in actual metro operation. Some metro companies have set up an energy management platform to monitor and record metro energy consumption, analyze the collected basic data, manage energy quotas and implement cost accounting, which requires correcting waste behavior to improve the economic benefits of enterprises. At present, the configuration of meters in a metro energy management system can be basically divided into four levels [3]; the specific classification is shown in Table 1.

Table 1 Meter configuration of a metro energy management system

In the existing metro companies, in terms of energy meter allocation, typically only secondary meters are installed on the operation lines to measure the power consumption of vehicle traction and power lighting, to monitor the overall energy consumption of station and vehicle traction. The main reasons that only secondary meters are installed are as follows.

  1. (1)

    Considering the high management cost, for example, the traditional metro electromechanical and weak current systems are designed and constructed separately due to various professional systems. For example, the weak current system is divided into five systems: signal, communication, integrated monitoring, automatic fare collection and platform door systems. There are molecular systems in each system, and as a result, the design and construction requirements of various disciplines and subsystems are not unified, and the later operation is carried out in different respective departments. Because of the lack of overall planning, it is difficult to realize the unified installation of meters, resulting in a serious lack of itemized measurement data for the power consumption of the main electrical equipment (such as fans, water pumps, escalators, platform doors, signals, communication systems).

  2. (2)

    Since the existing metro lines in some cities were built some time ago, the original system needs to be transformed to build an energy management system and add meters. For example, each system of the weak current system has an independent backup power supply and distribution system, the transformation cost is high, and the utilization value of the collected energy measurement data is uncertain, which makes it more difficult for the metro company to build and utilize the energy metering network.

For these reasons, even if the metro company has built an energy consumption statistics and monitoring system, most of them only achieve a measurement range of level II, and only measure the total amount of the system or the total amount of the local range of energy consumption. The energy consumption data provided are only the common energy consumption data for local areas and multiple equipment, but do not measure and collect the energy consumption of the main energy-consuming equipment. Moreover, real-time synchronous statistical measurement data are not achieved, and the energy consumption process cannot be monitored and managed in real time.

At the same time, for energy management, the energy consumption quota includes an energy consumption quota per equipment and per process, which are important indices to evaluate the energy efficiency level and energy utilization rate of the project. At present, there is no energy consumption quota standard for the transportation field in China, but for the energy consumption and energy saving goal, the evaluation index of station energy consumption is given in the Beijing local standard evaluation method of energy consumption of urban rail transit (DB11/T-1035.2013). The local standard of Jiangsu Province, code for operation energy consumption calculation and energy conservation management of urban rail transit (metro), provides the recommended value for metro operation energy consumption in the statistical reporting period, such as 100 kWh/m2 for underground stations and 90 kWh/m2 for aboveground stations, and encourages its metro operation companies to achieve its recommended value. Among them, energy consumption data are mainly the data from energy meters collected in the process of energy use. Therefore, for an energy consumption quota, energy measurement data are the basis for energy users to formulate energy consumption quota standards.

The metro company lacks a complete energy metering network and cannot realize the energy metering of the main energy-consuming equipment. For energy management, the main defects are as follows.

  1. (1)

    Unable to measure and evaluate the details of the energy consumption process of end load equipment in a timely manner.

  2. (2)

    Unable to reasonably predict the energy consumption of equipment, which is convenient for scientific management.

  3. (3)

    The energy consumption data cannot be analyzed and compared with the energy-saving effect, and the abnormal energy consumption cannot be found in time, the energy consumption loopholes can be eliminated, the loss of energy consumption can be reduced, and the energy consumption process cannot be finely managed.

To sum up, the energy use of the metro is not only related to energy conservation and building an energy-saving city, but also directly related to the enterprise operation benefits of the metro company and the government financial subsidies to the metro company. Although most metro companies carry out energy-saving upgrading and transformation, there is also a lack of comparison and management of corresponding energy consumption quota data after energy-saving measures are taken, so it is impossible to assess whether the energy-saving technology and equipment utilized achieve the expected effect. Therefore, we should realize the full load measurement of endpoint equipment, build an energy measurement network, strengthen the value mining of energy measurement data, and realize the energy efficiency monitoring and management of the overall energy consumption facilities and professional systems of the ground.

Based on the Jinchuan Road station of Kunming Metro Line 4 in Kunming, Yunnan Province of China, we design the system and meter energy data of different levels, ranging from the power supply system of the whole station to the endpoint weak current electric equipment. With the 5-month measurement data, we predict the energy consumption of the equipment of the main weak current system of the station through the machine learning method to provide its energy consumption quota and optimize the energy consumption management of weak current system. Finally, it is extended to the energy consumption quota management of the whole line and the whole system to realize energy conservation and emission reduction. The specific steps are as follows.

  1. (1)

    Collecting relevant data required for this subject, such as basic data of Jinchuan Road Station of Kunming Metro Line 4 and load statistics of energy-consuming equipment.

  2. (2)

    Optimizing the distribution cabinet of the UPS system, and planning the range of the electrical energy meter according to the region.

  3. (3)

    Through machine learning on the collected data, the energy consumption quota of the weak current system of the station of Kunming Metro Line 4 is established, which further lays the foundation for the energy consumption quota of the station.

2 Power supply requirements and present situation of the metro weak current system

The main energy used and consumed by the metro includes electricity, water, fuel oil and gas. Most of the energy consumption in vehicle operation and electromechanical equipment is electrical energy. Water is also consumed in air conditioning equipment, and fuel oil and gas are consumed in the maintenance of equipment. However, the main energy consumption is still electrical energy. Therefore, to understand the current situation of metro energy consumption, we must understand the metro power supply system. The metro power supply system mainly includes the external power supply, main substation, step-down substation, traction substation, train traction power supply system, station power lighting power supply system and power monitoring system. The main power systems include the traction power supply and lighting power supply systems. The traction power supply system includes the traction substation and traction network, and the lighting power supply system includes the step-down substation and power lighting distribution system [4].

2.1 Metro power supply requirements and load classification

The overall power system of the metro industry is generally divided into the following parts: external power supply, main substation, traction step-down substation, stray current corrosion protection system and power monitoring system. The specific structure is shown in Fig. 1.

  1. (1)

    The external power supply of the metro power supply system is the external urban power grid supplied by the main substation of the metro power supply system. There are three power supply schemes of the external power supply: centralized power supply, decentralized power supply and hybrid power supply. The centralized power supply usually introduces two power sources from the 110 kV side of the urban power grid. According to the requirements of the metro design code, at least one power source is a dedicated line.

  2. (2)

    The main function of the main substation is to receive external high-voltage power (usually 110 kV) and provide medium-voltage power (usually 35 kV or 10 kV) for the traction substation and step-down substation after step-down. The main substation is suitable for the centralized power supply. The connection mode of the main substation is line-type or bridge-type.

  3. (3)

    The function of the traction power supply system is to transform AC medium voltage into DC 1500 V or DC 750 V after step-down rectification, and to provide the traction power supply for metro trains. The system includes a traction substation and traction network, which includes the catenary and return network. Overhead catenary (DC 1500 V) and contact rail (DC 1500 V or 750 V) are two suspension modes. At present, most metros use the running rail as the return network, while a few metros use a separate return rail.

  4. (4)

    The main function of the lighting power supply system is to step down AC medium voltage (35 kV or 10 kV) to AC 220 V and AC 380 V which can be used by the electromechanical equipment, and provide power for various electromechanical and weak current equipment required for operation.

  5. (5)

    The stray current leaks from the running rail, flows into the ground through the metro track bed, and then flows back from the ground to the rail return point. Since the rail is buried on the ground surface, which makes it easy to find and replace damaged parts, the stray current corrosion is not very harmful. However, because the steel bars of the main structure and metal pipelines of the metro are buried underground, corrosion is not easy to detect, so stray current corrosion is very harmful to these steel bars and metal pipelines. Therefore, the main function of the stray current corrosion protection system is to reduce stray current caused by the DC traction power supply and prevent it from spreading to the outside world, to avoid electrical corrosion of stray current to the main structure of the urban rail transit and its adjacent structural steel bars and metal pipelines, and to monitor stray current and corrosion protection.

  6. (6)

    The main function of the power monitoring system is to collect and monitor remote data of the metro substation, step-down substation, and catenary and other equipment in real time. In the metro control center, electric power dispatchers carry out telemetry, remote communication, remote control and remote adjustment control on the main electrical equipment through a dispatching terminal, communication channel and substation integrated automation system, to realize the operation and management of the whole metro power supply system.

Fig. 1
figure 1

Power system logic diagram of metro industry

2.2 Classification requirements and standard of metro power load

The metro power supply system is designed to satisfy different electrical energy requirements of different metro users. Therefore, it is necessary to classify the different power loads. The power load is divided into class I, class II and class III load according to the requirements for power supply reliability and the impact of power loss [4]. The most important electrical energy user is defined as the class I load user. According to the standard requirements, the class I load must be powered by a double power supply and double circuit line to ensure normal operation of the load. For particularly important loads in class I, an emergency power supply must be equipped and separated from other loads. The standard requires that the secondary load adopt a dual power input mode and single circuit special line power supply. The class III load can be powered by a single power supply and single circuit line [4]. The classification of power lighting load is shown in Table 2.

Table 2 Classification of power lighting load

In metro power supply, in addition to distinguishing three different loads, some individual class I loads are defined as particularly important loads and must be equipped with an uninterruptible power supply (UPS). They include "automatic fire alarm system equipment, environment and equipment monitoring system equipment, special communication system equipment, signal system equipment, substation operation power supply, underground station and section emergency lighting" [4].

The power supply quality and continuity of these loads are very important for the safe and reliable operation of the metro system. A UPS is set for especially important loads. These loads of are shown in Table 3.

Table 3 Especially important loads in metro stations

From the above standard requirements and application requirements, it can be seen that all of the especially important loads are the metro weak current system. Thus, the scientific application and coordination of the power supply of the metro station weak current system can ensure the safe and stable operation of the urban metro and rail transit, and avoid the potential safety hazards and accidents of the metro weak current system. Therefore, the weak current system needs to be equipped with a UPS to ensure the reliability and continuity of power supply to these important systems.

2.3 Power supply system of the metro weak current system

The power supply of the metro weak current system requires high reliability, so a UPS must be set to ensure quality and continuity of power equipment. The main function of a UPS is to continuously provide AC to the load. It is an independent power supply with high quality, high stability and high reliability. The UPS realizes an AC uninterrupted power supply and purifies the municipal power into stable and pure AC, and at the same time, it cooperates with the battery pack to realize continuous power supply in the case of power failure. Therefore, the UPS system has been widely used [5,6,7].

The UPS system is generally composed of power input circuits including the main circuit, bypass and battery, rectifier (REC) for AC/DC conversion, inverter (INV) for DC/AC conversion, inverter and bypass output switching circuit, energy storage battery and other main parts. The schematic diagram of UPS composition is shown in Fig. 2.

Fig. 2
figure 2

Schematic diagram of UPS composition

When the commercial power of the UPS is normal, the commercial power is first converted into DC by a rectifier, then into AC by an inverter, and then output to the load after electrical isolation, ensuring that the output electrical energy is regenerated and purified completely independently of the commercial power. When the commercial power is interrupted, the battery is converted into AC output through the inverter to ensure the uninterrupted normal power supply of the load.

UPS and independent power distribution cabinets are set for traditional weak current systems. The details are shown in Fig. 3.

Fig. 3
figure 3

UPS diagram of traditional weak current system

2.4 Present status of metro energy conservation and emission reduction

There are three main ways to achieve the goal of energy saving and emission reduction and to improve the energy use rate for metros: management energy saving, structural energy saving and technical energy saving.

Structural energy saving mainly involves adjustment of the energy structure. The metro industry has introduced solar energy, which promotes the environmental benefits of photovoltaic power consumption and reduces the economic costs of urban rail operation. It also has the social benefits of improving energy utilization forms and satisfying people's desire for green travel [8].

From the perspective of technical energy saving, many metros have now put forward a structural improvement method of energy-saving slope through the optimization design of line structure to effectively save energy combined with the step-by-step situation of metro energy consumption, for the traction power supply of trains in the main energy-consuming equipment [9]. Also, regenerative braking technology, combined with the adjustment of the operation chart of the network operation, and integrated solutions are used to save energy [10,11,12,13,14].

As for the main factors affecting the environment of the station platform and station hall, piston wind, train heat production, particulate matter and other pollution and noise mainly come from the tunnel [15], and energy saving is realized through the screen door system which separates the platform from the station tunnel [16, 17]. The design of power lighting and distribution for stations is carried out according to the principle of a graded load power supply, and various loads are divided into grades I, II or III according to their importance. The power supply system is simplified and the distribution equipment is saved. Adopting centralized reactive power automatic compensation and decentralized reactive power compensation measures improves the power factor and reduces line loss. In the station lighting system, high-efficiency and energy-saving light sources and lamps and lanterns are mainly selected. In the escalator and fan system, frequency conversion control is selected to save equipment power consumption. At the same time, the management is strengthened. After the morning and evening peaks, some public lighting equipment is turned off, and the escalator operates at low speed by frequency conversion [18,19,20,21,22,23].

From the aspect of management and energy saving, many metro companies have established their own energy management platforms, such as Beijing Metro and Nanjing Metro, which have set up their own energy consumption data models [24, 25]. Beijing-Hong Kong Metro, Guiyang Metro and Harbin Metro have also established their own energy management platforms [26,27,28,29]. The energy management platforms have realized the automatic and real-time collection of energy consumption data for each link of the power supply system in the form of classification, sub-item and household. In August 2019, the China Urban Rail Transit Association and China Building Energy Conservation Association jointly issued the group standard "Evaluation Standard for Green Urban Rail Transit Stations", which put forward a detailed measurement function of power consumption of different systems and different types of equipment and the evaluation requirements for setting up energy management systems [30].

In theory, energy consumption measurement is a precondition to realize energy saving and emission reduction in management. But in practice, it is realized by manual meter reading, copying the grid entrance settlement meter of the substation, and there is basically no meter in the subsequent links, so it is impossible to practice better quota management [20, 31].

For different buildings, the energy consumed to provide people's comfort (e.g. air conditioning, heating) can be properly considered in order to ensure the basic operation of the building, and the energy-saving goal can be achieved by restraining extravagant energy consumption, which can be achieved by formulating energy consumption quotas. The energy consumption quota should refer to the energy consumed to meet the basic operating requirements of buildings [32, 33].

Therefore, it is important to collect the current total energy consumption data for stations and energy consumption data for various equipment, combined with passenger flow data, environmental data and other data, and analysis and mining through a big data platform, in order to achieve continuous learning and improvement of the prediction model. Experts and scholars at home and abroad have put forward two machine learning methods, support vector regression (SVR) and random forest regression (RFR), to predict the energy consumption of metros [34]. A method for building a time series analysis model of metro energy consumption has been proposed using correlation analysis software and performing stationarity tests, data difference and order analysis to predict and analyze metro energy consumption in the next few years by comparing theoretical and actual energy consumption values [35]. Some experts have also put forward a building energy consumption prediction method (principal component analysis-radial basis function [PCA-RBF]) combining PCA with a neural network, which effectively improves the prediction accuracy of building energy consumption [36]. International scholars have put forward an intelligent method for predicting the energy consumption of metro stations by simulating the energy consumption system of a Hong Kong ground station based on an artificial neural network algorithm, which can be used to predict the weekly power consumption of metro stations and provide a basis for engineering applications [37].

The prediction model combined with the energy management platform optimizes the control algorithm of electrical equipment by a mathematical model, and it is compared with domestic and international industries. For example, the China Urban Rail Transit Association statistically analyzed the annual energy consumption and passenger transport data of the national metro in 2019 according to the Evaluation Method of Energy Consumption and Emission Index of Urban Rail Transit, in which the average total energy consumption per person-kilometer of urban rail transit is 0.155 kWh, and the average total energy consumption per vehicle-kilometer is 4.19 kWh [1]. Through forecasting and benchmarking, the energy consumption model is established, the energy consumption quota management system is implemented, and the energy management work is guided by the energy management platform. Finally, a better energy management and monitoring platform is provided for metro operation managers, so that many metro stations of different types can save energy and reduce emissions.

3 Energy saving design of comprehensive power supply for metro weak current system

A metro station is a type of public building, which is an important place for passengers to get on, get off, transfer and wait. In order to provide a safe, comfortable and sanitary environment for passengers, the station is generally composed of power supply, electromechanical and weak current systems, and the main energy consumption is electricity consumption. The weak current system of the station mainly ensures the safe operation of the station, and its energy consumption is affected by many factors including passenger satisfaction, station area, passenger flow and ambient temperature outside the station. Different factors affect the weak current power consumption of the station, and the analysis of their influence is helpful to clarify the energy consumption characteristics of the weak current system of the station. By collecting and learning the main influencing factors after analyzing the influencing factors, the energy management platform can estimate the future energy consumption demand of the metro, accurately predict the energy consumption needed during a certain period in the metro station, reduce the energy consumption of the metro station as much as possible while ensuring its reliable operation, and finally guide the station to manage energy and realize energy saving and emission reduction.

To predict the station energy consumption and provide the station energy consumption quota, it is necessary to adopt a prediction algorithm and increase the energy consumption meter. In this paper, firstly, an important factor analysis method is used to select the factors that have great influence on the energy consumption of the station, and based on this, a method for predicting the energy consumption quota of the weak current system equipment is put forward to achieve an energy-saving design of the weak current system equipment. This method can be extended to other energy-saving design scenarios. The specific process is shown in Fig. 4.

Fig. 4
figure 4

Flow chart of energy consumption quota prediction and analysis of weak current system

3.1 Correlation analysis of energy consumption of the station weak current system

The energy consumption data for the weak current system equipment in urban rail transit stations is characterized by different energy consumption ranges in different regions and different fluctuation of energy consumption in different weak current subsystems. The key problem is how to extract the characteristics of energy consumption data which have a great influence on the energy consumption of the weak current system equipment, and then to accurately predict the energy consumption of the weak current system equipment in the stations. There are many factors that affect the energy consumption of weak current systems in stations, but the degree of influence is different. This paper utilizes Pearson's correlation coefficient method to determine the characteristic parameters that have a significant impact on the energy consumption cost of weak current systems.

In statistics, Pearson's correlation coefficient method is used to measure the correlation degree (linear correlation) between two variables X and Y, and its value is between −1 and 1. The closer the Pearson value is to −1, the stronger the negative correlation between the two variables; the closer the Pearson value is to 1, the stronger the positive correlation between the two variables, and 0 indicates that the two variables have no correlation. The significance of Pearson's correlation coefficient method is used to measure whether the linear relationship between two variables is reliable, and its values represent different significant relationships between two variables in different ranges. When the significance value is less than or equal to 0.01, it means that there is a very significant linear relationship between samples; when the significance value is between 0.01 and 0.05, the linear relationship between samples is not significant.

Based on the existing data, through analysis and combining with the actual situation, this paper mainly analyzes from three aspects: influencing factors of station area, influencing factors of station passenger flow and influencing factors of various types of equipment quantity in the station weak current system. Pearson's correlation coefficient method is used to analyze the relationship between these different influencing factors and equipment energy consumption of the weak current system, and to determine the main influencing factors of energy consumption of the weak current system in stations. The correlation evaluation corresponding to correlation coefficient is shown in Table 4, and the specific result after analysis is shown in Table 5.

Table 4 Correlation coefficient and correlation analysis
Table 5 Correlation analysis results of different influencing factors on energy consumption of the station weak current system

As can be seen from Table 5, for different stations, the metro station floorage factor has the greatest correlation with the energy consumption of the weak current system and strong correlation with the equipment quantity, while the passenger flow factor has less influence. Therefore, when designing meters, we should consider the different station areas and the power of equipment to set up fourth-level meters in order to achieve more detailed monitoring and quota prediction of energy consumption.

3.2 Integration methods for the station weak current power supply system

Through the correlation analysis of the energy consumption of the weak current system, the floorage has the greatest impact on energy consumption. Therefore, the power supply system needs to increase the metering of the weak current system equipment accordingly. However, considering that the traditional weak current system mainly purchases UPS from each system separately and sets different distribution boxes, it is impossible to set meters according to the region. At the same time, the discrete setting of UPS also has the problem of low load rate, resulting in high UPS configuration and inability to achieve the ideal energy efficiency value.

Based on the above problems, the method of power integration is proposed in order to ensure the power supply safety and reliability of the weak current system. The main principle of power integration is that the integrated power system shall meet the technical requirements of all integrated systems for power supply and ensure the reliable and safe operation of each system. The integration of the power supply system shall be calculated uniformly according to the load nature, capacity demand and standby time of each integrated system, to select the capacity of UPS and battery. In addition, monitoring requirements, overvoltage protection and grounding measures and requirements for ambient temperature shall also be considered.

In recent years, the domestic urban rail transit system has integrated the UPS system to varying degrees, and the integration mode and scope are different in different regions and different lines. The rapid development and maturity of control technology and high-capacity power supply systems provide a guarantee for the centralized integration of UPS in urban rail transit.

Based on this, this paper proposes to integrate all weak current system power supplies into an integrated power supply, and transform the traditional single system mode into a centralized (1 + 1) parallel double-bus integrated power supply. The integrated power supply is provided by the lighting power design with two main power supplies, which are respectively supplied to two UPS hosts through two isolation transformers. The battery pack provides a backup power supply guarantee for the integrated power supply system. The UPS provides stable and reliable uninterruptible power to the main distribution cabinet and lower distribution cabinet/box. Switches and meters are set in the main distribution cabinet and lower distribution cabinet according to the area, to achieve distinctive area measurement.

According to the functional area, the station can be divided into platform, station hall, transfer channel, and entrance and exit. The station hall is also divided into equipment and management room area and public area. Public areas are divided into paid areas and unpaid areas according to whether or not passengers pay. The platform is also divided into waiting area and equipment room area. The specific power system partition is shown in Fig. 5.

Fig. 5
figure 5

Schematic diagram of integrated power supply system

3.3 Energy consumption prediction algorithm for the station weak current system

Machine learning is used to predict the energy consumption of station weak current systems. In the application of machine learning, feature selection is usually an important step. For the actual model building, there are usually too many variables in the data set, and most of the variables have nothing to do with the target problem. There are several disadvantages to dealing with an excessively large feature set. One is that purely technical processing of large feature sets slows down the algorithm, takes up too many resources, and is very inconvenient. Another, more important issue is that when the number of variables is significantly higher than the optimal one, the accuracy of many machine learning algorithms will decrease. Among the many feature selection methods, the Boruta algorithm is used to extract the feature data that have a large influence on the target value from the original feature data, and the PCA algorithm is used to fuse and reduce the dimensions of the extracted feature data that have a large influence on the target value, to obtain the feature data which can better reflect the important influence on the target value. As for the machine learning algorithm, boosting is a machine learning algorithm that can be used to reduce the deviation in supervised learning. It mainly obtains the sample subset by operating the sample set, and then trains a series of base classifiers on the sample subset using the weak classification algorithm. XGBoost is one such algorithm with a good application effect in numerical prediction, but its disadvantage is that it cannot predict high-dimensional data. According to the characteristics of the above three algorithms, this paper proposes a Boruta + PCA + XGBoost algorithm, which can accurately predict the energy consumption of equipment in the weak current system of stations. The specific algorithm flow is shown in Fig. 6. The algorithm is divided into two parts: model training and model calling. In the model training part, after Boruta feature screening and PCA feature fusion for dimension reduction, the XGBoost algorithm is used to train the model, and the model is deployed to production for interface calling. In the calling part, the data generated by production in turn go through the same process as in training, and finally the trained model is called to obtain the energy consumption prediction quota result.

Fig. 6
figure 6

Flowchart of Boruta + PCA + XGBoost algorithm

3.3.1 Boruta algorithm

The Boruta algorithm is a wrapper method based on a random forest algorithm [38]. Random forest is an integrated method which performs classification by voting on multiple unbiased weak classifiers, such as a decision tree. A characteristic of the Boruta algorithm is that it can filter out all the feature sets that are related to the dependent variables, instead of only the feature sets that minimize the loss function of the current model. This helps us to screen out all the energy consumption-related feature data, delete redundant features and determine the best feature subset.

The basic idea of the Boruta algorithm is to evaluate the importance of each characteristic variable by a cyclic method. Firstly, the input original feature matrix X is randomly shuffled to obtain shadow features. Then, the input original feature matrix X is spliced with the obtained shadow features, and a new feature matrix Y is obtained by random scrambling. The random forest classifier is run to calculate the z-score of all features on the feature matrix Y, as shown in Eq. 1. Then the maximum z-score among shadow attributes (MZSA) is compared with the z-score of each feature of the original feature matrix X. When the z-score of the feature of the original feature matrix X is far lower than MZSA, the feature is marked as unimportant. Reciprocating and cycling is performed until all features of the original feature matrix X have been assigned importance or the number of random forest iterations has been reached, and finally the best feature subset is obtained.

$$\mathrm{Z}-\mathrm{scores}=\frac{{Y}_{i}-\overline{{Y}_{i}}}{{S}_{i}}$$
(1)

Formula: \({Y}_{i}\)——The ith characteristic of the new characteristic matrix Y;

\(\overline{{Y}_{i}}\)——The ith eigenvalue average of the new eigenvalue matrix Y;

\({S}_{i}\)——The ith characteristic standard value of the new characteristic matrix Y.

The specific flow of the Boruta algorithm is shown in Fig. 7. Firstly, we input the related feature matrix X of energy consumption quota model and shuffle each feature value of X to get shadow features. Then, X and shadow features are spliced to form a new feature matrix Y, and a random forest classifier is run on Y to calculate the z-score of each feature of Y. Then we find the largest z-score in shadow features, and mark it as MZSA. We compare the z-score of each feature in X with it, and delete the features whose z-scores are significantly smaller than MZSA. We repeat the above process until all the features left by X are marked as important.

Fig. 7
figure 7

Flow chart of the Boruta algorithm

3.3.2 PCA algorithm

The PCA algorithm is a multivariate statistical method. It is one of the most commonly used dimensionality reduction methods. It transforms a set of potentially correlated variable data into a set of linearly uncorrelated variables through orthogonal transformation. The transformed variable is called the principal component [39].

The optimization goal of the PCA problem is to reduce a set of an n-dimensional vector matrix X to k-dimensional (0 < k < n), and the goal is to select k unit orthogonal bases so that after the original data are transformed to this set of bases, the covariance between each feature is 0, and the variance of the feature is as large as possible; the largest k variances are taken under the constraint of orthogonality.

The flow of the PCA algorithm is shown in Fig. 8. At first, the feature matrix X filtered by the Boruta algorithm in the previous step is input, and the covariance matrix C of the matrix X after zero averaging is calculated. Then we calculate the eigenvalues of C and the corresponding eigenvectors, arrange the eigenvectors into a matrix according to the eigenvalues from large to small, and take the first k rows of matrix P. At last, P and X after zero averaging are multiplied by matrix, and the final resulting matrix Y reduced to k dimensions is obtained.

Fig. 8
figure 8

PCA algorithm flow chart

3.3.3 XGBoost algorithm

The XGBoost algorithm is an integrated learning method based on a classification and regression tree (CART) [40]. Unlike the random forest that gives each decision tree the same voting weight, the generation of the next decision tree in the XGBoost algorithm is related to the training and prediction of the previous decision tree (by training the previous decision tree with a lower accuracy and giving the samples higher learning weights to improve the accuracy of the model). Compared with other ensemble learning algorithms, XGBoost on the one hand improves the robustness of the model by introducing regular terms and column sampling, and on the other hand, it adopts a parallelization strategy when each tree selects the split point, thereby greatly improving the operation speed of this model.

XGBoost is an additive model composed of multiple base models. If the tree model to be trained in the tth iteration is \(f_{t} \left( x \right)\), then:

$$\hat{y}_{i}^{\left( t \right)} = \sum\limits_{k = 1}^{t} {f_{x} \left( {x_{i} } \right)} = \hat{y}_{i}^{{\left( {t -1} \right)}} + f_{t} \left( {x_{i} } \right),$$
(2)

where \(\hat{y}_{i}^{\left( t \right)}\) represents the prediction result of energy consumption sample i after the tth iteration, and \(f_{t} \left( {x_{i} } \right)\) is the model of the tth tree. The objective function of XGBoost is defined as follows:

$${\text{Obj}} = \sum\limits_{i = 1}^{n} {l\left( {y_{i} ,\hat{y}_{i} } \right)} + \sum\limits_{i = 1}^{t} {\Omega \left( {f_{i} } \right)},$$
(3)

where \(\sum\limits_{i = 1}^{t} {\Omega \left( {f_{i} } \right)}\) is to sum the complexity of all t trees and add them to the objective function as a regularization term to prevent over fitting of the model. \(\sum\limits_{i = 1}^{n} {l\left( {y_{i} ,\hat{y}_{i} } \right)}\) is the loss function.

Then, according to the forward step-by-step algorithm, in step t, the predicted value of the model for the ith sample is:

$$\hat{y}_{i}^{\left( t \right)} = \hat{y}_{i}^{{\left( {t -1} \right)}} + f_{t} \left( {x_{i} } \right)$$
(4)

Among them, \(f_{t} \left( {x_{i} } \right)\) is the predicted value of the residual between the predicted value \(\hat{y}_{i}^{{\left( {t -1} \right)}}\) and the real value in fitting step t − 1. XGBoost fits the residual through continuous iteration. When the set maximum tree model depth is reached, the iteration is stopped.

4 Implementation of an integrated power supply system for the weak current system of Kunming Rail Transit Line 4

Kunming Rail Transit Line 4 is a rapid rail transit line connecting Kunming's main urban area and Chenggong New District. Kunming Metro Line 4 passes through Wuhua District, High-tech Zone, Panlong District, Guandu District, Economic Development Zone and Chenggong New District, with high-density passenger flow along the line, which is of great significance and has a role in solving the urban traffic problems in Kunming and driving the development along Line 4.

Kunming Metro Line 4 starts from Chenjiaying Station in the northwest of Kunming's main city and ends at Kunming South Railway Station in Chenggong New Town. The route diagram of Kunming Rail Transit Line 4 is shown in Fig. 9. The total length of Line 4 is 43.396 km, the elevated line is 3.491 km, the ground line and U-shaped trough-section are 0.781 km, and the underground line is 39.124 km. There are 29 stations in total along Line 4, including three elevated stations, 26 underground stations and 14 transfer stations. Line 4 has a depot at Dayantian and two parking lots, Guangwei parking lot and Bailongtan parking lot. The three main substations are the North Railway Station, Maju and Dounan main substations, among which a new Maju main substation is built.

Fig. 9
figure 9

Route diagram of Kunming Rail Transit Line 4

The weak current system design of Kunming Rail Transit Line 4 breaks the traditional framework of the weak current system and adopts a new system reconstructed by deeply integrating 21 weak current subsystems. The cloud-based rail intelligent automation system (CRIAS for short) adopts innovative fusion architecture, which completely replaces the chimney architecture of the traditional weak current system. Traditional weak current systems such as integrated monitoring systems, automatic train monitoring systems, broadcasting systems, passenger information systems, centralized alarm systems, automatic fare collection systems, platform doors, and communication systems will only exist in name, but in fact they are no longer independent. Its infrastructure layer, data acquisition layer, data processing layer and application layer are all integrated into the new overall architecture. The system weakens the system division of labor, strengthens professional functions, and places more emphasis on serving metro operations and passengers. It provides basic support for the innovation of management mode and the reform of maintenance management mechanisms of operating enterprises, realizes green energy saving, reduces staff and increases efficiency, and finally realizes smart metro.

4.1 The integrated power supply system of Rail Transit Line 4 of Kunming

The integrated power supply equipment in the CRIAS is responsible for providing uninterrupted power supply to the weak current systems in the station and depot, including the automatic monitoring of communication and signal systems, automatic fire alarm systems, environment and equipment monitoring systems, power monitoring systems, access control, automatic fare collection systems and platform doors.

The integrated power supply system adopts the centralized (1 + 1) parallel double-bus mode, so it is equipped with two sets of independent UPS systems, which are completely independent from the UPS host to the general distribution cabinet, then to the distribution cabinet, and finally to the load end. Two sets of independent power supply are provided to weak current equipment with a dual power supply. For equipment with a single power supply, an STS static switch is used, which can also realize the function of a dual independent power supply. One three-phase total power meter is installed in the general distribution cabinet as a third-level meter for station energy measurement to realize the energy consumption measurement of the whole weak current system; branch power supply circuits are separated according to the equipment installation area, and a single electrical energy meter is installed as the fourth-level meter of station energy measurement in order to obtain itemized measurement according to the section and equipment.

The power network management terminal of the control center realizes the centralized monitoring and management of the operation information, status and energy consumption of the power system equipment of the whole line, including UPS, battery pack and general distribution cabinet. The operation status information, fault information and alarm information of the power supply equipment of each station are sent to the control center for unified processing through the transmission system. The network communication between the power supply system and the transmission system adopts the Ethernet ring network mode to ensure that the whole network will not be affected by the fault of single-point network equipment. After the UPS, battery pack and general distribution cabinet at each station are networked, they are uniformly uploaded to the platform transmission system through the Ethernet port of the general distribution cabinet.

4.2 Design of backup power supply for the stations' weak current system

The integrated power supply system of the station is integrated according to the integrated monitoring system (including environment and equipment monitoring, automatic fire alarm system and access control system), special communication (including telephones, clocks, CCTV, PIS, PA, transmission and other systems), automatic fare collection (including gates, automatic ticket vending machines, ceiling guidance signs and other equipment), and office automation (including office computers and other equipment in the ticket office and station master's room), and the nature of the power consumption and power demand of depot/parking lot security is designed and calculated to select the UPS and battery capacity.

According to the integration scope, the power load of the electrical equipment for all systems in the station is calculated. The specific statistics are shown in Table 6.

Table 6 Load statistics of Jinchuan road station integration system

According to the requirements of Uninterruptible Power Systems for Communication (UPS) (YD/T 1095—2008), the formula for calculating the output active power of the UPS equipment is shown in Eq. 5.

$${\text{Output active power }} = {\text{ Rated capacity }} \times \, 0.{\text{7 kW}}/{\text{kVA}}$$
(5)

Therefore, according to the calculation of the above company, the rated capacity of the configured UPS is 54.652/0.7 kW/kVA = 78.036 kVA. Combined with the selection of product specifications and models, the rated capacity of the UPS selected for the station is 80 kVA.

4.3 Design of station integrated power meters

The UPS system of the station machine room mainly supplies power for regular loads such as intelligent control all-in-one machines, intelligent control units, switches and terminal operation display screens, and impact loads such as access control and fare collection machines. Each station will design different distribution boxes and power supply circuits according to the quantity, type and installation area of electrical equipment.

Based on the quantity and power of electric equipment in the station, each station of Kunming Line 4 is equipped with two UPS (80 kVA), two isolation transformers (80 kVA), two sets of battery packs (64 for each set, 128 in total, 77AH for each battery), two sets of battery monitoring systems, two sets of general distribution cabinets, four sets of sub-distribution cabinets (12 STS are set in each distribution cabinet) and one power network management unit.

An energy meter is set in the general distribution cabinet, and a general meter is set in each general distribution cabinet, which belong to level 3 measurement. Other meters are set to measure the power consumption of different equipment and regions, which belong to level 4 measurement. The power circuit and meter distribution are shown in Tables 7, 8, 9 and 10.

Table 7 System diagram of distribution cabinet #1
Table 8 System diagram of distribution cabinet #2
Table 9 System diagram of distribution cabinet #3
Table 10 System diagram of 4# distribution cabinet

After the power supply circuit and meter are designed in combination with the power supply equipment and power, the station UPS system is designed. The station integrated power monitoring system collects the operation and status data of the equipment to the intelligent control integrated machine through the PLC collector set in the station power distribution cabinet, and then transmits it to the central power monitoring system through the transmission network. The specific structural diagram is shown in Fig. 10.

Fig. 10
figure 10

Structure diagram of the station integrated power supply monitoring system

In the specific electrical design process, the three-phase watt-hour meter monitoring of the power parameters of the incoming circuit in the main distribution cabinet is connected to the terminals of the RS485 expansion card of PLC1 by a MODBUS protocol through its own RS485 interface. The feeder shunt watt-hour meter of the main distribution cabinet also has an RS485 interface, and 21 shunt watt-hour meters are also connected to the bus of the RS485 expansion card of PLC1. The wiring diagrams of the two main distribution cabinets are shown in Figs. 11, 12 and 13. The wiring diagrams of all electric meters are shown in Figs. 12 and 14.

Fig. 11
figure 11

Wiring diagram of general cabinet 1 of the Jinchuan Road station

Fig. 12
figure 12

Electric meter wiring diagram of main distribution cabinet 1 of the Jinchuan Road station

Fig. 13
figure 13

Wiring diagram of general cabinet 2 of Jinchuan Road station

Fig. 14
figure 14

Wiring diagram of electric meter in general distribution cabinet 2 of the Jinchuan Road station

4.4 Integrated power supply monitoring and measurement data acquisition

In the design scheme of the metro weak current power system, in order to realize remote centralized monitoring of all power equipment, all equipment has an intelligent communication interface through which the running status of power equipment, status information, fault information, alarm information and energy metering information for power equipment in each station can be collected in real time.

Specifically, PLCs are used in two independent main distribution cabinets of each station or parking lot to realize the collection and processing of various data, and the measured data and status are displayed through a 7-inch color touchscreen. The operating parameters of voltage, current, frequency, power factor, active power, reactive power and power degree of the input circuit and each output circuit can be accurately measured by configuring a high-precision watt-hour meter for each circuit, and the state data of each switch state, arrester state and time-sharing power-down state can be monitored at the same time.

In the monitoring system of the distribution cabinet, historical curves and alarm data can be recorded, and real-time alarms can be given for various abnormal phenomena. In the central power supply monitoring system, various real-time data, historical curves and alarm data of the station can be recorded, and real-time alarms can be issued for various abnormal phenomena. The specific system interface is shown in Fig. 15.

Fig. 15
figure 15

Centralized monitoring diagram of Tami station

4.5 Energy consumption data acquisition and prediction

The integrated power system platform collects various energy consumption data in real time and pushes the data to the big data platform. The big data platform analyzes and learns the energy consumption data of each station. The specific daily energy consumption of the station from October 2020 to March 2021 is shown in Figs. 16 and 17.

Fig. 16
figure 16

Measurement data change diagram of Jinchuan Road station

Fig. 17
figure 17

Diagram of power variation with time at Wujiaying station

Figures 16 and 17 show that with the change in time, the change trend of electrical energy is relatively stable, and the power consumption is approximately a fixed value with little change. In the process of energy management, energy measurement is the basis of formulating scientific and reasonable energy consumption limit standards [41]. Reasonable energy consumption quota should be based on energy measurement data. In order to ensure the accuracy and validity of the measurement data, through the accumulation and value mining of energy measurement data, the energy consumption quota of energy-consuming equipment is given by a machine learning method. The energy consumption quota method based on machine learning involves taking a large number of statistical samples as the object, obtaining the energy consumption quota table by establishing a machine learning model, and then selecting the appropriate quota level to determine the energy consumption quota value according to the actual situation [42]. By analyzing the energy consumption data for each station along Kunming Metro Line 4, the average and median energy consumption of each station are obtained, as shown in Figs. 18 and19.

Fig. 18
figure 18

Distribution of average energy consumption of each station

Fig. 19
figure 19

Median distribution of energy consumption of each station

Through the data analysis of each station, it can be seen that the energy consumption of different stations varies greatly. Different stations need to be classified separately, and the station energy consumption quota cannot be formulated in a one-size-fits-all and average way, which would lead to a lack of fairness and rationality of the quota.

After the overall analysis of the station weak current system, the energy consumption quota data are given using the Boruta + PCA + XGBoost algorithm, and the results are shown in Fig. 20.

Fig. 20
figure 20

Energy consumption trend and forecast of Jinchuan Road station

Figure 20 shows that the Boruta + PCA + XGBoost algorithm can better predict the energy consumption data for Jinchuan Road station. The specific results are shown in Table 11.

Table 11 Energy consumption prediction results table

4.6 Energy consumption data monitoring and early warning

In addition to energy consumption determination through big data analysis and machine learning, the integrated power system platform also analyzes the operation of the equipment and the measurement data for each itemized meter. It is found that some data deviate from the actual energy consumption design, such as the data of the PJ17 meter in the distribution cabinet, which measures energy consumption data 24 hours a day for four gates, one ticket machine, the ceiling guide sign and one switch equipment, as shown in Fig. 21.

Fig. 21
figure 21

24-hour energy consumption distribution diagram of PJ17 meter

According to Fig. 21, in combination with the operation of the equipment, the equipment is generally in normal operation in the station, and can be in standby mode when it is shut down at night to reduce energy consumption. Through data monitoring, it is found that the energy consumption of the equipment during shutdown and no shutdown are basically the same. When the normal gate and ticket machine are shut down, the equipment will be in energy-saving and standby mode. For the gate, the power consumption is 150 W in the normal operation mode and 50 W in the standby mode. In this way, the energy-saving operation can be realized by monitoring to ensure that the field equipment complies with the design. The ideal energy consumption curve is shown in Fig. 22 if the energy-saving operation of the gate equipment is in standby mode during shutdown.

Fig. 22
figure 22

Ideal energy consumption distribution of PJ17 meter for 24 h

Through the real-time monitoring of the system and the analysis of energy consumption measurement data, the unreasonable phenomena in the energy consumption link are found and reported to the energy consumption management unit in real time. Now, the weak current maintenance center has checked the equipment-setting problems and asked the equipment manufacturer to put forward solutions. The energy-saving ratio is calculated based on "energy-saving ratio = energy-saving value of starting standby mode after hypothetical shutdown/energy consumption value of not enabling standby mode".

Finally, through calculation, if the standby mode transformation is started after the gate is stopped, gate energy consumption savings of 27.59% can be achieved.

According to the actual power consumption and energy consumption of each station, an energy consumption quota can be selected. With the energy consumption quota, the weak current system of the station is monitored according to the approved energy consumption quota. At the same time, we should establish and improve the quota management, reward and punishment system, and finally achieve energy conservation and consumption reduction.

Therefore, the metro company must strengthen the use of energy measurement data, must integrate, analyze and record the data measured by professional energy measuring instruments, and carry out reasonable and effective management of the measurement results in order to guide the production links, promptly detect problems and mistakes, strengthen the potential of energy conservation and reduced consumption, and maximize the interests of the enterprise [42].

5 Conclusion

Energy consumption in metro system has become a serious burden for metro operation companies. To achieve energy saving on metro weak current system, we conducted an in-depth research and analysis on metro energy load classification and energy management, focusing in particular on the design and usage of power supply systems for metro weak current electromechanical systems. To make it clear how the energy consumption among the weak current system, based on the characteristics of the power supply system of the metro weak current system, we propose a series of measurements to monitor and predict energy consumption of the weak current system. In Kunming Metro Line 4, a comprehensive power supply design of the weak current system is carried out to reduce the number of UPS and improve the UPS utilization rate. In the distribution cabinet, different meters are set for the loop according to different areas and equipment conditions to meet the requirements of graded measurement of metro energy consumption. At the same time, the station energy consumption is predicted through the Boruta +PCA +XGBoost algorithm, and a reasonable energy consumption quota is given to ensure that "data originates from measurement and management depends on data".

By the application of graded metering of station power consumption, the station energy consumption is predicted, and the supervision of equipment energy consumption is realized. Based on the overall integration of cloud transportation automation systems, the energy saving of Kunming Line 4 is 5% higher than that of the existing line. In the future, we will promote the installation of meters in energy-consuming equipment and facilities such as other weak current systems and electromechanical systems to realize hierarchical measurement, improve the energy measurement network and measurement data traceability system of metro stations and the company, and then extend it to other major energy systems. At the same time, the system will conduct machine learning on the data in combination with the big data platform to realize energy consumption prediction and provide a scientific and reasonable energy consumption quota for stations and promote the use of the energy consumption quota, in order to facilitate the rational energy consumption of stations and achieve the goal of energy conservation and emission reduction.