Abstract
Background
Early-life respiratory infections and asthma are major health burdens during childhood. Markers predicting an increased risk for early-life respiratory diseases are sparse. Here, we identified the predictive value of ultrasound-monitored fetal lung growth for the risk of early-life respiratory infections and asthma.
Methods
Fetal lung size was serially assessed at standardized time points by transabdominal ultrasound in pregnant women participating in a pregnancy cohort. Correlations between fetal lung growth and respiratory infections in infancy or early-onset asthma at five years were examined. Machine-learning models relying on extreme gradient boosting regressor or classifier algorithms were developed to predict respiratory infection or asthma risk based on fetal lung growth. For model development and validation, study participants were randomly divided into a training and a testing group, respectively, by the employed algorithm.
Results
Enhanced fetal lung growth throughout pregnancy predicted a lower early-life respiratory infection risk. Male sex was associated with a higher risk for respiratory infections in infancy. Fetal lung growth could also predict the risk of asthma at five years of age. We designed three machine-learning models to predict the risk and number of infections in infancy as well as the risk of early-onset asthma. The models’ R2 values were 0.92, 0.90 and 0.93, respectively, underscoring a high accuracy and agreement between the actual and predicted values. Influential variables included known risk factors and novel predictors, such as ultrasound-monitored fetal lung growth.
Conclusion
Sonographic monitoring of fetal lung growth allows to predict the risk for early-life respiratory infections and asthma.
Graphical abstract
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Respiratory diseases such as infections and allergic asthma are major causes of morbidity and mortality in neonates and children [1,2,3]. Acute respiratory tract infections, especially those involving the upper airways, are the most common illnesses in young children [4, 5]. Given that they are usually treated in outpatient settings, the exact incidence of upper respiratory tract infections is often hard to determine, and most epidemiological studies provide information collected in hospital settings and thus refer to the typically more severe lower respiratory tract infections [6,7,8]. According to the Global Burden of Diseases, Injury and Risk Factors Study 2015, lower respiratory tract infections are the third leading cause of death in children younger than 5 years in 195 countries worldwide, resulting in 12.1% of deaths in this population [1]. Epidemiological data highlight the predominance of viruses in childhood respiratory tract infections. Specifically, 90% of upper respiratory tract infections are of viral origin, with the most common pathogens being rhino- and adenovirus [4]. Lower respiratory tract infections are attributed to viruses in approximately 50% of cases, with respiratory syncytial virus (RSV), adenovirus, metapneumovirus, influenza, and parainfluenza most frequently causing the disease [4, 9, 10].
Similarly, asthma is the most common non-communicable disease in children [11, 12], with an uprecedented worldwide incidence of nearly 22 million childhood cases in 2019 [13]. The burden of childhood asthma also becomes evident from the rate of morbidity and mortality, e.g., 12,900 children died from asthma, and 5.1 million disability-adjusted life years were associated with asthma in 2019 worldwide [13]. Epidemiological studies not only highlight the increasing incidence of these diseases worldwide but also demonstrate a considerable association between early-life infections and the increased risk of subsequently developing childhood asthma [14,15,16,17,18].
Fetal and early-life lung development pave the way for lung function and pathology later in life. Lung development is a delicate process consisting of distinct pre-, peri- and postnatal events that determine lung function throughout life [19]. An underdeveloped lung structure and function, e.g., seen in premature-born children, increases the risk for respiratory diseases, such as early-life infections and wheezing disorders during childhood [20, 21].
Despite these intriguing epidemiological and developmental insights, studies monitoring features of fetal lung growth and its association with postnatal respiratory health are missing. To close this critical gap in knowledge, we took advantage of the availability of highly granular data from a prospectively designed observational pregnancy study, focusing mainly on uncomplicated pregnancies and term-born offspring. Here, fetal lung development could be evaluated using scans from serially acquired ultrasound examinations throughout gestation and subsequently linked to respiratory health or diseases recorded during infancy and childhood. In our study, we combined serial monitoring of fetal lung growth trajectories with a machine-learning approach and developed accurate models to identify children at risk for respiratory diseases.
Methods
Study design
The present work was conducted within the Prenatal Identification of Children’s Health (PRINCE) study. The PRINCE study is a prospective longitudinal pregnancy cohort located at the University Medical Center Hamburg-Eppendorf, which started in 2011 and focuses on the impact of prenatal challenges on children’s health. The inclusion criteria for pregnant women to enrol were an age ≥ 18 years and a viable singleton pregnancy of 12–14 weeks of gestation. Exclusion criteria included chronic infections (human immunodeficiency virus, hepatitis B or C), known drug or alcohol abuse, multiple pregnancies or pregnancies resulting from assisted reproductive technology. Pregnancy progression, health status, medication, stress perception, and anthropometric data of the mother were documented during study visits between 12 and 14, 24 and 26, and 34 and 36 weeks of gestation. Transabdominal ultrasound examinations were also performed at these study visits. At a gestational age of 27–29 weeks, study participants were offered the opportunity for an additional ultrasound assessment as part of their prenatal visit at our hospital to register for giving birth.
At birth, anthropometric indices of the newborn were obtained. At the age of twelve months (infancy), information on the occurrence of upper and lower respiratory tract infections was obtained by standardized parental questionnaires and independently confirmed by the routine childhood screenings of the child’s pediatrician. Specifically, the documented infections were common cold, pneumonia, tonsillitis, bronchitis and croup. The health status of the children was followed up annually between the ages of two and four years. At the age of five years, a study visit of the child was performed by a trained pediatrician (Fig. 1a). From the mother/child pairs that had participated in pre- and postnatal study visits of the study until the children’s age of five years by March 2021 (n = 195), prenatal and postnatal datasets were available for 177 mother/child pairs. From these, 75% (n = 133) of the total available pairs were randomly selected by the employed algorithm and included in the current analysis as a training group to develop (train) three models to predict the exact number and risk of respiratory infections in infancy as well as the risk for early-life asthma manifestation on the basis of the ultrasound-monitored fetal lung growth trajectory, while the remaining independent participant group (n = 44; 25% of total pairs) served as a testing group to conclusively validate the predictive value of the developed models (Fig. 1b). Mother/child pairs included in either cohort are referred to as study participants throughout the manuscript.
Study approval
The study protocol of the PRINCE study was approved by the Ethics Committee of the Hamburg Chamber of Physicians under the registration number PV3694 and performed in compliance with the Declaration of Helsinki for Medical Research involving Human Subjects. Informed consent forms to participate in the study were signed by all participants (or their parent or legal guardian in the case of children under 16).
Ultrasound measurements
Ultrasound examinations and measurements were performed using a Voluson E8 (General Electric; GE) equipped with a transabdominal 3–5 MHz transducer (RAB 6D, GE). Measurements were conducted by two trained clinicians with certified advanced ultrasound expertise. A routine standard sonographic study, including fetal biometry and anomaly screening, was always included. Based on the study design, estimated fetal weight was calculated using the Warsof formula in the first trimester and the Hadlock IV formula at the other two time points [22, 23]. The lung area was calculated by subtracting the heart area from the thorax at the four-chamber level and averaging three replicate values at each time point [24]. Ultrasound examinations that precluded fetal lung measurements due to fetal position or fetal movements affected approximately 11% of all cases and were excluded.
Infection and asthma risk classification
Based on the number of respiratory infections in infancy, two groups with distinct infection risks could be identified. Specifically, children who exhibited five or fewer respiratory infections in the first year of life were identified as being at low infection risk, whereas a high risk for infection characterized children with a history of at least six respiratory infections in infancy [25]. The classification of the risk for early-onset asthma was based on clinical information and examination performed by a pediatrician at the age of five years. In addition to the clinical examination, the physician employed standardized clinical information and criteria that have been developed and universally applied by several population-based birth cohort studies focusing on asthma and have been validated in the International Study of Asthma and Allergies in Childhood project worldwide [26, 27]. Specifically, the current existence of an asthmatic phenotype was affirmative upon fulfilment of two out of the three following criteria: (1) pre-existing asthma diagnosis by a physician; (2) any asthma-related symptom within the last 12 months, including wheezing, dry cough at night and shortness of breath; and (3) use of asthma medication within the last 12 months [27]. As a complementary approach, a second classification system based on guidelines that have been established and validated by German health authorities with the special aim of identifying two- to five-year-old children with early signs of an asthmatic predisposition was also used [28]. Specifically, a high risk for early-onset asthma development was present in children who exhibited three asthma-typical episodes in the last 12 months that responded well to asthma medication and satisfied at least one of the following criteria: (1) existence of a parent or sibling suffering from asthma; (2) hospitalization due to obstructive respiratory symptoms; (3) clinical evidence of sensitization; and (4) wheezing without an underlying respiratory infection. Children with a positive scoring outcome in either one scoring approach were considered to be at high risk for early-onset asthma development.
Statistical analysis
General statistics
Study sample characteristics, including demographics and ultrasound parameters, are presented as the mean ± standard deviation. Comparison of the maternal and neonatal demographic parameters between the training and testing cohorts was performed by using the Chi-squared test or the Mann‒Whitney U test with a significance level of P < 0.05. Comparison of the number of infections per child between sexes was performed using the Mann‒Whitney U test. The data shown represent the mean ± standard error of the mean. The respective analysis was conducted, and plots were created with GraphPad Prism, version 8.0 (GraphPad Software, Inc., La Jolla, California), R version 4.1.2 and Python scripting language version 3.8.
Missing fetal lung growth values and design of the fetal lung growth trajectory
To design the trajectory of fetal lung growth covering the entire prenatal period of interest for all male and female fetuses, values for common standardized time points, namely, gestational weeks 12, 23, 28 and 35, for all children were needed. For this reason, missing values for any standardized time points, e.g., ultrasound examination conditions that did not allow fetal lung area assessment (as mentioned above), were imputed using Poisson regression based on the available fetal lung area measurements for each fetus. The imputed data and related fetal lung growth trajectories were used for the calculation of the area under the curve by applying the linear trapezoidal rule [29]. The estimated area under the curve was subsequently used as a representative summarizing indicator for each child’s fetal lung growth trajectory for further prediction analysis and risk assessment and is referred to as the “lung growth index” throughout the manuscript.
Regression analysis
A Poisson regression model with log as the link function was used to analyze the influence of fetal growth and fetal lung growth index, maternal age at birth, maternal smoking during pregnancy, maternal body mass index (BMI) in the first trimester, mean maternal stress perception during pregnancy, gestational age and weight of the offspring at birth, child sex, and the presence of older siblings on the number of respiratory infections during the first year of life [30]. This model was further adjusted by using the natural logarithm of the time interval in gestational weeks between the minimum and the maximum available fetal lung growth value for each child as an offset.
Machine learning approach to predict the number of respiratory infections in infants
To predict the total number of respiratory infections in the first year of life (model I), machine learning modeling was used. The lung growth index during pregnancy, gestational age and weight of the offspring at birth, child sex, maternal first trimester BMI, and age at birth were the included input data features. Four types of regression algorithms, namely, K-nearest neighbors regressor (KNNR) [31], random forest regressor (RFR) [32], gradient boosting regressor (GBR) [33], and extreme gradient boosting regressor (XGBR) [34], were implemented to determine the optimal strategy and prediction model. All regressor models were evaluated on our dataset using tenfold cross validation, based on the following metrics [35]: (1) R-squared (R2), the correlation coefficient representing how well the model fits, i.e., the closer this value is to 1, the more perfectly the model performs; (2) mean squared error, demonstrating the difference between the actual observations and the observation values predicted by the model; (3) root mean squared error, measuring the average difference between predicted by the model and actual values; and (4) mean absolute error (MAE), the absolute difference between the actual value and the one predicted by the model, i.e., the lower the MAE, the better the model. To determine the impact of each input feature on the prediction, Shapley additive explanations (SHAP) values were used [36].
Machine learning approach to predict the risk for respiratory infections in infancy or asthma manifestation in early childhood
Machine learning approaches were again employed to develop an additional model predicting the risk (low or high) for respiratory infections in infancy (model II) and a third model predicting the risk for early-onset childhood asthma development (model III). Similar to model I mentioned above, the input data features included the lung growth index during pregnancy, gestational age and weight of the offspring at birth, child sex, maternal first trimester BMI, and age at birth. In this case, and due to our experience with the abovementioned modeling strategy, the algorithm that we used for prediction was the XGBC [34]. The confusion matrix, accuracy, precision, recall (also known as true positive rate or sensitivity), f1 score, and the receiver operating characteristic (ROC) curve along with the area under the ROC curve were used to evaluate the diagnostic performance of each model [37, 38].
Results
Study participant demographics and characteristics
The demographics and characteristics of the study participants included in the training and testing groups are shown in Table 1. Apart from parity, no significant differences were present between the training and testing groups with regard to demographic, anthropometric, and educational parameters in mothers or neonates [39]. This includes parameters such as advanced maternal age (≥ 35 years) [40] and grand multiparity (≥ 5 pregnancies resulting in viable offspring) [41] in the training and testing groups. According to the exclusion criteria described earlier, multiple pregnancies or pregnancies resulting from assisted reproductive technology were excluded from the study. Among the 177 children included in the training and testing groups, one child was diagnosed with intrauterine growth restriction (IUGR). In this child, the lung growth was indeed below average (data not shown), as expected, but exclusion did not affect the overall outcome, and it was thus included in the study. Along this line, we wish to highlight that our aim was to predict the risk for childhood infection and asthma based on lung growth, rather than focusing on such risk in distinct subgroups, such as children born upon pregnancy complications or IUGR. Fetal ultrasound assessment and lung area measurement were performed at three main time points during pregnancy (Fig. 1c). To assess the quality and accuracy of the obtained lung ultrasound measurements, the intraclass correlation coefficient (“one-way” model; “agreement” type) was calculated in samples of the training group and revealed excellent agreement for ultrasound assessments at all time points of interest between two independent blinded observers (Supplementary Fig. 1a). Mean ultrasound parameters throughout pregnancy for the training and testing groups are shown in Supplementary Tables 1 and 2, respectively. To design the fetal lung growth trajectory throughout pregnancy based on standardized time points, data imputation was performed. Importantly, the calculation of Pearson’s correlation coefficient showed excellent agreement between the actual and predicted values for the fetal lung area (Supplementary Fig. 1b). Subsequently, fetal lung growth trajectories were designed, and their progress was quantified based on the estimated area under the curve, here referred to as the lung growth index (Fig. 1d and e). To exclude potential aberrations in fetal growth, trajectories depicting the fetal growth course were also designed based on the estimated fetal weight at the three main time points. In this case, the estimated area under the curve served as an indicator of fetal growth throughout pregnancy (Supplementary Fig. 1c).
Respiratory infections in infancy and asthma manifestation in childhood
During the first year, the common cold was the most frequent respiratory infection in both boys and girls (Fig. 2a; Supplementary Tables 3 and 4). Among the 133 children included in the training group, five (3.76%) children exhibited no respiratory infections in the first year of life, while the majority (24%) of children suffered from three respiratory infections during this time. In general, most children were classified as being at low infection risk, while the minority exhibited more than six respiratory infections and were characterized by a high infection risk. The highest documented number of respiratory infections until the first birthday was twelve (Fig. 2b). Although the total number of infections did not differ between boys and girls, the mean number of infections in boys was significantly higher than that in girls (Fig. 2c). Notably, testing for specific pathogens causing a respiratory infection is not routinely performed in our country (and many other countries), unless the course of the infection is very severe and the clinical symptoms require additional attention, such as hospitalization. Hence, among the total number of children included in the training and testing groups (n = 177), polymerase chain reaction-based testing of the pathogen causing the respiratory infection was available only in five children, four of which tested positive for RSV infection and one child for influenza A virus. Regarding early-onset asthma, a positive asthma risk classification was identified in 12.23% of all participating children at the age of 5 years, based on at least one of the applied scoring systems, with boys and girls being similarly affected (Fig. 2d).
Fetal lung growth as a predictor for early-life respiratory morbidities
Using a Poisson regression model, we next aimed to identify risk predictors for respiratory infections in infancy. Among all examined factors, fetal lung growth, here evident as the lung growth index, as well as the sex of the offspring were found to significantly affect the risk for early-life respiratory infections. Importantly, an enhanced progress of fetal lung growth, specifically a one-point increase in the lung growth index, would result in a decrease in the respiratory infection risk ratio by a factor of 0.78, while holding all other variables in the model constant (Fig. 2e; Table 2). Additionally, male sex was identified as an independent risk factor for early-life respiratory infections if all other parameters in the model were constant (Table 2), meaning also that, among children with a similar lung growth index, boys are at higher risk for suffering from frequent respiratory infections in infancy.
Using machine learning to predict susceptibility to respiratory infections in infancy
After identifying fetal lung growth as a pivotal predictor for the early-life risk for respiratory infections, we next used machine learning to design a prediction model allowing for early recognition of susceptible individuals (model I). Apart from ultrasound-monitored fetal lung growth during pregnancy, the abovementioned confounding factors were again included. To predict the exact number of respiratory infections in infancy, we used the KNNR, RFR, GBR and XGBR algorithms to develop models based on the training group (Supplementary Fig. 2). To conclusively validate the performance of the developed models, a prediction of the number of respiratory infections within the independent testing group was subsequently performed. XGBR was identified as the most accurate model (R2 = 0.92) (Table 3; Fig. 3a and b), while RFR, GBR and KNNR showed an average score of approximately 0.67 (Table 3; Supplementary Figs. 2 and 3). XGBR was also characterized by low error values. The lung growth index also had the highest absolute SHAP value of all input data features and was thus identified as the most important predictor for the selected XGBR model (Fig. 3c). Importantly, based on the generated SHAP dot plot charts, male sex (here in red) was associated with a higher number of early-life respiratory infections (Fig. 3c).
As a next step, we developed a second independent prediction model with the ability to pinpoint children with a low or high risk for respiratory infections in infancy as early as birth (model II). Specifically, we again employed a machine learning approach for binary logistic regression based on the XGBC algorithm and the same training and testing groups (Fig. 4a). The XGBC achieved an accuracy of 0.90, a precision of 0.92, a recall of 0.90, and a f1-score of 0.91, with an area of the ROC curve of 0.91 (Fig. 4b and c). In this case, the most important predictors for the designed XGBC model were the child’s sex and the lung growth index, which had the highest absolute SHAP values of all input parameters (Fig. 4d). Similar to model I predicting the exact number of respiratory infections, male sex was again identified as a key predictor for a high infection risk in infancy (Fig. 4d).
Using machine learning to predict the risk for early-onset childhood asthma
To identify young children at high risk for early-onset asthma development, the XGBC algorithm for logistic regression as well as our training and testing groups were again used, and model III was developed (Fig. 5a). After training on our dataset, XGBC model was able to predict the risk for asthma and underwent evaluation based on its performance on our testing cohort. Specifically, the XGBC was characterized by an accuracy of 0.93, a precision of 0.94, a recall of 0.93, and a f1-score of 0.93, with an area of the ROC curve of 0.93 (Fig. 5b and c). As seen in the case of respiratory infections, the lung growth index exhibited the highest absolute SHAP value and was again identified as the strongest contributing factor to the predictive performance of the model. However, based on the SHAP dot plot charts (Fig. 5d), the lung growth index alone was not associated with a higher or lower asthma risk but could facilitate prediction as part of the whole developed model. Apart from the lung growth index, maternal age also highly contributed to the prediction performance of the model (Fig. 5d). Of note, based on the SHAP dot plot charts, younger maternal age at birth was linked with a higher risk for early-onset asthma manifestation in the offspring (Fig. 5d).
Discussion
Here, we identified fetal lung growth as a significant predictor of the risk for early-life respiratory infections and early-onset childhood asthma. Using machine learning, we developed models I and II, which allow for the accurate prediction of the risk and number, respectively, of respiratory infections early in life. Similarly, model III was also developed to identify children at high risk for early-onset asthma.
These prediction models are of high clinical relevance. Fetal lung development paves the way for healthy lung function but also lung pathologies later in life. The delicate trajectory of lung development can be easily disrupted, e.g., by prenatal adversities. A wealth of evidence underpins that prenatal exposure to environmental factors, including high levels of maternal psychological stress, smoking, and infections, may interfere with fetal lung development and subsequently increase the risk for respiratory morbidities later in life [42,43,44,45]. Indeed, the increased risk for early-life respiratory infections, wheezing disorders, or asthma in childhood could be associated with an abnormally developed lung structure and function [20, 21, 46, 47]. The clinical evidence of this association becomes evident from the increasing incidence of these pulmonary diseases faced by our society. Early-life infections and childhood asthma are risk factors for chronic obstructive pulmonary disease (COPD) later in life, which will perpetuate such a burden [48, 49]. Since it is known that fetal lung growth and development shape postnatal respiratory health and disease [46, 47, 50], our findings will now facilitate the identification of the underlying pathogenesis [21, 47, 51]. In fact, the key mechanistic trigger of airway diseases is, in most cases, dysfunction of the respiratory epithelial barrier, as shown in preclinical models or via lung biopsies in humans [47, 52,53,54]. Although imaging technologies such as ultrasound are still restricted in providing insights into such disrupted epithelial defense barriers, the fetal lung growth trajectories we evaluated here may serve as a proxy for potentially underlying pathologies and developmental deficits.
Our current findings close a pivotal gap in knowledge, as non-invasive methods for serial fetal lung growth assessment during pregnancy were missing. To date, fetal lung maturation could only be evaluated at a specific gestational time point using invasive amniocentesis to test for markers, such as lecithin and sphingomyelin [55, 56], or a generalized, comparative assessment of lung, liver or placental parenchyma development [57]. However, these methods are not fully suitable to serially monitor fetal lung growth throughout pregnancy due to their invasiveness or lack of specificity. Notably, quantitative texture analysis of fetal lung ultrasound pictures has recently been proposed to provide a glimpse into fetal lung maturity. However, this approach is solely based on a single time point and only provides risk estimation for respiratory distress syndrome immediately after birth [58, 59]. Thus, our model now advances such first attempts, as we integrate not only three fetal time points but also two childhood disease entities.
Not surprisingly, male sex was independently associated with an increased risk for early-life respiratory infections in our study, as also described in a number of other studies [45, 60], which can be attributed to sex-specific immune as well as anatomic differences of the respiratory tract [45]. Of note, functional sex-specific differences in fetal lung maturation and especially perinatal lung transition with timely and sufficient surfactant production account for the increased risk for pulmonary complications in term and preterm male infants, the so-called “male disadvantage” [61, 62]. In our study, no sex-specific differences in the fetal lung trajectories could be detected, a finding that does not rule out an underlying functional and molecular sexual dimorphism that cannot be captured by ultrasound.
Epidemiological studies highlight an association between IUGR, preterm birth or low birth weight with an increased risk for respiratory diseases such as early-life respiratory infections and asthma in childhood [63]. Interestingly, no significant impact of fetal growth, gestational age or weight at birth on the early-life risk for respiratory infections or asthma was found here. This observation may be attributed to the homogenously low-risk character of the cohort, since included pregnancies lacked complications, in the vast majority resulted in term deliveries, while only one was associated with IUGR. Of note, since our aim was to develop universally applicable prenatal prediction approaches for the postnatal risk for respiratory diseases, pregnancies resulting in preterm birth were not excluded from the study. Similarly, maternal smoking or increased stress perception during pregnancy were not considered exclusion criteria but were taken into account in our analysis. No correlation among fetal lung volume, early-life respiratory infections and risk for childhood asthma could be identified in our setting. This finding may be due to the relatively small sample size and the inclusion of all respiratory infections, regardless of their severity, in the current work compared to previous observational studies, which mostly take severe infections into account [17, 64, 65]. In agreement with previously published evidence [66,67,68], we could also demonstrate that maternal age independently influences the risk for asthma development in early childhood. Specifically, increasing maternal age could be linked with a lower risk for early-onset asthma. Indeed, several population-based studies have identified younger maternal age as a risk factor for childhood asthma and associated maternal aging with improved lung function and a lower risk for asthma manifestation in offspring [66, 67].
The finding of a birth weight- and gestational age-independent association of the fetal lung growth trajectory with the risk for respiratory immune diseases in early childhood strengthens its importance as a predictor and determinant of postnatal health and highlights the urgent need to unravel the external and internal factors that may disrupt the developmental process. Such challenges may include, among others, increased maternal stress perception, smoking, inflammation and medication [69, 70]. Future studies focusing on the impact of prenatal adversities on human fetal lung growth are needed.
Using the state-of-the-art approach of machine learning, we were able to foresee critical health burdens of the offspring long before their manifestation. Specifically, we developed three novel highly accurate models, with model I predicting the exact number of early-life respiratory infections, model II predicting the risk of early-life respiratory infections, and model III predicting the risk of early-onset childhood asthma, mainly based on the lung growth index as the most potent predictor. The number of infections is a continuous outcome variable (numerical value). Its prediction constitutes a regression predictive modeling problem; therefore, XGBR was used. On the other hand, the prediction of low or high infection risk and asthma or no asthma (binary values) is a categorical problem between the two classes and thus a binary classification problem. Therefore, XGBC was applied in these cases. Model development was based on a training and a testing group with similar maternal and neonatal characteristics, as well as the number of children with a positive asthma risk classification. Although the distribution of infections differed between the two groups, our model could perform excellently in predicting the infection risk in the testing cohort, an observation suggesting that it can be applied universally. Importantly, both the training and testing groups were low-risk, mostly lacking pregnancy-associated complications or other factors with a known link with an increased infection or asthma risk [71, 72]. Thus, we here acknowledge the respiratory health risk of children mostly resulting from healthy pregnancies and identify poor fetal lung growth as an independent risk factor in this context. If implemented in clinical practice, such models would allow for timely recognition of offspring prone to respiratory immune diseases, thereby justifying the close monitoring and follow-up of these children’s health as well as early application of personalized prevention strategies, such as vaccination regimens and immunoboosting approaches. Avoidance of respiratory pathologies in this time period of continuous postnatal development and increased sensitivity may have long-term benefits for the respiratory health of the individual, including a reduced hospitalization need and lower risk for COPD in adulthood [73, 74].
Our study has some limitations. To address the challenge of missing information, especially in the lower and upper limits, due to missed visits during pregnancy in some cases, we performed data imputation. Importantly, excellent agreement was shown between actual and predicted fetal lung area values for all time points of interest throughout pregnancy. Additionally, it is quite easy to overfit the regression model, and for this reason, we selected a representative number of input parameters for the XGBR model. SHAP values provided more transparency through the report of a list of features with high influence on the outcome. Finally, as a single-center study, the models developed here require external validation in further studies.
In summary, we identified fetal lung growth as an important predictor for poor postnatal respiratory health. Using machine learning, we could recognize children prone to infection or asthma manifestation early based on a serial sonographic assessment of fetal lung growth during pregnancy. All additional input factors are easy to access, which means that these models could be applied in most hospitals. Overall, these models combined with fetal ultrasound may hold the potential to not only improve neonatal, infant and children’s health but also facilitate disease prevention later in life. Lung area measurement during routinely performed ultrasound examinations in pregnancy would allow the calculation of the lung growth index, which could then be used for risk estimation by the models developed here. Such a prenatal non-invasive assessment could pave the way for developing similar algorithms [75] predicting long-term health risks based on prenatal life and thus fundamentally change risk assessment for children’s health.
Data availability
The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.
References
GBD 2015 LRI Collaborators. Estimates of the global, regional, and national morbidity, mortality, and aetiologies of lower respiratory tract infections in 195 countries: a systematic analysis for the Global Burden of Disease Study 2015. Lancet Infect Dis. 2017;17:1133–61.
Dharmage SC, Perret JL, Custovic A. Epidemiology of asthma in children and adults. Front Pediatr. 2019;7:246.
Eder W, Ege MJ, von Mutius E. The asthma epidemic. N Engl J Med. 2006;355:2226–35.
Avendaño Carvajal L, Perret Pérez C. Epidemiology of respiratory infections. Pediatr Respir Dis. 2020:263–72.
Mulholland K. Global burden of acute respiratory infections in children: implications for interventions. Pediatr Pulmonol. 2003;36:469–74.
Chen J, Hu P, Zhou T, Zheng T, Zhou L, Jiang C, et al. Epidemiology and clinical characteristics of acute respiratory tract infections among hospitalized infants and young children in Chengdu, West China, 2009–2014. BMC Pediatr. 2018;18:216.
Mameli C, Picca M, Buzzetti R, Pace ME, Badolato R, Cravidi C, et al. Incidence of acute respiratory infections in preschool children in an outpatient setting before and during Covid-19 pandemic in Lombardy Region, Italy. Ital J Pediatr. 2022;48:18.
Nair H, Simões EA, Rudan I, Gessner BD, Azziz-Baumgartner E, Zhang JSF, et al. Global and regional burden of hospital admissions for severe acute lower respiratory infections in young children in 2010: a systematic analysis. Lancet. 2013;381:1380–90.
Abu-Raya B, Viñeta Paramo M, Reicherz F, Lavoie PM. Why has the epidemiology of RSV changed during the COVID-19 pandemic? EClinicalMedicine. 2023;61:102089.
Li Y, Wang X, Blau DM, Caballero MT, Feikin DR, Gill CJ, et al. Global, regional, and national disease burden estimates of acute lower respiratory infections due to respiratory syncytial virus in children younger than 5 years in 2019: a systematic analysis. Lancet. 2022;399:2047–64.
Asher MI, Rutter CE, Bissell K, Chiang C-Y, El Sony A, Ellwood E, et al. Worldwide trends in the burden of asthma symptoms in school-aged children: Global Asthma Network Phase I cross-sectional study. Lancet. 2021;398:1569–80.
Meghji J, Mortimer K, Agusti A, Allwood BW, Asher I, Bateman ED, et al. Improving lung health in low-income and middle-income countries: from challenges to solutions. Lancet. 2021;397:928–40.
Zhang D, Zheng J. The burden of childhood asthma by age group, 1990–2019: a systematic analysis of Global Burden of Disease 2019 data. Front Pediatr. 2022;10:823399.
Bønnelykke K, Vissing NH, Sevelsted A, Johnston SL, Bisgaard H. Association between respiratory infections in early life and later asthma is independent of virus type. J Allergy Clin Immunol. 2015;136:81–6.e4.
Achten NB, van Rossum AMC, Bacharier LB, Fitzpatrick AM, Hartert TV. Long-term respiratory consequences of early-life respiratory viral infections: a pragmatic approach to fundamental questions. J Allergy Clin Immunol Pract. 2022;10:664–70.
Dumas O, Erkkola R, Bergroth E, Hasegawa K, Mansbach JM, Piedra PA, et al. Severe bronchiolitis profiles and risk of asthma development in Finnish children. J Allergy Clin Immunol. 2022;149:1281–5.e1.
van Meel ER, Mensink-Bout SM, den Dekker HT, Ahluwalia TS, Annesi-Maesano I, Arshad SH, et al. Early-life respiratory tract infections and the risk of school-age lower lung function and asthma: a meta-analysis of 150 000 European children. Eur Respir J. 2022;60:2102395.
Moore HC, Hall GL, de Klerk N. Infant respiratory infections and later respiratory hospitalisation in childhood. Eur Respir J. 2015;46:1334–41.
Schittny JC. Development of the lung. Cell Tissue Res. 2017;367:427–44.
Broughton S, Bhat R, Roberts A, Zuckerman M, Rafferty G, Greenough A. Diminished lung function, RSV infection, and respiratory morbidity in prematurely born infants. Arch Dis Child. 2006;91:26–30.
Stocks J, Hislop A, Sonnappa S. Early lung development: lifelong effect on respiratory health and disease. Lancet Respir Med. 2013;1:728–42.
Hadlock FP, Harrist RB, Sharman RS, Deter RL, Park SK. Estimation of fetal weight with the use of head, body, and femur measurements-a prospective study. Am J Obstet Gynecol. 1985;151:333–7.
Warsof SL, Gohari P, Berkowitz RL, Hobbins JC. The estimation of fetal weight by computer-assisted analysis. Am J Obstet Gynecol. 1977;128:881–92.
Tang Y, Jin XD, Xu L, Deng Y, Chang Z, Li Q, et al. The value of ultrasonography in assessing fetal lung maturity. J Comput Assist Tomogr. 2020;44:328–33.
van der Zalm MM, Uiterwaal CSPM, Wilbrink B, de Jong BM, Verheij TJM, Kimpen JLL, et al. Respiratory pathogens in respiratory tract illnesses during the first year of life: a birth cohort study. Pediatr Infect Dis J. 2009;28:472–6.
Asher MI, Keil U, Anderson HR, Beasley R, Crane J, Martinez F, et al. International Study of Asthma and Allergies in Childhood (ISAAC): rationale and methods. Eur Respir J. 1995;8:483–91.
Gough H, Grabenhenrich L, Reich A, Eckers N, Nitsche O, Schramm D, et al. Allergic multimorbidity of asthma, rhinitis and eczema over 20 years in the German birth cohort MAS. Pediatr Allergy Immunol. 2015;26:431–7.
Lommatzsch M, Criée CP, de Jong CCM, Gappa M, Geßner C, Gerstlauer M, et al. Diagnosis and treatment of asthma: a guideline for respiratory specialists 2023-published by the German Respiratory Society (DGP) e. V. Pneumologie. 2023;77:461–543 (in German)
Abdulhameed AF, Memon QA. An improved trapezoidal rule for numerical integration. J Phys Conf Ser. 2021;2090:012104.
Kianifard F, Gallo PP. Poisson regression analysis in clinical research. J Biopharm Stat. 1995;5:115–29.
Cover T, Hart P. Nearest neighbor pattern classification. IEEE Trans Inf Theory. 1967;13:21–7.
Breiman L. Random forests. Mach Learn. 2001;45:5–32.
Natekin A, Knoll A. Gradient boosting machines, a tutorial. Front Neurorobot. 2013;7:21.
Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, California, USA: Association for Computing Machinery; 2016. p. 785–94.
Chicco D, Warrens MJ, Jurman G. The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Comput Sci. 2021;7:e623.
Lundberg SM, Lee SI. A unified approach to interpreting model predictions. ArXiv. 2017. abs/1705.07874.
Khourdifi Y, Bahaj M. Applying best machine learning algorithms for breast cancer prediction and classification. 2018 International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS). 2018. p. 1–5.
Marne S, Churi S, Marne M. Predicting breast cancer using effective classification with decision tree and K means clustering technique. 2020 International Conference on Emerging Smart Computing and Informatics (ESCI). 2020. p. 39–42.
Albrecht M, Pagenkemper M, Wiessner C, Spohn M, Lütgehetmann M, Jacobsen H, et al. Infant immunity against viral infections is advanced by the placenta-dependent vertical transfer of maternal antibodies. Vaccine. 2022;40:1563–71.
American College of Obstetricians and Gynecologists’ Committee on Clinical Consensus-Obstetrics, Gantt A, Society for Maternal-Fetal Medicine, Metz TD, Kuller JA, Louis JM, et al. Obstetric care consensus #11, pregnancy at age 35 years or older. Am J Obstet Gynecol. 2023;228:B25–40.
Mgaya AH, Massawe SN, Kidanto HL, Mgaya HN. Grand multiparity: is it still a risk in pregnancy? BMC Pregnancy Childbirth. 2013;13:241.
Van Putte-Katier N, Uiterwaal CS, De Jong BM, Kimpen JL, Verheij TJ, Van Der Ent CK, et al. The influence of maternal respiratory infections during pregnancy on infant lung function. Pediatr Pulmonol. 2007;42:945–51.
McEvoy CT, Spindel ER. Pulmonary effects of maternal smoking on the fetus and child: effects on lung development, respiratory morbidities, and life long lung health. Paediatr Respir Rev. 2017;21:27–33.
Meyer KF, Krauss-Etschmann S, Kooistra W, Reinders-Luinge M, Timens W, Kobzik L, et al. Prenatal exposure to tobacco smoke sex dependently influences methylation and mRNA levels of the Igf axis in lungs of mouse offspring. Am J Physiol Lung Cell Mol Physiol. 2017;312:L542–55.
Zazara DE, Arck PC. Developmental origin and sex-specific risk for infections and immune diseases later in life. Semin Immunopathol. 2019;41:137–51.
Zazara DE, Perani CV, Solano ME, Arck PC. Prenatal stress challenge impairs fetal lung development and asthma severity sex-specifically in mice. J Reprod Immunol. 2018;125:100–5.
Zazara DE, Wegmann M, Giannou AD, Hierweger AM, Alawi M, Thiele K, et al. A prenatally disrupted airway epithelium orchestrates the fetal origin of asthma in mice. J Allergy Clin Immunol. 2020;145:1641–54.
Martinez FD. Early-life origins of chronic obstructive pulmonary disease. N Engl J Med. 2016;375:871–8.
Silva GE, Sherrill DL, Guerra S, Barbee RA. Asthma as a risk factor for COPD in a longitudinal study. Chest. 2004;126:59–65.
Shi W, Bellusci S, Warburton D. Lung development and adult lung diseases. Chest. 2007;132:651–6.
Ubags NDJ, Alejandre Alcazar MA, Kallapur SG, Knapp S, Lanone S, Lloyd CM, et al. Early origins of lung disease: towards an interdisciplinary approach. Eur Respir Rev. 2020;29:200191.
Holgate ST. The sentinel role of the airway epithelium in asthma pathogenesis. Immunol Rev. 2011;242:205–19.
Blume C, Swindle EJ, Dennison P, Jayasekera NP, Dudley S, Monk P, et al. Barrier responses of human bronchial epithelial cells to grass pollen exposure. Eur Respir J. 2013;42:87–97.
Swindle EJ, Collins JE, Davies DE. Breakdown in epithelial barrier function in patients with asthma: Identification of novel therapeutic approaches. J Allergy Clin Immunol. 2009;124:23–34.
American College of Obstetricians and Gynecologists. ACOG practice bulletin No. 97: fetal lung maturity. Obstet Gynecol. 2008;112:717–26.
Besnard AE, Wirjosoekarto SA, Broeze KA, Opmeer BC, Mol BW. Lecithin/sphingomyelin ratio and lamellar body count for fetal lung maturity: a meta-analysis. Eur J Obstet Gynecol Reprod Biol. 2013;169:177–83.
Prakash KN, Ramakrishnan AG, Suresh S, Chow TW. Fetal lung maturity analysis using ultrasound image features. IEEE Trans Inf Technol Biomed. 2002;6:38–45.
Bonet-Carne E, Palacio M, Cobo T, Perez-Moreno A, Lopez M, Piraquive JP, et al. Quantitative ultrasound texture analysis of fetal lungs to predict neonatal respiratory morbidity. Ultrasound Obstet Gynecol. 2015;45:427–33.
Palacio M, Bonet-Carne E, Cobo T, Perez-Moreno A, Sabria J, Richter J, et al. Prediction of neonatal respiratory morbidity by quantitative ultrasound lung texture analysis: a multicenter study. Am J Obstet Gynecol. 2017;217:e1–14.
Muenchhoff M, Goulder PJ. Sex differences in pediatric infectious diseases. J Infect Dis. 2014;209(Suppl 3):S120–6.
Anadkat JS, Kuzniewicz MW, Chaudhari BP, Cole FS, Hamvas A. Increased risk for respiratory distress among white, male, late preterm and term infants. J Perinatol. 2012;32:780–5.
Laube M, Thome UH. Y it matters-sex differences in fetal lung development. Biomolecules. 2022;12:437.
Sonnenschein-van der Voort AM, Gaillard R, de Jongste JC, Hofman A, Jaddoe VW, Duijts L. Foetal and infant growth patterns, airway resistance and school-age asthma. Respirology. 2016;21:674–82.
Jackson DJ, Gangnon RE, Evans MD, Roberg KA, Anderson EL, Pappas TE, et al. Wheezing rhinovirus illnesses in early life predict asthma development in high-risk children. Am J Respir Crit Care Med. 2008;178:667–72.
Kusel MM, de Klerk NH, Kebadze T, Vohma V, Holt PG, Johnston SL, et al. Early-life respiratory viral infections, atopic sensitization, and risk of subsequent development of persistent asthma. J Allergy Clin Immunol. 2007;119:1105–10.
Gómez Real F, Burgess JA, Villani S, Dratva J, Heinrich J, Janson C, et al. Maternal age at delivery, lung function and asthma in offspring: a population-based survey. Eur Respir J. 2018;51:1601611.
Laerum BN, Svanes C, Wentzel-Larsen T, Gulsvik A, Torén K, Norrman E, et al. Young maternal age at delivery is associated with asthma in adult offspring. Respir Med. 2007;101:1431–8.
Raby BA, Celedón JC, Litonjua AA, Phipatanakul W, Sredl D, Oken E, et al. Low-normal gestational age as a predictor of asthma at 6 years of age. Pediatrics. 2004;114:e327–32.
Maritz GS, Harding R. Life-long programming implications of exposure to tobacco smoking and nicotine before and soon after birth: evidence for altered lung development. Int J Environ Res Public Health. 2011;8:875–98.
Walsh K, McCormack CA, Webster R, Pinto A, Lee S, Feng T, et al. Maternal prenatal stress phenotypes associate with fetal neurodevelopment and birth outcomes. Proc Natl Acad Sci U S A. 2019;116:23996–4005.
Montgomery S, Bahmanyar S, Brus O, Hussein O, Kosma P, Palme-Kilander C. Respiratory infections in preterm infants and subsequent asthma: a cohort study. BMJ Open. 2013;3:e004034.
Sánchez García L, Calvo C, Casas I, Pozo F, Pellicer A. Viral respiratory infections in very low birthweight infants at neonatal intensive care unit: prospective observational study. BMJ Paediatr Open. 2020;4:e000661.
Carraro S, Scheltema N, Bont L, Baraldi E. Early-life origins of chronic respiratory diseases: understanding and promoting healthy ageing. Eur Respir J. 2014;44:1682–96.
Jartti T, Gern JE. Role of viral infections in the development and exacerbation of asthma in children. J Allergy Clin Immunol. 2017;140:895–906.
Rasmussen M, Reddy M, Nolan R, Camunas-Soler J, Khodursky A, Scheller NM, et al. RNA profiles reveal signatures of future health and disease in pregnancy. Nature. 2022;601:422–7.
Acknowledgements
The authors would like to acknowledge all the women participating in the PRINCE study for their invaluable contribution and compliance during the study. Furthermore, the authors would like to thank Gudula Hansen for her crucial support in recruitment, Corinna Kramer and Sarah Kabish for clinical work and follow-up of participants, as well as Lina Otto for data entry. Additionally, the authors thank Dr. Ann-Kathrin Ozga for her intellectual input and conceptualization of the assessment of the lung growth index.
Funding
Open Access funding enabled and organized by Projekt DEAL. This work was supported by the German Research Foundation to ZDE, APC and DA (CRU296: AR232/25-2, DI 2103/2-1, SO1413/1-2; ZA1246/2-1; FOR5068: AR232/29-1), the Authority for Science, Research and Equality, Hanseatic City of Hamburg, Germany to APC and DA (LFF-FV73) and the Werner Otto Foundation to ZDE and GAD. ZDE and GAD are supported by the Clinician Scientist program of the RU5068 and the Medical Faculty of the University of Hamburg.
Author information
Authors and Affiliations
Contributions
DA and APC contributed equally as senior authors. ZDE contributed to conceptualization, methodology, investigation, validation, formal analysis, and writing of the original draft. GO contributed to methodology, software, data curation, validation, investigation, and formal analysis. SS contributed to methodology, formal analysis and investigation. PM, GAD, BI, and PM contributed to methodology and investigation. BS and MAC contributed to the methodology. HK contributed to methodology and resources. DA and APC contributed to conceptualization, resources, funding acquisition, supervision, reviewing and editing. All authors approved the final manuscript as submitted and agreed to be accountable for all aspects of the work.
Corresponding author
Ethics declarations
Ethical approval
The study protocol of the PRINCE study was approved by the Ethics Committee of the Hamburg Chamber of Physicians under the registration number PV3694 and performed in compliance with the Declaration of Helsinki for Medical Research involving human subjects. Informed consent forms to participate in the study were signed by all participants (or their parent or legal guardian in the case of children under 16).
Conflict of interest
No financial or non-financial benefits have been received or will be received from any party related directly or indirectly to the subject of this article. The authors have no conflict of interest to declare.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zazara, D.E., Giannou, O., Schepanski, S. et al. Fetal lung growth predicts the risk for early-life respiratory infections and childhood asthma. World J Pediatr (2024). https://doi.org/10.1007/s12519-023-00782-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12519-023-00782-y