The artificial intelligence-based model ANORAK improves histopathological grading of lung adenocarcinoma

Pan, Xiaoxi; AbdulJabbar, Khalid; Coelho-Lima, Jose; Grapa, Anca-Ioana; Zhang, Hanyun; Cheung, Alvin Ho Kwan; Baena, Juvenal; Karasaki, Takahiro; Wilson, Claire Rachel; Sereno, Marco; Veeriah, Selvaraju; Aitken, Sarah J.; Hackshaw, Allan; Nicholson, Andrew G.; Jamal-Hanjani, Mariam; Swanton, Charles; Yuan, Yinyin; Le Quesne, John; Moore, David A.

doi:10.1038/s43018-023-00694-w

Download PDF

Technical Report
Open access
Published: 10 January 2024

The artificial intelligence-based model ANORAK improves histopathological grading of lung adenocarcinoma

Nature Cancer volume 5, pages 347–363 (2024)Cite this article

6294 Accesses
43 Altmetric
Metrics details

Subjects

Abstract

The introduction of the International Association for the Study of Lung Cancer grading system has furthered interest in histopathological grading for risk stratification in lung adenocarcinoma. Complex morphology and high intratumoral heterogeneity present challenges to pathologists, prompting the development of artificial intelligence (AI) methods. Here we developed ANORAK (pyrAmid pooliNg crOss stReam Attention networK), encoding multiresolution inputs with an attention mechanism, to delineate growth patterns from hematoxylin and eosin-stained slides. In 1,372 lung adenocarcinomas across four independent cohorts, AI-based grading was prognostic of disease-free survival, and further assisted pathologists by consistently improving prognostication in stage I tumors. Tumors with discrepant patterns between AI and pathologists had notably higher intratumoral heterogeneity. Furthermore, ANORAK facilitates the morphological and spatial assessment of the acinar pattern, capturing acinus variations with pattern transition. Collectively, our AI method enabled the precision quantification and morphology investigation of growth patterns, reflecting intratumoral histological transitions in lung adenocarcinoma.

DIEANet: an attention model for histopathological image grading of lung adenocarcinoma based on dimensional information embedding

Article Open access 14 March 2024

Predicting EGFR mutational status from pathology images using a real-world dataset

Article Open access 16 March 2023

Grading of lung adenocarcinomas with simultaneous segmentation by artificial intelligence (GLASS-AI)

Article Open access 18 July 2023

Main

Lung adenocarcinoma (LUAD), the most common type of non-small cell lung cancer, is histologically characterized by distinct growth patterns: lepidic, papillary, acinar, cribriform, micropapillary and solid¹ (Extended Data Fig. 1a). The proposed International Association for the Study of Lung Cancer (IASLC) grading system, based on a combination of the predominant growth pattern and high-grade patterns (cribriform, micropapillary and solid) within individual tumors, is highly prognostic². However, there is interobserver variability among pathologists due to the challenges of consistently defining, recognizing and quantifying the wide spectrum of growth patterns³. This variability particularly affects differentiating lepidic, papillary and acinar patterns^2,4, as well as the estimated proportion of high-grade patterns in non-high-grade pattern-predominant tumors^2,5. Accurate quantification is challenging when there are multiple admixed growth patterns across several histological sections, as is the case in most LUADs. This challenge is compounded by the difficulty of defining the cutoff between different patterns where they represent a spectrum of histological appearances⁶. This poses challenges for accurate prognostic inference and reproducibility in clinical studies.

Computer-assisted approaches powered by artificial intelligence (AI) have been widely applied to histological image analysis^7,8,9,10,11. While some studies have applied deep learning models to LUAD growth pattern classification^12,13, automated IASLC grading by AI methods is yet to be explored. Moreover, previous deep learning methods were mainly based on patch-wise classification that predicts a histological subtype for each patch, overlooking the detailed morphological structure of patterns. To capture the distinct pattern morphology, we developed an AI method based on pixel-wise classification to segment growth pattern islands and automate the IASLC grading for risk stratification and outcome prediction.

In this study, we developed an AI method to segment LUAD growth patterns at the pixel level using hematoxylin and eosin (H&E) whole-slide images (WSIs) (Fig. 1a and Extended Data Fig. 1b,c) and applied it to 5,540 diagnostic slides from 1,372 cases, spanning four cohorts: TRAcking non-small cell lung Cancer Evolution through therapy (Rx) (TRACERx); Leicester Archival Thoracic Tumor Investigatory Cohort-Adenocarcinoma (LATTICe-A); The Cancer Genome Atlas (TCGA) LUAD; and Dartmouth Lung Cancer Histology Dataset (DHMC) (Fig. 1b). The growth pattern proportions, predominant pattern and IASLC grading of a tumor can be derived automatically based on growth pattern mapping (Fig. 1c). This pixel-wise segmentation method also revealed the morphological properties of growth patterns and enabled analysis of the degree of spatial heterogeneity, highlighting its advantages over patch-wise classification algorithms.

**Fig. 1: Proposed computational pipeline for precision mapping and spatial heterogeneity analyses.**

Results

A hierarchical AI model for growth pattern quantification

To spatially map complex growth patterns in LUAD, we developed ANORAK (pyrAmid pooliNg crOss stReam Attention networK), which encodes cross-stream interactions using a multi-order attention mechanism within convolutional neural networks¹⁴ (Fig. 1a and Extended Data Fig. 1b,c). Moreover, a pyramid pooling module (PPM)¹⁵ distributed global contextual information of growth patterns to guide high-level feature learning. ANORAK was trained on data annotated from 49 WSIs in the TRACERx 100 cohort (Extended Data Fig. 1a) by three thoracic subspeciality pathologists (Extended Data Fig. 1b), and validated on a total of 5,540 WSIs from 1,372 LUAD tumors across four cohorts (Fig. 1b and Table 1). This model enabled precision mapping of diverse growth patterns at pixel-level resolution, thereby facilitating automated grading and analysis of morphological intratumoral heterogeneity (Fig. 1c).

Table 1 Patient demographics (all cohorts)

Full size table

ANORAK generated promising outputs for growth pattern segmentation (Fig. 2a and Extended Data Figs. 2a,b and 3a). To validate the effectiveness of the developed model, we conducted the ablation study at the patch level (Extended Data Fig. 3b). Overall, multi-stream variants were more promising than single-stream ones, gaining an advantage by gathering different types of features. Moreover, methods with attention modules (multi-FO, multi-SO, ANORAK) achieved better overall performance, implying that the attention techniques came into effect. Specifically, first-order attention (multi-FO) improved performance by around 3% compared to the adding fashion (multi-ADD), while second-order attention (multi-SO) showed an approximate 5% improvement when compared to multi-FO. This suggested that high-level feature interactions across streams could be more effective than merging at low-level feature learning, highlighting the importance of high-level features in semantic segmentation^15,16. The proposed model adopted both first-order and second-order attention modules, enhancing the overall performance with notable improvements. To compare this with existing methods, ANORAK outperformed several widely used approaches in semantic segmentation, including attention U-Net¹⁷, DeepLabv3+ (ref. ¹⁸), DANet¹⁹ and MedT²⁰, for growth pattern subtypes (0.4430–0.7463; Extended Data Fig. 3c) except for solid pattern (0.7170), which was lower than DeepLabv3+ (0.7381). ANORAK also achieved overall promising performance at the patch-level and WSI-level evaluations (patch-Dice: ANORAK: 0.6034, other methods: 0.3770–0.5691; WSI agreement: ANORAK: 60.00–65.31%, other methods: 16–48.98%; Extended Data Fig. 3c). Furthermore, the parameters of ANORAK are 4.10 million, that is, more lightweight than other convolutional models (6.67–15.55 million; Extended Data Fig. 3c). Taken together, the proposed model may have advantages in performance and computing over other methods.

**Fig. 2: Performance of AI in the prediction and quantification of growth patterns.**

In all four cohorts, AI-predicted growth pattern proportions were highly correlated with the pathologists’ estimates (Fig. 2b and Supplementary Table 1), notably for the solid pattern (TRACERx 421, Spearman’s rho = 0.79; LATTICe-A correlations against each pathologist’s scoring, rho1 = 0.80, rho2 = 0.77, rho3 = 0.78; TCGA, rho = 0.67). The lowest correlations were observed for the micropapillary pattern (rho = 0.35–0.44 across three cohorts), which was also the pattern with the lowest interobserver agreement (LATTICe-A, 14.5–66.7%, average 39.8%; Extended Data Fig. 4a). When tumors were grouped according to their predominant pattern, the overall agreement rates between AI-predicted and manual scoring ranged between 50.18% and 67.96% (Supplementary Table 2 and Fig. 2c) across four cohorts. This is lower than the interobserver rates in LATTICe-A (53.49–74.08%; Extended Data Fig. 4b) but consistent with the known level of agreement between pathologists in previous studies (≥51.7%)^3,13. The kappa statistics suggested a moderate agreement between AI and pathologists as well as inter-pathologists for predominant pattern assessment (averaged kappa index of AI-pathologist in four cohorts = 0.46; inter-AI-pathologists in LATTICe-A = 0.46; inter-pathologists in LATTICe-A = 0.49; Supplementary Table 3 and Extended Data Fig. 4b). Likewise, the overall agreement rates of AI-based grading according to the IASLC guidelines (AI grading hereafter) (65.73–76.80%; Supplementary Table 2 and Fig. 2e) were lower than the rates between pathologists in LATTICe-A (71.95–82.01%; Extended Data Fig. 4c,e), but the kappa statistics indicated a moderate agreement with manual grading, comparable with interobserver agreement (averaged kappa index of AI-pathologist in four cohorts = 0.47; inter-AI-pathologists in LATTICe-A = 0.50; inter-pathologists in LATTICe-A = 0.50; Supplementary Table 3 and Extended Data Fig. 4c). Interestingly, tumors with discrepant classification between AI and manual scoring had a notably higher intratumoral heterogeneity in growth pattern composition, measured using the Shannon diversity index based on pathological scores, compared to tumors concordant between AI and manual scoring (TRACERx 421, P = 8.5 × 10⁻⁷; LATTICe-A, P1 < 2.22 × 10⁻¹⁶, P2 = 2.8 × 10⁻¹², P3 < 2.22 × 10⁻¹⁶; TCGA, P = 0.00076; Fig. 2d). A consistent trend was observed between discrepant and agreement classifications assessed by pathologists in LATTICe-A (P < 2.22 × 10⁻¹⁶, 4.3 × 10⁻¹³, 1.6 × 10⁻¹⁵; Extended Data Fig. 4d).

AI grading consistently improves patient risk stratification

Patients with IASLC grade 1 and 2 tumors as identified by AI had notably favorable disease-free survival (DFS) compared to patients with IASLC grade 3 tumors in TRACERx 421 (n = 206, P = 0.003, hazard ratio (HR) = 0.48, 95% confidence interval (CI) = 0.30–0.78) and LATTICe-A (n = 729, P = 1.73 × 10⁻⁷, HR = 0.53, 95% CI = 0.42–0.68; Fig. 3a). This prognostic effect remained notable when AI grading was incorporated in a multivariable model (TRACERx 421, n = 206, P = 0.009, HR = 0.51, 95% CI = 0.31–0.85; LATTICe-A, n = 729, P = 0.001, HR = 0.64, 95% CI = 0.49–0.84; Fig. 3b). The prognostic effect was slightly changed when tumor stage was replaced by tumor size (TRACERx 421, P = 0.004, HR = 0.48, 95% CI = 0.29–0.79; LATTICe-A, P = 0.001, HR = 0.64, 95% CI = 0.49–0.84; Extended Data Fig. 5a). The overall prognostic effect of the pair-wise comparison was consistently retained in the univariable (TRACERx 421, P = 0.011; LATTICe-A, P = 7.81 × 10⁻⁷) and multivariable analyses (TRACERx 421, tumor stage: P = 0.033, tumor size: P = 0.014; LATTICe-A, tumor stage: P = 0.004, tumor size: P = 0.003; Extended Data Fig. 5a).

**Fig. 3: Survival analyses of AI and pathologist grading.**

To determine the prognostic information provided by AI compared to manual scoring and the clinical baseline characteristics, we focused on the large LATTICe-A cohort. While manual IASLC grading from all three pathologists was prognostic (Extended Data Fig. 5b–d), AI grading achieved a comparable performance with all three pathologists (Fig. 3b) in LATTICe-A. When Cox regression models were considered for predicting DFS (baseline; age, sex, tumor stage), AI grading (baseline + automated IASLC grading) and manual grading (baseline + a pathologist’s manual IASLC grading), AI grading achieved a comparable performance with pathologists and clinical baseline for stage I–III tumors in LATTICe-A (n = 729, concordance index (C-index): AI = 0.682, 95% CI = 0.650–0.713; path 1 = 0.679, 95% CI = 0.645–0.713; path 2 = 0.680, 95% CI = 0.647–0.713; path 3 = 0.675, 95% CI = 0.644–0.707; baseline = 0.665, 95% CI = 0.633–0.697; Fig. 3c). Consistent performance was observed for stage I–III tumors in TRACERx 421 (n = 206, C-index: AI = 0.689, 95% CI = 0.625–0.752; path = 0.689, 95% CI = 0.625–0.752; baseline = 0.670, 95% CI = 0.608–0.733; Fig. 3c). In patients with early-stage tumors, the C-index of AI grading was comparable with pathologist grading but higher than baseline in TRACERx 421 (n = 108, C-index: AI = 0.700, 95% CI = 0.618–0.783; path = 0.695, 95% CI = 0.607–0.783; baseline = 0.665, 95% CI = 0.571–0.759; Fig. 3c). However, in LATTICe-A, the association between DFS and AI grading was consistently higher than the grading from pathologists (n = 337, C-index: AI = 0.643, 95% CI = 0.584–0.702; path 1 = 0.630, 95% CI = 0.570–0.690; path 2 = 0.615, 95% CI = 0.548–0.683; path 3 = 0.600, 95% CI = 0.526–0.673; baseline = 0.560, 95% CI = 0.495–0.625; Fig. 3c). Furthermore, once AI grading was added to manual grading (Supplementary Table 4), the prognostic value of the combined grading was consistently improved for stage I tumors (increment in C-index for path in TRACERx 421 = 0.013; path 1 = +0.023; path 2 = +0.028; path 3 = +0.043 in LATTICe-A; Fig. 3c), which was marginally higher than adding an additional manual grading in LATTICe-A (Extended Data Fig. 5e and Supplementary Table 5).

Taken together, these data suggest that AI grading adds independent prognostic value for patient stratification, particularly for stage I disease in which clinical decision-making regarding adjuvant therapy following surgery can be challenging in the absence of evidence for outcome benefit.

Assisting pathologists in challenging scenarios

To evaluate the utility of our AI method to assist pathologists with LUAD grading, we identified four specific scenarios and used the large LATTICe-A cohort with manual grading available from three pathologists. We focused on stage I LUAD tumors, a group of patients with an unmet need for predicting which patients are likely to relapse to guide early intervention, potentially with adjuvant therapy²¹.

The first scenario consisted of cases with highly diversified growth patterns indicated by the Shannon diversity index (Fig. 4a), which was notably higher in cases with discrepant predominant patterns between AI and pathologists (Fig. 2d). When evaluated in cases with high growth pattern diversity based on the Shannon index derived from manual scoring, AI grading consistently obtained a higher C-index than pathological grading for DFS prediction (AI = 0.602, 95% CI = 0.485–0.720; path 1 = 0.590, 95% CI = 0.472–0.709, n1 = 169; AI = 0.602, 95% CI = 0.497–0.706; path 2 = 0.572, 95% CI = 0.453–0.692, n2 = 162; AI = 0.620, 95% CI = 0.537–0.704; path 3 = 0.578, 95% CI = 0.494–0.663, n3 = 167; stage I, Fig. 4a; stages I–III, Extended Data Fig. 6a; all models included baseline clinical parameters, same hereafter).

**Fig. 4: Assistance of AI in challenging scenarios for grading stage I tumors in LATTICe-A.**

Second, we focused on tumors scored predominantly as lepidic or acinar by each pathologist, excluding any morphologically homogeneous tumor that received a score of 90% or more for either pattern²². There is an ongoing difficulty in the histopathological discrimination between in situ and invasive disease⁴, and the distinction between invasive acinar and lepidic growth altered by interstitial fibrosis or iatrogenic compression with alveolar collapse can be particularly difficult. Differences in classification between pathologists can generate a shift between low and medium grade, which was observed among pathologists in the LATTICe-A cohort (Extended Data Fig. 4a). Therefore, these heterogeneously scored lepidic-predominant or acinar-predominant tumors present a challenging scenario to further test the added benefit of an AI grading system. AI grading consistently achieved a better performance in predicting DFS against pathological grading (AI = 0.658, 95% CI = 0.546–0.770; path 1 = 0.616, 95% CI = 0.513–0.718, n1 = 146; AI = 0.621, 95% CI = 0.530–0.711; path 2 = 0.587, 95% CI = 0.478–0.695, n2 = 136; AI = 0.703, 95% CI = 0.625–0.781; path 3 = 0.599, 95% CI = 0.512–0.687, n3 = 175; stage I, Fig. 4b; stages I–III, Extended Data Fig. 6b). There was a similar challenge in distinguishing between lepidic and papillary growth. When predominantly but heterogeneously presented (<90%) lepidic and papillary tumors were investigated in the context of comparing DFS prediction, AI grading consistently achieved a higher C-index (AI = 0.651, 95% CI = 0.420–0.882; path 1 = 0.619, 95% CI = 0.427–0.811, n1 = 92; AI = 0.658, 95% CI = 0.449–0.8670; path 2 = 0.614, 95% CI = 0.442–0.786, n2 = 77; AI = 0.602, 95% CI = 0.423–0.780; path 3 = 0.532, 95% CI = 0.373–0.692, n3 = 79; stage I, Fig. 4b; stages I–III, Extended Data Fig. 6b). The absence of statistical significance could be attributed to the relatively smaller number of patients and events in each group.

The third scenario was the detection of aggressive, high-grade patterns. Although there was a high concordance rate for cases composed predominantly of high-grade patterns (Extended Data Fig. 4e), the proposed IASLC grading system sets a 20% cutoff for high-grade patterns to qualify as grade 3, adding challenges to identify high-grade patterns from non-high-grade pattern-predominant tumors. Therefore, we selected tumors with high-grade patterns (≥5%) at lower abundance (≤30%) as scored by each pathologist and compared their manual grading with AI grading. Such analyses allowed us to examine manually scored tumors, which may be ‘close calls’ among observers when determining the high-grade pattern cutoff. A higher C-index for AI grading was consistently observed compared with all pathologists’ grading in predicting DFS (AI = 0.631, 95% CI = 0.486–0.776; path 1 = 0.574, 95% CI = 0.392–0.757, n1 = 79; AI = 0.560, 95% CI = 0.347–0.773; path 2 = 0.505, 95% CI = 0.313–0.696, n2 = 63; AI = 0.640, 95% CI = 0.521–0.759; path 3 = 0.607, 95% CI = 0.461–0.753, n3 = 128; stage I, Fig. 4c; stages I–III, Extended Data Fig. 6c).

Finally, we considered cases with high numbers of diagnostic slides per tumor (Fig. 4d), defined as four or more slides (n = 233, decreased kappa index in Fig. 4d). In these cases, AI grading achieved a C-index higher than average for the manual grading but lower than pathologist 1 in predicting DFS (AI = 0.638, 95% CI = 0.577–0.699; path 1 = 0.645, 95% CI = 0.581–0.709; path 2 = 0.615, 95% CI = 0.551–0.680; path 3 = 0.607, 95% CI = 0.523–0.691; stage I, Fig. 4d; stages I–III, Extended Data Fig. 6d).

These data indicated that our proposed AI method was not inferior to pathological grading and could assist pathologists to grade growth patterns in certain challenging scenarios.

Acinar morphology and spatial heterogeneity

Precise spatial delineations of growth patterns allowed us to study the spatial configuration of tumors as morphologically distinct pattern islands (Fig. 2a and Extended Data Figs. 2a,b and 3a). Acinar growth, often considered as an intermediate state during the transition of morphological patterns^6,23, was also the most prevalent pattern in stage I tumors in the LATTICe-A cohort (Fig. 5a). The area of individual acinar islands was similar to that of micropapillary islands, and smaller than those of other patterns (Fig. 5b). These data led us to investigate the importance of morphological features and spatial distribution of acinar islands that may be indicative of histology pattern transition.

**Fig. 5: Characterization of tumors with acinar morphological features and spatial heterogeneity.**

We used area and shape measured using pixel number and solidity index (Extended Data Fig. 7a) to represent the morphological features of individual acinar islands. Acinar island area and shape were notably different in tumors (≥5% of acinar) with different predominant patterns (TRACERx 421 n = 173; LATTICe-A n = 654; Extended Data Fig. 7b). Smaller acinar islands were enriched in lepidic-predominant tumors compared to acinar-predominant and papillary-predominant tumors (TRACERx 421 P = 0.00052; LATTICe-A P = 5.4 × 10⁻¹²; Fig. 5c and Extended Data Fig. 7c). This may reflect the acinar structures in lepidic-predominant disease frequently representing airspaces with iatrogenic collapse²⁴. The area of acinar islands in high-grade pattern-predominant (cribriform, micropapillary and solid) tumors were also smaller than those in acinar-predominant and papillary-predominant tumors (TRACERx 421 P = 9.8 × 10⁻¹¹; LATTICe-A P < 2.22 × 10⁻¹⁶; Fig. 5c and Extended Data Fig. 7c). Notably, this area feature was a strong discriminator between acinar-predominant and cribriform-predominant tumors (TRACERx 421 P = 0.0007; LATTICe-A P = 1.5 × 10⁻⁷; Fig. 5d), indicating that acini may form differently in acinar-predominant tumors compared to others. The transition from an acinar to a cribriform pattern may frequently occur to large acinar islands through gland fusion (Extended Data Fig. 7e), while smaller acinar structures may remain. Alveolar architectures in airspace detected in acinar-predominant tumors might also be supporting large ‘glands’. Acinar islands with regular shapes were enriched in high-grade-predominant tumors compared with lepidic subtypes (TRACERx 421 P = 0.0024; LATTICe-A P = 4.1 × 10⁻⁷; Fig. 5e and Extended Data Fig. 7d), which is again consistent with morphological variance due to the compressibility of lepidic growth. Taken together, the morphological features of acinar islands vary notably in tumors predominantly enriched with different patterns (Fig. 5f).

To investigate the spatial arrangement of acinar patterns, we developed an acinar scattering score that measured the degree of acinus dispersion. A low score indicated locally clustered acinar islands, while a high score implied a dispersion of acinar islands throughout the tissue (Extended Data Fig. 7f). Low acinar scattering was found more frequently in lepidic-predominant tumors compared to all others (TRACERx 421 P = 0.017; LATTICe-A P = 0.004; Fig. 5g), indicating that clustered acinar islands may reflect the compression induced by iatrogenic collapse and may also suggest that the transition from lepidic to acinar occurs in an organized manner²⁵. We next explored acinar scattering in the context of outcome prediction. Tumors with highly scattered acini were associated with reduced DFS compared to lowly scattered tumors (TRACERx 421 n = 205, P = 0.003, HR = 1.89, 95% CI = 1.25–2.86; LATTICe-A n = 837, P = 5.09 × 10⁻⁷, HR = 1.63, 95% CI = 1.35–1.98; Fig. 5h) in univariate analysis. In a multivariable model incorporating acinar scattering and AI grading, acinar scattering was independent of AI grading (TRACERx 421 P = 0.004; LATTICe-A P = 2.61 × 10⁻⁵; Fig. 5i). These data suggest that acinar scattering may be a potential pattern reflecting histological transition events, and that high scattering may be a morphological phenotype indicating poor prognosis, which can be assessed from H&E images.

Discussion

We have developed an AI method ANORAK for the precise classification of growth patterns in LUAD. To the best of our knowledge, this is the first AI method to dissect LUAD growth patterns at the pixel level and be tested in over 1,000 cases, setting a benchmark in automated grading of LUAD. Our method can automatically estimate growth pattern proportions and predominant patterns within a tumor, providing an unbiased and automated pipeline for determining IASLC grading in LUAD. Moreover, the precise delineation of growth patterns can provide insights into the heterogeneous landscape of LUAD, which cannot be addressed by patch-wise classification methods.

The AI method was evaluated in four cohorts, comprising a total of 1,372 tumors. The overall agreement of predominant pattern at the tumor level between AI and pathologists across four cohorts was moderate, which is consistent with the inter-pathologist agreement in the LATTICe-A and DHMC cohorts¹³. Similar results were found in previous studies. Boland et al.³ reported an agreement of 51.7% between two pathologists for a large cohort of individuals with LUAD (n = 534), while Thunnissen et al.⁴ showed good agreement for typical cases and fair agreement for difficult cases by comparing scores from 26 pathologists. In addition, tumors with a discrepant predominant pattern classification between AI and manual scoring were more heterogeneous compared to tumors in agreement. Previous attempts were made to determine how clonal evolution is reflected in growth pattern heterogeneity through the identification of molecular alterations that accompany the transition between growth patterns⁶. This detailed analysis in a small number of tumors found that changes in expression, rather than mutations, accompanied the transition; as such, clear evidence of divergent tumor clones reflected in the growth pattern was not identified. On a larger scale, in the TRACERx study, although without specific focus on sampling to capture divergent growth patterns, there was a tendency for tumors to evolve from low-grade or mid-grade to higher grade growth patterns in individuals with LUAD where an ancestor–descendant relationship could be described based on clonal or subclonal loss of heterozygosity²².

The proposed IASLC grading system was originally introduced to improve prognostication using tumor morphology². In our study, AI grading improved the performance of predicting DFS compared to the baseline and pathological grading for stage I tumors, and be comparable for stage I–III tumors. Moreover, the prognostic value of AI grading was independent of clinical parameters in the TRACERx 421 and LATTICe-A cohorts. In typical clinical practice, the colineage of postsurgical recurrence is not definitively confirmed, although data from the TRACERx 421 cohort showed that only two out of 49 cases of clinically classified postsurgical recurrence were of different lineage using whole-exome sequencing²⁶. While we acknowledge that these uncommon events limit the ability to predict recurrence from resection specimens, this applies equally to both our method and established practices.

The LATTICe-A cohort, consisting of 845 tumors with scores from three pathologists, allowed a comprehensive investigation of the clinical impact of the AI method and showed its benefit as a morphological biomarker. This benefit was slightly higher than that brought by an additional manual grading for stage I tumors, and was comparable with additional manual grading for stage I–III tumors. Furthermore, analyses of manual scoring demonstrated that tumors with multiple slides and intratumoral morphological heterogeneity were particularly challenging cases. In these cases, AI grading achieved a stronger predictive ability compared to manual grading for stage I tumors. Because stage I patients frequently receive surgical resection without adjuvant therapy, the accurate prediction of recurrence, to better target individual patients for adjuvant therapies, is critical. These data illustrate the clinical utility of our AI method for stage I tumors, which could potentially be used as an alternative or independent variable to manual grading, or be applied specifically to challenging cases.

The AI method enables the spatial profiling of growth patterns at the pixel level, allowing morphological and spatial heterogeneity analyses at the growth pattern island level. This would be unattainable with alternative manual or patch-wise classification methods. We used the area and solidity index to measure acinar island morphology and found that small acinar islands were enriched in lepidic-predominant and high-grade-predominant tumors, while the shape of these small acini in lepidic-predominant tumors was more irregular than high-grade-predominant tumors. This may reflect tumor cell biological and microenvironmental differences regarding the formation of acinar structures within the context of different predominant architectures. Because acinar morphological features were obtained by averaging thousands of acinar islands within a tumor, noise due to island segmentation was mitigated (Supplementary Figs. 1–7). We also developed a metric for measuring the spatial distribution within the tissue space of acinar islands, termed acinar scattering. Low acinar scattering was notably associated with lepidic-predominant tumors compared to others, suggesting that acinar spatial distribution may reflect the transition of growth patterns toward more aggressive behavior. High acinar scattering was correlated to unfavorable outcomes, independent of AI grading.

This study has some limitations. The Dice coefficient of ANORAK is still limited, indicating that error modes exist. Intratumoral and tumor microenvironment heterogeneity may result in variations in growth pattern morphology, making segmentation more challenging, specifically among lepidic, papillary and acinar patterns. Meanwhile, the patching operation during the training and testing stages may limit the field of view, thus losing context information. Stain color shift may also have the potential for misclassification despite the color augmentations and normalizations applied to mitigate this impact. These factors may contribute to local error modes, which, when accumulated, may result in errors at the WSI level. In addition, because the model counted the number of pixels to determine the predominant pattern per tumor, and the area of micropapillary islands was smaller than the papillary structures²⁷, the discrepancy between AI and pathologists regarding papillary-predominant and micropapillary-predominant patterns may be considered another error mode. Furthermore, because we only collected histopathology annotations from invasive non-mucinous LUAD as training data, invasive mucinous and preinvasive tumors with distinct morphologies are therefore outside of the scope, which may generate inaccurate results or completely fail if applied to such samples. In addition, we selected a ‘challenging case series’ from the LATTICe-A cohort, because the other cohorts considered in this study had fewer cases satisfying the selection criteria. However, LATTICe-A is not a screening-based cohort. It is therefore crucial to validate the potential clinical benefits of AI grading in further cohorts that include screening-detected tumors. Because there are no other studies reporting the importance of acinar spatial arrangement, further validations and studies of the biological implications of acinar scattering are needed.

In summary, the AI method we developed can automate the predominant growth pattern and IASLC grading for LUAD tumors, achieving a moderate agreement with pathologists; this was validated in four cohorts consisting of 1,372 cases. In the TRACERx 421 and LATTICe-A cohort, AI grading was an independent prognostic indicator and had a stronger prognostic ability than pathological grading alone for stage I tumors in the LATTICe-A cohort. The prognostic performance of AI grading was further underlined in challenging scenarios consisting of cases with multiple slides and greater intratumoral heterogeneity. Furthermore, specific morphological features of tumor acini have the potential to infer different underlying tumor biology, with the spatial heterogeneity of acinar islands reflecting divergent tumor behavior and prognosis.

Methods

Study cohorts

TRACERx is a multi-center, prospective study, which began recruitment in April 2014 (https://clinicaltrials.gov/ct2/show/NCT01888601, approved by an independent research ethics committee, ref. no. 13/LO/1546). Formalin-fixed paraffin-embedded and H&E-stained histopathology diagnostic slides were scanned using the NanoZoomer S210 digital slide scanner (catalog no. C13239-01) and NanoZoomer digital pathology system v.3.1.7 (Hamamatsu) at ×40 (0.228 μm per pixel resolution)^28,29. LATTICe-A is a retrospective series of all consecutively resected primary LUAD tumors at a single UK surgical center between 1998 and 2014. The work was ethically approved by a UK National Health Service research ethics committee (ref. no. 14/EM/1159) and complies with Strengthening the Reporting of Observational Studies in Epidemiology guidelines. All archived slides containing tumor material were used to capture the full diversity of each lesion. Slides were dearchived and scanned using a Hamamatsu NanoZoomer XR at ×40 (0.226 μm per pixel resolution)^23,29. Available diagnostic slides from the TCGA LUAD³⁰ were downloaded from https://portal.gdc.cancer.gov/ in 2021. The DHMC¹³ was downloaded from https://bmirds.github.io/LungCancer/ in 2021. Further information on the research design is available in the Nature Research Reporting Summary linked to this article.

The training set of the AI method consisted of 49 WSIs from 49 patients in the TRACERx 100 cohort^28,29. The WSIs were sparsely annotated by three independent thoracic subspeciality pathologists, yielding 3,662 patches (768 × 768 pixels at ×20, approximately 0.45 μm per pixel) of annotations for six typical growth patterns (Extended Data Fig. 1a) and non-tumor areas, for example, normal tissue and blank areas.

The AI method was then applied and evaluated on a total of 5,540 WSIs from four cohorts, which were collected, processed and scanned independently. This included patients with invasive non-mucinous LUAD as primary diagnosis (excluding adenocarcinoma in situ, minimally invasive adenocarcinomas and other variants) from the TRACERx 421 cohort (n = 206, 1,184 slides)^22,26, LATTICe-A cohort (n = 845, 3,979 slides)²³, TCGA LUAD cohort (n = 178, 234 slides)³⁰, DHMC cohort (n = 143, 143 slides)¹³ (Table 1). TRACERx 100 is a subset of TRACERx 421. For the TRACERx 421 and LATTICe-A cohorts, slides were from all the diagnostic blocks containing tumor cells. For the DHMC cohort and most patients (91%) in the TCGA cohort, only one slide was available. Hence, we only considered these two cohorts for agreement performance comparison. No statistical method was used to predetermine sample size but our sample sizes are similar to those reported in previous publications^{13,22,26,29,30} and subject to available diagnostic slides. Blinding and randomization were not relevant because this was an observational study. Patients were not allocated to any interventions and they were followed up and assessed as per routine practice. No results from this study were reported back to patients, so there is no likelihood of people changing their behaviors based on these findings. The deep learning model was trained without knowing the outcome of patients, which represents a form of blinding.

Manual pathological grading of growth patterns, as well as individual pattern proportion scoring, were available for the TRACERx 421, LATTICe-A and TCGA cohorts. The DHMC cohort only had predominant pattern data for each slide. In the LATTICe-A cohort, three independent consultant-level thoracic subspeciality pathologists provided growth pattern scoring for each tumor.

In the TRACERx 421 cohort, DFS was defined as the period from the date of registration to the time of radiological confirmation of the recurrence of the primary tumor registered for the TRACERx or the time of death by any cause. During the follow-up, three participants with LUAD (CRUK0512, CRUK0428 and CRUK0511) developed new primary cancer and subsequent recurrence from either the first primary lung cancer or the new primary cancer diagnosed during the follow-up. These cases were censored at the time of the diagnosis of the new primary cancer for DFS analysis because of the uncertainty of the origin of the third tumor²².

In the LATTICE-A cohort, recurrence data were obtained from the examination of patient records, notably paper notes and radiological databases, to identify the date of radiologically or biopsy-confirmed recurrence. Cancer-specific death was determined by the presence of lung cancer in the cause of death in the death certificate. Overall survival refers to the date of death.

Deep learning model architecture

We developed a deep learning-based model¹⁴ ANORAK which leveraged cross-stream interaction to recognize and segment six histological patterns (lepidic, acinar, papillary, micropapillary, cribriform and solid) on WSIs at the pixel level. The model applied ResNet50 (ref. ³¹) as the backbone with customized modifications to account for the limited training data. It encoded three streams (coarse, intermediate and fine) with different scales of information to gather abundant features at different resolutions (×10 at approximately 0.9 μm per pixel, ×5 and ×2.5). The first-order attention (Extended Data Fig. 1c) introduced global contextual information at an early stage to guide low-level feature learning and enable the first round of interactions between streams. Each output in the coarse and intermediate streams was then fed into a convolution layer to align the depth dimension with the fine stream output. A PPM¹⁵ (Extended Data Fig. 1c) was used to integrate high-level features. Afterwards, such features were forwarded to a second-order attention module, learning the relationship of streams to extract more discriminative features, and driving high-level feature exchanging between streams (Extended Data Fig. 1c and Fig. 1a).

Implementation and evaluation

Before training, the annotated tiles were divided into nonoverlapping patches, except for patches at the bottom and right edges, with a size of 768 × 768 pixels at ×20. During training, four data augmentation strategies were used to mitigate overfitting: random rotation within 90 degrees; random width-shift and height-shift up to 20% of the input width and height; randomly zooming in or out in a range of (0.8, 1.2); and random adjustment of the saturation within (0.8, 2.0) and hue within (−0.1, 0.1). Color augmentation was not applied to the cross-validation stage because data were from the same cohort. The model was trained for 60 epochs with a batch size of eight. Cross-entropy loss was applied as the objective function, which was minimized by the Adam optimizer with a step-wise learning rate. The initialization rate was set to 10⁻³ for the first ten epochs; then, it was decreased by ten times for the next 40 epochs, which was then followed by another ten times of decreasing (10⁻⁵) for the remaining ten epochs. The pipeline was implemented with Python v.3.8, tensorflow-gpu v.2.2, keras v.2.4.3, h5py v.2.10.0, numpy v.1.20.3, opencv-python v.4.5.3.56, pandas v.1.3.2, pillow v.8.3.1 and scipy v.1.7.1.

The ablation experiments at the patch level included comparisons with the baseline method (single-stream), multi-stream with the element-wise add combination (multi-ADD), multi-stream with first-order attention alone (multi-FO), multi-stream with second-order attention alone (multi-SO) and the proposed ANORAK model (multi-FO and multi-SO). The proposed model was compared against other widely used approaches in semantic segmentation, including attention U-Net¹⁷, DeepLabV3+ (ref. ¹⁸), DANet¹⁹ and MedT²⁰. We applied the Dice coefficient to evaluate segmentation performance at the patch level and the agreement of predominant patterns to assess prediction at the WSI level. Comparisons were conducted with fivefold cross-validation for the TRACERx 100 cohort (n = 53) and on a subset of the LATTICe-A cohort (n = 50), an independent dataset to the training dataset.

Growth pattern and grading inference

Each WSI was divided into tiles of 2,000 × 2,000 pixels with the magnification downsampled to ×20 (approximately 0.45 μm per pixel)²⁹. Each tile was then normalized to a target image to align the color before feeding it to the well-trained deep learning model, which, in turn, generated corresponding masks for all growth pattern regions detected at the pixel level. The tile masks were then stitched and further downsampled to ×1.25 (approximately 7.2 μm per pixel). Small components were empirically removed as postprocessing; lepidic patterns that were less than approximately 0.05 mm², and papillary, cribriform and solid patterns that were less than approximately 0.015 mm² were removed.

The predominant pattern and grading were inferred from a stitched and downsampled mask (approximately 7.2 μm per pixel). The growth pattern proportion for each tumor was computed as the proportion across all slides of a given tumor:

$$g_j=\frac{{\sum }_{i=1}^{m}{S}_{ij}}{{\sum }_{i=1}^{m}{\sum }_{j}^{n=6}{S}_{ij}}$$

$${P}={\rm{argmax}}({g}_{j})$$

where g_j is a proportion for the j pattern, j represents lepidic, acinar, papillary, cribriform, micropapillary and solid, i is the i-th slide, m is the number of slides per tumor, n is the number of patterns and S_ij is the number of pixels identified for the j pattern with the i-th slide. The predominant pattern, P, is determined as the pattern with the highest proportion. The growth pattern grading driven by AI followed the IASLC grading system²: grade 1, lepidic-predominant tumors with less than 20% of high-grade patterns (solid, micropapillary, cribriform); grade 2, acinar-predominant or papillary-predominant tumors with less than 20% of high-grade patterns; and grade 3, any tumor with 20% or more high-grade patterns.

Agreement between AI and pathological scores with regard to predominant patterns

The strongest correlation for growth pattern proportion between the AI and manual estimates was observed for the solid pattern (TRACERx 421, rho = 0.79; LATTICe-A correlations against each pathologist’s scoring, rho1 = 0.80, rho2 = 0.77, rho3 = 0.78; TCGA, rho = 0.67; Fig. 2b and Supplementary Table 1), followed by acinar (TRACERx 421, rho = 0.69; LATTICe-A, rho1 = 0.67, rho2 = 0.58, rho3 = 0.65; TCGA, rho = 0.56; Fig. 2b and Supplementary Table 1). A moderate correlation was observed for the micropapillary subtype (TRACERx 421, rho = 0.35; LATTICe-A, rho1 = 0.35, rho2 = 0.42, rho3 = 0.40; TCGA, rho = 0.44; Fig. 2b and Supplementary Table 1). Compared with other patterns, solid-predominant tumors had the highest agreement levels between AI and manual scoring (TRACERx 421, 85.5%; LATTICe-A, 85.4%, 79.9%, 85.3% against three pathologists; TCGA, 72%; DHMC, 90.2%; Fig. 2c). A lower agreement rate was observed for micropapillary-predominant tumors (TRACERx 421, 0%; LATTICe-A, 38.1%, 19.7% and 50% against three pathologists; TCGA, 20%; DHMC, 0%; Fig. 2c). Most discrepant micropapillary-predominant cases were identified as papillary and acinar by AI (TRACERx 421, 40%; LATTICe-A, 42.8%, 63.1%, 43.7%; TCGA, 60%; DHMC, 100%), suggesting that micropapillary islands frequently mixed with acinar or papillary in micropapillary-predominant tumors.

C-index measuring prognostic ability

We used the C-index to measure the prognostic ability of the survival models. Cox regression models were considered for predicting DFS; specifically, the baseline model included age, sex, tumor stage (excluded for stage I tumors as the stage information remains the same). The AI grading-based model included clinical baseline characteristics and automated IASLC grading. The manual grading-based model included clinical baseline characteristics together with a pathologist’s manual IASLC grading. When excluding clinical parameters, AI grading achieved a comparable C-index with pathological grading in stage I (TRACERx 421: AI = 0.588, 95% CI = 0.483–0.692; path = 0.593, 95% CI = 0.461–0.724; LATTICe-A: AI = 0.616, 95% CI = 0.571–0.661; path 1 = 0.609, 95% CI = 0.563–0.656; path 2 = 0.593, 95% CI = 0.545–0.641; path 3 = 0.571, 95% CI = 0.483–0.658; Supplementary Table 6) and stage I–III tumors (TRACERx 421: AI = 0.588, 95% CI = 0.547–0.630; path = 0.581, 95% CI = 0.530–0.632; LATTICe-A: AI = 0.577, 95% CI = 0.554–0.600; path 1 = 0.577, 95% CI = 0.552–0.603; path 2 = 0.574, 95% CI = 0.551–0.597; path 3 = 0.569, 95% CI = 0.546–0.591; Supplementary Table 6).

Acinar morphological features

The pixel number and solidity index, that is, the proportion of pixels in the convex hull that were also in a region of interest, were used to measure the individual acinar island area and shape generated by the AI method. A higher solidity index indicated a more regular shape. The average area and solidity index of all the individual acinar islands identified from the available slides were taken as the tumor-level features.

Acinar scattering score

We adapted an established score, standard distance³², to measure the spatial distribution of acinar patterns, which we termed ‘acinar scattering’:

$$d=\sqrt{\frac{{\sum }_{i=1}^{n}{({x}_{i}-{x}_{0})}^{2}+{\sum }_{i=1}^{n}{(\,{y}_{i}-{y}_{0})}^{2}}{n\times N}}$$

where d is the standard distance, n is the number of isolated acinar islands within the tissue identified by the proposed AI method, N is the area of the tissue, (x_i, y_i) is the centroid of an acinar island and (x₀, y₀) is the mean center of all the acinar islands.

$${x}_{0}=\frac{{\sum }_{i=1}^{n}{x}_{i}}{n},\ {y}_{0}=\frac{{\sum }_{i=1}^{n}{y}_{i}}{n}$$

A higher acinar scattering score indicated a more scattered distribution of acini across the tissue. The median value of all available slides for a given tumor was taken as the tumor-level score. The optimal cutoff (0.36) separating tumors into low-scattering and high-scattering groups was selected from the discovery cohort, LATTICe-A, which was then applied directly to the TRACERx 421 cohort.

In a univariable model, acinar scattering was prognostic of DFS for LATTICe-A in grade 2 and 3 tumors, respectively (grade 2, n = 212, P = 1.95 × 10⁻⁵, HR = 2.48, 95% CI = 1.63–3.76; grade 3, n = 570, P = 0.007, HR = 1.35, 95% CI = 1.08–1.68; Extended Data Fig. 8b,c), but not in grade 1 tumors (Extended Data Fig. 8a). In the TRACERx 421 cohort, high acinar scattering was associated with reduced DFS in grade 3 tumors (n = 137, P = 0.042, HR = 1.64, 95% CI = 1.01–2.65) and remained borderline in grade 2 tumors (n = 56, P = 0.053, HR = 2.74, 95% CI = 0.99–7.61), but was not notable in grade 1 tumors. The lack of statistical significance was probably due to the smaller number of patients and events in the grade 1 subgroup. When merging grade 1 and 2 tumors, the prognostic effect of acinar scattering was observed (TRACERx 421, n = 68, P = 0.025, HR = 2.79, 95% CI = 1.14–6.85; LATTICe-A, n = 267, P = 1.39 × 10⁻⁵, HR = 2.36, 95% CI = 1.60–3.48; Extended Data Fig. 8d).

Statistics and reproducibility

Correlation tests used Spearman’s method and were generated using the function cor.test from the stats v.4.1.2R package. Confusion matrices were obtained using the function confusionMatrix from the caret v.6.0-93R package. Fleiss’ kappa was computed to assess the agreement among observers using the function kappam.fleiss from the irr v.0.84.1R package. Survival analyses were conducted using the Kaplan–Meier estimator (ggsurvplot R function from the survminer v.0.4.9 and survival v.3.2-13R packages) as well as the Cox model (coxph R function, displayed using the ggforest R function). The differences between grade strata Kaplan–Meier curves were determined using Wald tests. Forest plots showed the HR on the x axis; each variable’s HR was plotted and annotated with a 95% CI. All HRs were computed for all time points (the whole survival curve was not at a specific time point). For statistical comparisons among groups, a two-sided, nonparametric, unpaired Wilcoxon rank-sum test was used for the continuous variables, while a Fisher’s exact test was used for the categorical variables. A Kruskal–Wallis test was used for comparisons among over two groups, unless stated otherwise. Predictive performance was assessed using a C-index³³ within 5 years, computed with the function Inf.Cval from the survC1 v.1.0-3R package. Multicollinearity between AI and manual grading, and between two manual gradings were assessed using the function vif from the car v.3.0-12R package. All statistical tests were two-sided and P < 0.05 was considered as statistically significant. To adjust P values for multiple comparisons, the Benjamini–Hochberg method was used. The packages tidyverse v.2.0.0 and tidyr v.1.3.0 were used for data processing in R. Plotting was done using ggplot2 v.3.4.1, RColorBrewer v.1.1-3 and ggpubr v.0.5.0R packages. All statistical analyses were conducted in R v.4.1.2.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The training dataset consisting of annotations on small image tiles have been deposited in Zenodo (https://doi.org/10.5281/zenodo.10016027). Previously published image data that were reanalyzed in this study can be requested from https://bmirds.github.io/LungCancer/. The human LUAD diagnostic slide images were derived from the TCGA Research Network at https://portal.gdc.cancer.gov/. Images generated by the AI model in Fig. 2a and Extended Data Figs. 2, 3a and 7f can be accessed at figshare (https://doi.org/10.6084/m9.figshare.24599796). For the TRACERx study, all of the scanned diagnostic histological images have a study number label embedded in the file that prevents complete anonymization. Therefore, these images cannot be shared, in line with the ethical approval for the study. Requests for access to the TRACERx dataset for academic noncommercial research purposes can be submitted through the Cancer Research UK and UCL Cancer Trials Centre (ctc.tracerx@ucl.ac.uk) and are subject to review of a project proposal that will be evaluated by a TRACERx data access committee, entering into an appropriate data access agreement and any applicable ethical approvals. The time frame of response to requests is about 6 months. LATTICe-A study data and materials are currently subject to a material and data transfer agreement between the University of Leicester, the University of Cambridge and NHS Greater Glasgow and Clyde, which includes a restricted access period of 5 years, precluding any access by other third parties during this time. After the 5-year period, restricted access data can be accessed by application to NHS Greater Glasgow and Clyde Biorepository (clare.orange@ggc.scot.nhs.uk; john.lequesne@glasgow.ac.uk) as custodians; the data access request will be reviewed and released under their research ethics committee-approved tissue bank protocols. Requests will be reviewed and approved within 6–8 weeks and will be accompanied by a data sharing agreement detailing the conditions and restrictions of use and publication. Source data are provided with this paper.

Code availability

The AI pipeline for growth pattern segmentation is available at https://github.com/xi11/AIgrading. All code used for the analyses was developed in R v.4.1.2 and is available to reproduce all figures (https://github.com/xi11/AIgrading).

References

Nicholson, A. G. et al. The 2021 WHO Classification of Lung Tumors: impact of advances since 2015. J. Thorac. Oncol. 17, 362–387 (2022).
Article PubMed Google Scholar
Moreira, A. L. et al. A grading system for invasive pulmonary adenocarcinoma: a proposal from the International Association for the Study of Lung Cancer Pathology Committee. J. Thorac. Oncol. 15, 1599–1610 (2020).
Article PubMed PubMed Central Google Scholar
Boland, J. M., Wampfler, J. A., Yang, P. & Yi, E. S. Growth pattern-based grading of pulmonary adenocarcinoma—analysis of 534 cases with comparison between observers and survival analysis. Lung Cancer 109, 14–20 (2017).
Article CAS PubMed Google Scholar
Thunnissen, E. et al. Reproducibility of histopathological subtypes and invasion in pulmonary adenocarcinoma. An international interobserver study. Mod. Pathol. 25, 1574–1583 (2012).
Article PubMed PubMed Central Google Scholar
Deng, C. et al. Validation of the novel International Association for the Study of Lung Cancer grading system for invasive pulmonary adenocarcinoma and association with common driver mutations. J. Thorac. Oncol. 16, 1684–1693 (2021).
Article CAS PubMed Google Scholar
Tavernari, D. et al. Nongenetic evolution drives lung adenocarcinoma spatial heterogeneity and progression. Cancer Discov. 11, 1490–1507 (2021).
Article CAS PubMed Google Scholar
Deshmukh, G. et al. FEEDNet: a feature enhanced encoder–decoder LSTM network for nuclei instance segmentation for histopathological diagnosis. Phys. Med. Biol. 67, https://doi.org/10.1088/1361-6560/ac8594 (2022)
Coudray, N. et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat. Med. 24, 1559–1567 (2018).
Article CAS PubMed PubMed Central Google Scholar
van Rijthoven, M., Balkenhol, M., Siliņa, K., van der Laak, J. & Ciompi, F. HookNet: multi-resolution convolutional neural networks for semantic segmentation in histopathology whole-slide images. Med. Image Anal. 68, 101890 (2021).
Article PubMed Google Scholar
Bulten, W. et al. Automated deep-learning system for Gleason grading of prostate cancer using biopsies: a diagnostic study. Lancet Oncol. 21, 233–241 (2020).
Article PubMed Google Scholar
Wang, Y. et al. Improved breast cancer histological grading using deep learning. Ann. Oncol. 33, 89–98 (2022).
Article CAS PubMed Google Scholar
Gertych, A. et al. Convolutional neural networks can accurately distinguish four histologic growth patterns of lung adenocarcinoma in digital slides. Sci. Rep. 9, 1483 (2019).
Article ADS PubMed PubMed Central Google Scholar
Wei, J. W. et al. Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks. Sci. Rep. 9, 3358 (2019).
Article ADS PubMed PubMed Central Google Scholar
Pan, X. et al. in Computational Mathematics Modeling in Cancer Analysis 78–90 (Springer, 2022).
Zhao, H., Shi, J., Qi, X., Wang, X. & Jia, J. Pyramid scene parsing network. In Proc. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2881–2890 (IEEE, 2017).
Lin, G., Milan, A., Shen, C. & Reid, I. RefineNet: multi-path refinement networks for high-resolution semantic segmentation. In Proc. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 1925–1934 (IEEE, 2017).
Oktay, O. et al. Attention U-Net: learning where to look for the pancreas. Preprint at arXiv https://doi.org/10.48550/arXiv.1804.03999 (2018).
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F. & Adam, H. Encoder–decoder with atrous separable convolution for semantic image segmentation. In Computer Vision – ECCV 2018 (eds. Ferrari, V. et al.) 833–851 (2018).
Fu, J. et al. Dual attention network for scene segmentation. In Proc. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 3146–3154 (IEEE, 2019).
Valanarasu, J. M. J., Oza, P., Hacihaliloglu, I. & Patel, V. M. Medical Transformer: gated axial-attention for medical image segmentation. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021 (eds. de Bruijne, M. et al.) 36–46 (2021).
Yoshizawa, A. et al. Impact of proposed IASLC/ATS/ERS classification of lung adenocarcinoma: prognostic subgroups and implications for further revision of staging based on analysis of 514 stage I cases. Mod. Pathol. 24, 653–664 (2011).
Article CAS PubMed Google Scholar
Karasaki, T. et al. Evolutionary characterization of lung adenocarcinoma morphology in TRACERx. Nat. Med. 29, 833–845 (2023).
Article CAS PubMed PubMed Central Google Scholar
Moore, D. A. et al. In situ growth in early lung adenocarcinoma may represent precursor growth or invasive clone outgrowth—a clinically relevant distinction. Mod. Pathol. 32, 1095–1105 (2019).
Article CAS PubMed Google Scholar
Thunnissen, E. et al. Elastin in pulmonary pathology: relevance in tumours with a lepidic or papillary appearance. A comprehensive understanding from a morphological viewpoint. Histopathology 80, 457–467 (2022).
Article PubMed Google Scholar
Thunnissen, E. et al. Defining morphologic features of invasion in pulmonary nonmucinous adenocarcinoma with lepidic growth: a proposal by the International Association for the Study of Lung Cancer Pathology Committee. J. Thorac. Oncol. 18, 447–462 (2023).
Article PubMed Google Scholar
Frankell, A. M. et al. The evolution of lung cancer and impact of subclonal selection in TRACERx. Nature 616, 525–533 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
WHO Classification of Tumours: Thoracic Tumours (WHO, 2021).
Jamal-Hanjani, M. et al. Tracking the evolution of non-small-cell lung cancer. N. Engl. J. Med. 376, 2109–2121 (2017).
Article CAS PubMed Google Scholar
AbdulJabbar, K. et al. Geospatial immune variability illuminates differential evolution of lung adenocarcinoma. Nat. Med. 26, 1054–1062 (2020).
Article CAS PubMed PubMed Central Google Scholar
Collisson, E. A. et al. Comprehensive molecular profiling of lung adenocarcinoma. Nature 511, 543–550 (2014).
Article ADS CAS Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proc. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778 (IEEE, 2016).
Mitchell, A. & Griffin, L. S. The ESRI Guide to GIS Analysis, Vol. 2. Spatial Measurements and Statistics (ESRI, 2020).
Uno, H., Cai, T., Pencina, M. J., D’Agostino, R. B. & Wei, L. J. On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Stat. Med. 30, 1105–1117 (2011).
Article MathSciNet PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was funded by a Cancer Research UK Career Establishment Award to Y.Y. (C45982/A21808). The TRACERx study (ClinicalTrials.gov registration: NCT01888601) is sponsored by University College London (UCL) (UCL/12/0279) and has been approved by an independent research ethics committee (13/LO/1546). TRACERx is funded by Cancer Research UK (C11496/A17786) and coordinated through the Cancer Research UK and UCL Cancer Trials Centre. Y.Y. acknowledges additional support from Breast Cancer Now (2015NovPR638), the Children’s Cancer and Leukaemia Group (CCLGA201906), National Institutes of Health (NIH) grant nos. U54 CA217376 and R01 CA185138, Congressionally Directed Medical Research Programs, Breast Cancer Research Program Award no. BC132057, European Commission Innovative Training Network (H2020-MSCA-ITN-2019), Wellcome Trust (105104/Z/14/Z) and the Royal Marsden/Institute of Cancer Research National Institute of Health Research Biomedical Research Centre. C.S. is a Royal Society Napier Research Professor (RSRP\R\210001). This work was supported by the Francis Crick Institute, which receives its core funding from Cancer Research UK (CC2041), the UK Medical Research Council (MRC) (CC2041) and the Wellcome Trust (CC2041). For the purpose of open access, the authors have applied a CC BY public copyright license to any author’s accepted manuscript version arising from this submission. C.S. is funded by Cancer Research UK (TRACERx, C11496/A17786), PEACE (C416/A21999) and Cancer Research UK Cancer Immunotherapy Catalyst Network; Cancer Research UK Lung Cancer Centre of Excellence (C11496/A30025); the Rosetrees Trust; the Butterfield and Stoneygate Trust; the NovoNordisk Foundation (ID16584); a Royal Society Professorship Enhancement Award (RP/EA/180007); the National Institute for Health Research University College London Hospitals Biomedical Research Centre; the Cancer Research UK-University College London Centre; the Experimental Cancer Medicine Centre; the Breast Cancer Research Foundation (BCRF-22-157); a Cancer Research UK Early Detection an Diagnosis Primer Award (grant no. EDDPMA-Nov21/100034); and a Mark Foundation for Cancer Research Aspire Award (grant no. 21-029-ASP). Research grants are administered by the American Association for Cancer Research, the scientific partner of SU2C. C.S. is in receipt of an European Research Council (ERC) Advanced Grant (PROTEUS) under the European Union’s Horizon 2020 research and innovation program (grant no. 835297). We thank L. Officer-Jones, C. Ficken and M. Das from the MRC Toxicology Unit Core Histology facility for their expert technical assistance. Figure 1a and Extended Data Figs. 1c and 3b,c were first published in Pan et al.¹⁴.

Author information

Xiaoxi Pan & Yinyin Yuan
Present address: Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Juvenal Baena
Present address: AstraZeneca Computational Pathology, Munich, Germany
These authors contributed equally: Khalid AbdulJabbar, Jose Coelho-Lima.
These authors jointly supervised this work: Yinyin Yuan, John Le Quesne, David A. Moore.

Authors and Affiliations

Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
Xiaoxi Pan, Khalid AbdulJabbar, Anca-Ioana Grapa, Hanyun Zhang & Yinyin Yuan
Division of Molecular Pathology, The Institute of Cancer Research, London, UK
Xiaoxi Pan, Khalid AbdulJabbar, Anca-Ioana Grapa, Hanyun Zhang & Yinyin Yuan
Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
Jose Coelho-Lima & Sarah J. Aitken
Department of Histopathology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
Jose Coelho-Lima & Sarah J. Aitken
Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, UK
Alvin Ho Kwan Cheung, Takahiro Karasaki, Selvaraju Veeriah, Gareth A. Wilson, Rachel Rosenthal, Andrew Rowan, Chris Bailey, Claudia Lee, Emma Colliver, Katey S. S. Enfield, Mark S. Hill, Mihaela Angelova, Oriol Pich, Michelle Leung, Alexander M. Frankell, Crispin T. Hiley, Emilia L. Lim, Haoran Zhai, Maise Al Bakir, Nicolai J. Birkbak, Olivia Lucas, Ariana Huebner, Clare Puttick, Kristiana Grigoriadis, Michelle Dietzen, Dhruva Biswas, Foteini Athanasopoulou, Sophia Ward, Jayant K. Rane, Thomas B. K. Watkins, Charles Swanton & David A. Moore
Leicester Cancer Research Centre, University of Leicester, Leicester, UK
Juvenal Baena, Claire Rachel Wilson & Claire Wilson
Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK
Takahiro Karasaki, Selvaraju Veeriah, Mariam Jamal-Hanjani, Michelle Leung, Alexander M. Frankell, Crispin T. Hiley, Emilia L. Lim, Haoran Zhai, Maise Al Bakir, Nicolai J. Birkbak, Olivia Lucas, Ariana Huebner, Clare Puttick, Kristiana Grigoriadis, Michelle Dietzen, Dhruva Biswas, Foteini Athanasopoulou, Sophia Ward, Antonia Toncheva, Christopher Abbosh, Corentin Richard, Cristina Naceur-Lombardelli, Francisco Gimeno-Valiente, Krupa Thakkar, Mariana Werner Sunderland, Monica Sivakumar, Nnennaya Kanu, Paulina Prymas, Sadegh Saghafinia, Sharon Vanloo, Jie Min Lam, Wing Kin Liu, Abigail Bunkum, Sonya Hessey, Simone Zaccaria, Carlos Martínez-Ruiz, James R. M. Black, Kerstin Thol, Robert Bentham, Kevin Litchfield, Nicholas McGranahan, Sergio A. Quezada, Martin D. Forster, Siow Ming Lee, Charles Swanton & David A. Moore
Hope Against Cancer and Leicester Experimental Cancer Medicine Centre, Leicester, UK
Claire Rachel Wilson
Institute for Lung Health, NIHR Leicester Biomedical Research Centre, Leicester, UK
Marco Sereno
Cancer Research UK & UCL Cancer Trials Centre, London, UK
Allan Hackshaw, Anne-Marie Hacker, Abigail Sharp, Sean Smith, Harjot Kaur Dhanda, Kitty Chan, Camilla Pilotti & Rachel Leslie
Department of Histopathology, Royal Brompton and Harefield Hospitals, Guy’s and St Thomas’ NHS Foundation Trust, London, UK
Andrew G. Nicholson
National Heart and Lung Institute, Imperial College London, London, UK
Andrew G. Nicholson
Cancer Metastasis Laboratory, University College London Cancer Institute, London, UK
Mariam Jamal-Hanjani, Jie Min Lam, Wing Kin Liu, Abigail Bunkum & Sonya Hessey
Department of Medical Oncology, University College London Hospitals NHS Foundation Trust, London, UK
Mariam Jamal-Hanjani & Charles Swanton
Molecular Pathology, School of Cancer Sciences, University of Glasgow, Glasgow, UK
John Le Quesne & John Le Quesne
Cancer Research UK Beatson Institute of Cancer Research, Glasgow, UK
John Le Quesne & John Le Quesne
NHS Greater Glasgow and Clyde, Glasgow, UK
John Le Quesne, Craig Dick & John Le Quesne
Department of Cellular Pathology, University College London Hospitals, London, UK
Teresa Marafioti, Elaine Borg, Mary Falzon, Reena Khiroya & David A. Moore
Lungs for Living Research Centre, UCL Respiratory, Department of Medicine, University College London, London, UK
Sam M. Janes
Independent Cancer Patient’s Voice, London, UK
David Chuter & Mairead MacKenzie
University Hospital Southampton NHS Foundation Trust, Southampton, UK
Serena Chee & Aiman Alzetani
Academic Division of Thoracic Surgery, Imperial College London, London, UK
Eric Lim
Royal Brompton and Harefield Hospitals, Guy’s and St Thomas’ NHS Foundation Trust, London, UK
Eric Lim, Paulo De Sousa, Simon Jordan, Alexandra Rice, Hilgardt Raubenheimer, Harshil Bhayani, Lyn Ambrose, Anand Devaraj, Hema Chavan, Sofina Begum, Silviu I. Buderi, Daniel Kaniu, Mpho Malima, Sarah Booth, Nadia Fernandes, Pratibha Shah & Chiara Proli
Royal Surrey Hospital, Royal Surrey Hospitals NHS Foundation Trust, Guildford, UK
Madeleine Hewish
University of Surrey, Guildford, UK
Madeleine Hewish
Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, UK
Sarah Danson
Liverpool Heart and Chest Hospital, Liverpool, UK
Michael J. Shackcloth
Princess Alexandra Hospital, The Princess Alexandra Hospital NHS Trust, Harlow, UK
Lily Robinson & Peter Russell
School of Cancer Sciences, University of Glasgow, Glasgow, UK
Kevin G. Blyth
Beatson Institute for Cancer Research, University of Glasgow, Glasgow, UK
Kevin G. Blyth
Queen Elizabeth University Hospital, Glasgow, UK
Kevin G. Blyth
Institute of Infection, Immunity & Inflammation, University of Glasgow, Glasgow, UK
Andrew Kidd
Golden Jubilee National Hospital, Clydebank, UK
Alan Kirk, Mo Asif, Rocco Bilancia, Nikos Kostoulas & Mathew Thomas
Singleton Hospital, Swansea Bay University Health Board, Swansea, UK
Jason F. Lester
University Hospitals of Leicester NHS Trust, Leicester, UK
Amrita Bajaj, Apostolos Nakas, Azmina Sodha-Ramdeen, Mohamad Tufail, Molly Scotland, Rebecca Boyles, Sridhar Rathinam & Dean A. Fennell
University of Leicester, Leicester, UK
Dean A. Fennell, Domenic Marrone & Sean Dulloo
Cancer Research Centre, University of Leicester, Leicester, UK
Gurdeep Matharu, Jacqui A. Shaw, Joan Riley & Lindsay Primrose
Royal Free London NHS Foundation Trust, London, UK
Ekaterini Boleti
Aberdeen Royal Infirmary NHS Grampian, Aberdeen, UK
Heather Cheyne, Mohammed Khalil, Shirley Richardson & Tracey Cruickshank
Department of Medical Oncology, Aberdeen Royal Infirmary NHS Grampian, Aberdeen, UK
Gillian Price
University of Aberdeen, Aberdeen, UK
Gillian Price & Keith M. Kerr
Department of Pathology, Aberdeen Royal Infirmary NHS Grampian, Aberdeen, UK
Keith M. Kerr
Department of Oncology, University College London Hospitals, London, UK
Sarah Benafif, Dionysis Papadatos-Pastos, James Wilson, Tanya Ahmad, Jie Min Lam, Martin D. Forster & Siow Ming Lee
The Whittington Hospital NHS Trust, London, UK
Jack French & Kayleigh Gilbert
Birmingham Acute Care Research Group, Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
Babu Naidu
University Hospital Birmingham NHS Foundation Trust, Birmingham, UK
Akshay J. Patel, Aya Osman, Christer Lacson, Gerald Langman, Helen Shackleford, Madava Djearaman & Gary Middleton
Artificial Intelligence in Medicine AIM Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
Salma Kadiri & Hugo J. W. L. Aerts
Institute of Immunology and Immunotherapy, University of Birmingham, Birmingham, UK
Gary Middleton
Manchester Cancer Research Centre Biobank, Manchester, UK
Angela Leek, Jack Davies Hodgkinson & Nicola Totten
Wythenshawe Hospital, Manchester University NHS Foundation Trust, Manchester, UK
Angeles Montero, Elaine Smith, Eustace Fontaine, Felice Granato, Juliette Novasio, Kendadai Rammohan, Leena Joseph, Paul Bishop, Rajesh Shah, Stuart Moss, Vijay Joshi & Philip Crosbie
Division of Infection, Immunity and Respiratory Medicine, University of Manchester, Manchester, UK
Philip Crosbie
Cancer Research UK Lung Cancer Centre of Excellence, University of Manchester, Manchester, UK
Philip Crosbie, Anshuman Chaturvedi, Lynsey Priest, Pedro Oliveira, Alexandra Clipson, Jonathan Tugwood, Alastair Kerr, Dominic G. Rothwell & Caroline Dive
Manchester University NHS Foundation Trust, Manchester, UK
Antonio Paiva-Correia
The Christie NHS Foundation Trust, Manchester, UK
Anshuman Chaturvedi, Lynsey Priest, Pedro Oliveira, Fabio Gomes, Kate Brown & Mathew Carter
Division of Cancer Sciences, The University of Manchester and The Christie NHS Foundation Trust, Manchester, UK
Colin R. Lindsay, Fiona H. Blackhall, Matthew G. Krebs & Yvonne Summers
Cancer Research UK Manchester Institute Cancer Biomarker Centre, University of Manchester, Manchester, UK
Alexandra Clipson, Jonathan Tugwood, Alastair Kerr, Dominic G. Rothwell & Caroline Dive
Department of Radiation Oncology, Brigham and Women’s Hospital, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
Hugo J. W. L. Aerts
Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, the Netherlands
Hugo J. W. L. Aerts
Institute for Computational Cancer Biology, Center for Integrated Oncology, Cancer Research Center Cologne Essen, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
Roland F. Schwarz
Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
Roland F. Schwarz & Tom L. Kaufmann
Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
Tom L. Kaufmann
Department of Genetics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Peter Van Loo
Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Peter Van Loo
Cancer Genomics Laboratory, The Francis Crick Institute, London, UK
Peter Van Loo, Jonas Demeulemeester, Carla Castignani & Elizabeth Larose Cadieux
Cancer Genome Evolution Research Group, Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK
Michelle Leung, Ariana Huebner, Clare Puttick, Kristiana Grigoriadis, Michelle Dietzen, Thomas Patrick Jones, Carlos Martínez-Ruiz, James R. M. Black, Kerstin Thol, Robert Bentham & Nicholas McGranahan
Department of Molecular Medicine, Aarhus University Hospital, Aarhus, Denmark
Nicolai J. Birkbak, Judit Kisistok & Mateo Sokac
Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
Nicolai J. Birkbak, Judit Kisistok & Mateo Sokac
Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
Nicolai J. Birkbak, Judit Kisistok & Mateo Sokac
Computational Cancer Genomics Research Group, University College London Cancer Institute, London, UK
Olivia Lucas, Abigail Bunkum, Sonya Hessey & Simone Zaccaria
University College London Hospitals, London, UK
Olivia Lucas, Emilie Martinoni Hoogenboom, Fleur Monk, James W. Holding, Junaid Choudhary, Kunal Bhakhri, Marco Scarci, Pat Gorman, Robert C. M. Stephens, Yien Ning Sophia Wong, Zoltan Kaplar & Steve Bandula
Bill Lyons Informatics Centre, University College London Cancer Institute, London, UK
Dhruva Biswas & Javier Herrero
Advanced Sequencing Facility, The Francis Crick Institute, London, UK
Foteini Athanasopoulou, Sophia Ward & Jerome Nicod
Integrative Cancer Genomics Laboratory, Department of Oncology, KU Leuven, Leuven, Belgium
Jonas Demeulemeester
VIB-KU Leuven Center for Cancer Biology, Leuven, Belgium
Jonas Demeulemeester
Medical Genomics, University College London Cancer Institute, London, UK
Carla Castignani, Elizabeth Larose Cadieux, Stephan Beck & Miljana Tanic
Danish Cancer Society Research Center, Copenhagen, Denmark
Zoltan Szallasi & Miklos Diossy
Computational Health Informatics Program, Boston Children’s Hospital, Boston, MA, USA
Zoltan Szallasi & Miklos Diossy
Department of Bioinformatics, Semmelweis University, Budapest, Hungary
Zoltan Szallasi
Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Budapest, Hungary
Miklos Diossy
Department of Pathology, Ziekenhuis aan de Stroom Hospitals, Antwerp, Belgium
Roberto Salgado
Division of Research, Peter MacCallum Cancer Centre, Melbourne, Victoria, Australia
Roberto Salgado
The Francis Crick Institute, London, UK
Aengus Stewart, Alastair Magness, Clare E. Weeden, Dina Levi, Eva Grönroos, Imran Noorani, Jacki Goldman, Mickael Escudero, Philip Hobson, Roberto Vendramin, Stefan Boeing, Tamara Denner, Vittorio Barbè, Wei-Ting Lu, William Hill, Yutaka Naito, Zoe Ramsden & George Kassiotis
Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
George Kassiotis
University College London Cancer Institute, London, UK
Angela Dwornik, Angeliki Karamani, Benny Chain, David R. Pearce, Despoina Karagianni, Felip Gálvez-Cancino, Georgia Stavrou, Gerasimos Mastrokalos, Helen L. Lowe, Ignacio Garcia Matos, James L. Reading, John A. Hartley, Kayalvizhi Selvaraju, Kezhong Chen, Leah Ensell, Mansi Shah, Maria Litovchenko, Olga Chervova, Piotr Pawlik, Robert E. Hynds, Samuel Gamble, Seng Kuong Anakin Ung, Supreet Kaur Bola, Victoria Spanswick, Yin Wu, Othman Al-Sawaf, Jayant K. Rane & Thomas B. K. Watkins
Department I of Internal Medicine, University Hospital of Cologne, Cologne, Germany
Othman Al-Sawaf
Experimental Oncology, Institute for Oncology and Radiology of Serbia, Belgrade, Serbia
Miljana Tanic
Tumour Immunogenomics and Immunosurveillance Laboratory, University College London Cancer Institute, London, UK
Kevin Litchfield
Immune Regulation and Tumour Immunotherapy Group, Cancer Immunology Unit, Research Department of Haematology, University College London Cancer Institute, London, UK
Sergio A. Quezada
Experimental Histopathology, The Francis Crick Institute, London, UK
Emma Nye & Richard Kevin Stone
Department of Haematology, University College London Hospitals, London, UK
Karl S. Peggs
Cancer Immunology Unit, Research Department of Haematology, University College London Cancer Institute, London, UK
Karl S. Peggs
Retroviral Immunology Group, The Francis Crick Institute, London, UK
Kevin W. Ng
Department of Molecular Oncology and Immunology, The Netherlands Cancer Institute, Amsterdam, the Netherlands
Krijn Dijkstra
Oncode Institute, Utrecht, the Netherlands
Krijn Dijkstra
Bioinformatics and Systems Biology, Method Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
Matthew R. Huska
Department of Medical Physics and Biomedical Engineering, Centre for Medical Image Computing, London, UK
Catarina Veiga
Department of Medical Physics and Bioengineering, University College London Cancer Institute, London, UK
Gary Royle
Department of Medical Physics and Biomedical Engineering, University College London, London, UK
Charles-Antoine Collins-Fekete
Institute of Nuclear Medicine, Division of Medicine, University College London, London, UK
Francesco Fraioli
Institute of Structural and Molecular Biology, University College London, London, UK
Paul Ashford
Department of Radiology, University College London Hospitals, London, UK
Alexander James Procter, Asia Ahmed, Magali N. Taylor & Arjun Nair
University College London Respiratory, Department of Medicine, University College London, London, UK
Arjun Nair
Department of Thoracic Surgery, University College London Hospital NHS Trust, London, UK
David Lawrence & Davide Patrini
Lungs for Living Research Centre, University College London Respiratory, University College London, London, UK
Neal Navani & Ricky M. Thakrar
Department of Thoracic Medicine, University College London Hospitals, London, UK
Neal Navani & Ricky M. Thakrar

Authors

Xiaoxi Pan
View author publications
You can also search for this author in PubMed Google Scholar
Khalid AbdulJabbar
View author publications
You can also search for this author in PubMed Google Scholar
Jose Coelho-Lima
View author publications
You can also search for this author in PubMed Google Scholar
Anca-Ioana Grapa
View author publications
You can also search for this author in PubMed Google Scholar
Hanyun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Alvin Ho Kwan Cheung
View author publications
You can also search for this author in PubMed Google Scholar
Juvenal Baena
View author publications
You can also search for this author in PubMed Google Scholar
Takahiro Karasaki
View author publications
You can also search for this author in PubMed Google Scholar
Claire Rachel Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Marco Sereno
View author publications
You can also search for this author in PubMed Google Scholar
Selvaraju Veeriah
View author publications
You can also search for this author in PubMed Google Scholar
Sarah J. Aitken
View author publications
You can also search for this author in PubMed Google Scholar
Allan Hackshaw
View author publications
You can also search for this author in PubMed Google Scholar
Andrew G. Nicholson
View author publications
You can also search for this author in PubMed Google Scholar
Mariam Jamal-Hanjani
View author publications
You can also search for this author in PubMed Google Scholar
Charles Swanton
View author publications
You can also search for this author in PubMed Google Scholar
Yinyin Yuan
View author publications
You can also search for this author in PubMed Google Scholar
John Le Quesne
View author publications
You can also search for this author in PubMed Google Scholar
David A. Moore
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

TRACERx Consortium

Charles Swanton
, Mariam Jamal-Hanjani
, Hanyun Zhang
, Khalid AbdulJabbar
, Xiaoxi Pan
, Yinyin Yuan
, Allan Hackshaw
, John Le Quesne
, Selvaraju Veeriah
, Takahiro Karasaki
, Sam M. Janes
, Anne-Marie Hacker
, Abigail Sharp
, Sean Smith
, Harjot Kaur Dhanda
, Kitty Chan
, Camilla Pilotti
, Rachel Leslie
, Anca-Ioana Grapa
, David Chuter
, Mairead MacKenzie
, Serena Chee
, Aiman Alzetani
, Eric Lim
, Paulo De Sousa
, Simon Jordan
, Alexandra Rice
, Hilgardt Raubenheimer
, Harshil Bhayani
, Lyn Ambrose
, Anand Devaraj
, Hema Chavan
, Sofina Begum
, Silviu I. Buderi
, Daniel Kaniu
, Mpho Malima
, Sarah Booth
, Nadia Fernandes
, Pratibha Shah
, Chiara Proli
, Madeleine Hewish
, Sarah Danson
, Michael J. Shackcloth
, Lily Robinson
, Peter Russell
, Kevin G. Blyth
, Andrew Kidd
, Alan Kirk
, Mo Asif
, Rocco Bilancia
, Nikos Kostoulas
, Mathew Thomas
, Andrew G. Nicholson
, Craig Dick
, Jason F. Lester
, Amrita Bajaj
, Apostolos Nakas
, Azmina Sodha-Ramdeen
, Mohamad Tufail
, Molly Scotland
, Rebecca Boyles
, Sridhar Rathinam
, Dean A. Fennell
, Claire Wilson
, Domenic Marrone
, Sean Dulloo
, Gurdeep Matharu
, Jacqui A. Shaw
, Joan Riley
, Lindsay Primrose
, Ekaterini Boleti
, Heather Cheyne
, Mohammed Khalil
, Shirley Richardson
, Tracey Cruickshank
, Gillian Price
, Keith M. Kerr
, Sarah Benafif
, Dionysis Papadatos-Pastos
, James Wilson
, Tanya Ahmad
, Jack French
, Kayleigh Gilbert
, Babu Naidu
, Akshay J. Patel
, Aya Osman
, Christer Lacson
, Gerald Langman
, Helen Shackleford
, Madava Djearaman
, Salma Kadiri
, Gary Middleton
, Angela Leek
, Jack Davies Hodgkinson
, Nicola Totten
, Angeles Montero
, Elaine Smith
, Eustace Fontaine
, Felice Granato
, Juliette Novasio
, Kendadai Rammohan
, Leena Joseph
, Paul Bishop
, Rajesh Shah
, Stuart Moss
, Vijay Joshi
, Philip Crosbie
, Antonio Paiva-Correia
, Anshuman Chaturvedi
, Lynsey Priest
, Pedro Oliveira
, Fabio Gomes
, Kate Brown
, Mathew Carter
, Colin R. Lindsay
, Fiona H. Blackhall
, Matthew G. Krebs
, Yvonne Summers
, Alexandra Clipson
, Jonathan Tugwood
, Alastair Kerr
, Dominic G. Rothwell
, Caroline Dive
, Hugo J. W. L. Aerts
, Roland F. Schwarz
, Tom L. Kaufmann
, Peter Van Loo
, Gareth A. Wilson
, Rachel Rosenthal
, Andrew Rowan
, Chris Bailey
, Claudia Lee
, Emma Colliver
, Katey S. S. Enfield
, Mark S. Hill
, Mihaela Angelova
, Oriol Pich
, Michelle Leung
, Alexander M. Frankell
, Crispin T. Hiley
, Emilia L. Lim
, Haoran Zhai
, Maise Al Bakir
, Nicolai J. Birkbak
, Olivia Lucas
, Ariana Huebner
, Clare Puttick
, Kristiana Grigoriadis
, Michelle Dietzen
, David A. Moore
, Dhruva Biswas
, Foteini Athanasopoulou
, Sophia Ward
, Jonas Demeulemeester
, Carla Castignani
, Elizabeth Larose Cadieux
, Judit Kisistok
, Mateo Sokac
, Zoltan Szallasi
, Miklos Diossy
, Roberto Salgado
, Aengus Stewart
, Alastair Magness
, Clare E. Weeden
, Dina Levi
, Eva Grönroos
, Imran Noorani
, Jacki Goldman
, Mickael Escudero
, Philip Hobson
, Roberto Vendramin
, Stefan Boeing
, Tamara Denner
, Vittorio Barbè
, Wei-Ting Lu
, William Hill
, Yutaka Naito
, Zoe Ramsden
, George Kassiotis
, Angela Dwornik
, Angeliki Karamani
, Benny Chain
, David R. Pearce
, Despoina Karagianni
, Felip Gálvez-Cancino
, Georgia Stavrou
, Gerasimos Mastrokalos
, Helen L. Lowe
, Ignacio Garcia Matos
, James L. Reading
, John A. Hartley
, Kayalvizhi Selvaraju
, Kezhong Chen
, Leah Ensell
, Mansi Shah
, Maria Litovchenko
, Olga Chervova
, Piotr Pawlik
, Robert E. Hynds
, Samuel Gamble
, Seng Kuong Anakin Ung
, Supreet Kaur Bola
, Victoria Spanswick
, Yin Wu
, Othman Al-Sawaf
, Thomas Patrick Jones
, Stephan Beck
, Miljana Tanic
, Teresa Marafioti
, Elaine Borg
, Mary Falzon
, Reena Khiroya
, Antonia Toncheva
, Christopher Abbosh
, Corentin Richard
, Cristina Naceur-Lombardelli
, Francisco Gimeno-Valiente
, Krupa Thakkar
, Mariana Werner Sunderland
, Monica Sivakumar
, Nnennaya Kanu
, Paulina Prymas
, Sadegh Saghafinia
, Sharon Vanloo
, Jie Min Lam
, Wing Kin Liu
, Abigail Bunkum
, Sonya Hessey
, Simone Zaccaria
, Carlos Martínez-Ruiz
, James R. M. Black
, Kerstin Thol
, Robert Bentham
, Kevin Litchfield
, Nicholas McGranahan
, Sergio A. Quezada
, Martin D. Forster
, Siow Ming Lee
, Javier Herrero
, Emma Nye
, Richard Kevin Stone
, Jerome Nicod
, Jayant K. Rane
, Karl S. Peggs
, Kevin W. Ng
, Krijn Dijkstra
, Matthew R. Huska
, Emilie Martinoni Hoogenboom
, Fleur Monk
, James W. Holding
, Junaid Choudhary
, Kunal Bhakhri
, Marco Scarci
, Pat Gorman
, Robert C. M. Stephens
, Yien Ning Sophia Wong
, Zoltan Kaplar
, Steve Bandula
, Thomas B. K. Watkins
, Catarina Veiga
, Gary Royle
, Charles-Antoine Collins-Fekete
, Francesco Fraioli
, Paul Ashford
, Alexander James Procter
, Asia Ahmed
, Magali N. Taylor
, Arjun Nair
, David Lawrence
, Davide Patrini
, Neal Navani
& Ricky M. Thakrar

Contributions

X.P. collated the pathological annotations and the data from the cohorts, developed the AI method and performed the analyses. K.A. collated the pathological annotations and the data from the LATTICe-A cohort. J.C.-L. reviewed the TCGA cohort and helped write the manuscript. A.-I.G. collated the pathological annotations and helped write the manuscript. H.Z. conducted the comparison of the different AI methods. D.A.M., A.H.K.C. and J.L.Q. provided the pathological annotations. J.L.Q., D.A.M. and J.B. reviewed the LATTICe-A cohort. J.L.Q., M.S. and C.R.W. provided data and advice for the LATTICe-A cohort. T.K. collated the clinicopathological data of the TRACERx 421 cohort and helped analyze the data. S.V. performed the histology sample generation and digitized the H&E slides for the TRACERx 421 cohort. A.H. performed the survival analyses for the TRACERx 421 cohort and helped analyze the data. C.S. and M.J.-H. provided clinical expertise and oversight of the TRACERx study. J.L.Q., D.A.M. S.J.A. and A.G.N. provided histopathological expertise and helped write the manuscript. D.A.M. led the central pathology review and collated the pathology data for the TRACERx 421 cohort. J.L.Q. led the central pathology review and collated the pathology data for the LATTICe-A cohort. Y.Y., J.L.Q. and D.A.M. jointly conceived and supervised the study. X.P., K.A. and Y.Y. wrote the manuscript with input from all authors.

Corresponding authors

Correspondence to Yinyin Yuan, John Le Quesne or David A. Moore.

Ethics declarations

Competing interests

S.V. is a coinventor to a patent of methods for detecting molecules in a sample (patent no. 10578620). A.H. has received fees from Abbvie, Almirall, Boehringer Ingelheim, Clovis Oncology, Ipsen, Takeda Pharmaceuticals, AstraZeneca, Daiichi Sankyo, Merck Serono, Merck/MSD, UCB, Kyowa Kirin, Servier, Sobi, Pfizer and Roche for delivering general education and training in clinical trials; has received fees for member of independent data monitoring committees for Roche-sponsored clinical trials and academic projects on real-world evidence or tumor-agnostic therapies coordinated by Roche; he has been paid honoraria for speaking at Roche-funded conferences (on real-world data); he has an academic collaboration with Navio and is an unpaid member of their advisory board; he is an investigator for an academic study (SUMMIT) sponsored by UCL, which is funded by GRAIL; he has received one honorarium for an advisory board meeting for GRAIL; he has received a consulting fee from Evidera (for one GRAIL-initiated project); and he has previously owned shares in Illumina and Thermo Fisher Scientific (sold in 2020); he is on the scientific advisory board for Adela Bio and has received no payments or honoraria for this, although he has share options available. A.G.N. reports personal fees from Merck, Boehringer Ingelheim, Novartis, AstraZeneca, Bristol Myers Squibb, Roche, Abbvie, Oncologica, Uptodate, the European Society of Oncology, Takeda Pharmaceuticals, Sanofi and Liberium, as well as personal fees and grants from Pfizer. M.J-H. is a Cancer Research UK Career Establishment Awardee and has received funding from Cancer Research UK, the International Association for the Study of Lung Cancer and International Lung Cancer Foundation, the Lung Cancer Research Foundation, the Rosetrees Trust, UK and Ireland Neuroendocrine Tumour Society, the National Institute for Health Research (NIHR) and the NIHR UCLH Biomedical Research Centre. M.J-H. has consulted for, and is a member of, the Achilles Therapeutics Scientific advisory board and steering committee, has received speaker honoraria from Pfizer, Astex Pharmaceuticals and Oslo Cancer Cluster, and holds a patent (no. PCT/US2017/028013) relating to methods for lung cancer detection. C.S. acknowledges grant support from AstraZeneca, Boehringer Ingelheim, Bristol Myers Squibb, Pfizer, Roche-Ventana, Invitae (previously Archer Dx, collaboration in minimal residual disease sequencing technologies) and Ono Pharmaceutical. He is an AstraZeneca advisory board member and chief investigator for the AZ MeRmaiD 1 and 2 clinical trials; he is also co-chief investigator of the NHS Galleri trial funded by GRAIL and a paid member of GRAIL’s scientific advisory board. He receives consultant fees from Achilles Therapeutics (scientific advisory board member), Bicycle Therapeutics (scientific advisory board), Genentech, Medicxi, Roche Innovation Centre-Shanghai, Metabomed (until July 2022) and the Sarah Cannon Research Institute. C.S. has received honoraria from Amgen, AstraZeneca, Pfizer, Novartis, GlaxoSmithKline, MSD, Bristol Myers Squibb, Illumina and Roche-Ventana. C.S. had stock options in Apogen Biotechnologies and GRAIL until June 2021; he currently has stock options in Epic Bioscience, Bicycle Therapeutics; he has stock options and is a cofounder of Achilles Therapeutics. C.S. holds patents relating to assay technology to detect tumor recurrence (no. PCT/GB2017/053289), target neoantigens (no. PCT/EP2016/059401), identify patent response to immune checkpoint blockade (no. PCT/EP2016/071471), determine HLA loss of heterozygosity (no. PCT/GB2018/052004), predict survival rates of patients with cancer (no. PCT/GB2020/050221) and identify patients who respond to cancer treatment (no. PCT/GB2018/051912), as well as a US patent related to detecting tumor mutations (no. PCT/US2017/28013) and methods for lung cancer detection (no. US20190106751A1), and both European and US patents related to identifying insertion and deletion mutation targets (no. PCT/GB2018/051892). Y.Y. has received speaker’s bureau honoraria from Roche and consulted for Merck. D.A.M. reports speaker fees from Eli Lilly, AstraZeneca and Takeda Pharmaceuticals, consultancy fees from AstraZeneca, Thermo Fisher Scientific, Takeda Pharmaceuticals, Amgen, Janssen, MIM Software, Bristol Myers Squibb and Eli Lilly, and has received educational support from Takeda Pharmaceuticals and Amgen. All other authors declare no competing interests.

Peer review

Peer review information

Nature Cancer thanks Lynette Sholl and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Precise pathological annotations for training and sub-modules of the developed deep learning model (ANORAK).

a. Examples illustrating morphologically distinct growth patterns in lung adenocarcinoma. b. Distribution of annotations regarding the number of patches and pixels. c. Detailed architectures of sub-modules developed for the AI method.

Extended Data Fig. 2 Segmentation performance.

a,b. Segmentations generated by AI at low-power and high-power resolutions, deposited in 10.6084/m9.figshare.24599796.

Extended Data Fig. 3 Segmentation performance and intra- and inter-comparisons.

a. Segmentations generated by AI at low-power and high-power resolutions, deposited in 10.6084/m9.figshare.24599796. b. Comparison of segmentation and prediction performance for ablation experiments. c. Comparison of segmentation and prediction performance with other methods.

Extended Data Fig. 4 Inter-pathologists comparison for predominant pattern and IASLC grading in LATTICe-A.

a. Interobserver agreement of each pattern. b, c. Interobserver agreement of predominant pattern and IASLC grading at tumor level. d. Growth pattern intra-tumoral heterogeneity substantially contributed to the discrepancy between pathologists (n = 845 each, P1 < 2.22 × 10⁻¹⁶, P2 = 4.323× 10⁻¹³, P3 = 1.589 × 10⁻¹⁵). P value was calculated using a two-sided Wilcoxon rank-sum test and not adjusted for the multiple comparisons. The median value is indicated by a thick horizontal line; the first and third quartiles are represented by box edges; whiskers indicate 1.5 times interquartile range. e. Interobserver agreement of each grade.

Extended Data Fig. 5 Survival analyses of AI and pathological gradings.

a. Pair-wise comparison of AI grades in univariable and multivariable Cox proportional hazards models. b–d. Multivariable Cox regression analyses showing pathological gradings were independent of age, sex, tumor stage, smoking pack-years, adjuvant therapy, type of surgery in LATTICe-A (P1 = 0.00524, P2 = 0.000913, P3 = 0.0169). HRs of each variable with 95% confidence intervals are shown on the horizontal axis; P value was derived with Wald test. Asterisks indicate: *P < 0.05, **P < 0.01, ***P < 0.001. e. Comparison of improvements driven by AI and additional manual scoring for stage I (n = 337) and stage I-III (n = 729) tumors in LATTICe-A, where models included age, sex, tumor stage and gradings from AI or/and pathologists. C-indexes with 95% confidence intervals are shown on the vertical axis.

Source data

Extended Data Fig. 6 Assistance of AI in grading challenging scenarios for stage I-III tumors in LATTICe-A.

a. Challenging scenario 1, tumors with highly diversified growth patterns indicated by the Shannon diversity index (n1 = 363, n2 = 361, n3 = 390). b. Challenging scenario 2, differentiation between lepidic- and acinar-predominant tumors (n1 = 274, n2 = 222, n3 = 340), and between lepidic- and papillary-predominant tumors (n1 = 162, n2 = 134, n3 = 137). c. Challenging scenario 3, tumors with high-grade patterns between 5% and 30% (n1 = 162, n2 = 117, n3 = 252). d. Challenging scenario 4, tumors with no less than 4 slides (n = 551). C-indexes of each variable with 95% confidence intervals are shown on the vertical axis.

Source data

Extended Data Fig. 7 Morphological and spatial analyses of acinar island.

a. Acinar morphological feature measures, area and solidity index. b. Acinar islands are morphologically different among tumors with different predominant patterns (TRACERx 421, P = 1.493 × 10⁻⁹ and P = 0.0005932, n = 173; LATTICe-A, P < 2.22 × 10⁻¹⁶ and P = 2.626 × 10⁻¹⁰, n = 654). P value was calculated using a one-way Kruskal-Wallis rank-sum test and not adjusted for the multiple comparisons. c. Acinar island areas were less varied in lepidic-predominant (TRACERx 421, P = 0.002889, n = 108; LATTICe-A, P = 7.743 × 10⁻⁹, n = 420) and high-grade-predominant (TRACERx 421, P = 7.617 × 10⁻⁸, n = 157; LATTICe-A, P = 1.611 × 10⁻¹⁵, n = 593) tumors than acinar- and papillary-predominant tumors. d. Acinar island shapes were less varied in high-grade-predominant tumors than lepidic predominant tumors (TRACERx 421, P = 6.374 × 10⁻⁶, n = 81; LATTICe-A, P = 8.184 × 10⁻¹⁶, n = 295). b-d. Each point is a tumor, y axis is the standard deviation of the area or solidity index for all the individual acinar islands within a tumor. The median value is indicated by a thick horizontal line; the first and third quartiles are represented by box edges; whiskers indicate 1.5 times interquartile range. c-d. P value was calculated using a two-sided Wilcoxon rank-sum test and not adjusted for the multiple comparisons. e. Example illustrating the transition from acinar to cribriform. f. Examples of high and low acinar scattering inferred from H&E images with the AI method, deposited in 10.6084/m9.figshare.24599796.

Source data

Extended Data Fig. 8 Acinar scattering stratifying subgroups of AI grading.

a. Acinar scattering stratifying patients at AI grade 1 (TRACERx 421 P = 0.5112, n = 12; LATTICe-A P = 0.5397, n = 55). b. Acinar scattering stratifying patients at AI grade 2 (TRACERx 421 P = 0.0533, n = 56; LATTICe-A P = 1.947 × 10⁻⁵, n = 212). c. Acinar scattering stratifying patients at AI grade 3 (TRACERx 421 P = 0.04235, n = 137; LATTICe-A P = 0.007446, n = 570). d. Acinar scattering stratifying patients at AI grades 1&2 (TRACERx 421 P = 0.02517, n = 68; LATTICe-A P = 1.387 × 10⁻⁵, n = 267). HRs of each variable with 95% confidence intervals are shown on the horizontal axis. P value was derived with Wald test, and not adjusted for multiple comparisons. Asterisks indicate: *P < 0.05, **P < 0.01, ***P < 0.001.

Source data

Supplementary information

Supplementary Information

Supplementary Figs. 1–7.

Reporting Summary

Supplementary Table

Supplementary Tables 1–6.

Supplementary Data 1

Source Data for Supplementary Figs. 1–7. Reproducing acinar morphology by using 50% of acinar islands per tumor and repeating ten times.

Source data

Source Data Figs. 1, 2–5 and Extended Data Figs. 5–8

The source data analyzed in this study are provided in a single file, with named tabs for each cohort. These data are used across Figs. 1b and 2–5, and Extended Data Figs. 5–8.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pan, X., AbdulJabbar, K., Coelho-Lima, J. et al. The artificial intelligence-based model ANORAK improves histopathological grading of lung adenocarcinoma. Nat Cancer 5, 347–363 (2024). https://doi.org/10.1038/s43018-023-00694-w

Download citation

Received: 06 December 2022
Accepted: 21 November 2023
Published: 10 January 2024
Issue Date: February 2024
DOI: https://doi.org/10.1038/s43018-023-00694-w

Subjects

Abstract

Similar content being viewed by others

Main

Results

A hierarchical AI model for growth pattern quantification

AI grading consistently improves patient risk stratification

Assisting pathologists in challenging scenarios

Acinar morphology and spatial heterogeneity

Discussion

Methods

Study cohorts

Deep learning model architecture

Implementation and evaluation

Growth pattern and grading inference

Agreement between AI and pathological scores with regard to predominant patterns

C-index measuring prognostic ability

Acinar morphological features

Acinar scattering score

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

TRACERx Consortium

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links