Skip to main content

Proteomic analysis identifies subgroups of patients with active systemic lupus erythematosus

Abstract

Objective

Systemic lupus erythematosus (SLE) is a clinically and biologically heterogenous autoimmune disease. We aimed to investigate the plasma proteome of patients with active SLE to identify novel subgroups, or endotypes, of patients.

Method

Plasma was collected from patients with active SLE who were enrolled in the British Isles Lupus Assessment Group Biologics Registry (BILAG-BR). The plasma proteome was analysed using a data-independent acquisition method, Sequential Window Acquisition of All theoretical mass spectra mass spectrometry (SWATH-MS). Unsupervised, data-driven clustering algorithms were used to delineate groups of patients with a shared proteomic profile.

Results

In 223 patients, six clusters were identified based on quantification of 581 proteins. Between the clusters, there were significant differences in age (p = 0.012) and ethnicity (p = 0.003). There was increased musculoskeletal disease activity in cluster 1 (C1), 19/27 (70.4%) (p = 0.002) and renal activity in cluster 6 (C6) 15/24 (62.5%) (p = 0.051). Anti-SSa/Ro was the only autoantibody that significantly differed between clusters (p = 0.017). C1 was associated with p21-activated kinases (PAK) and Phospholipase C (PLC) signalling. Within C1 there were two sub-clusters (C1A and C1B) defined by 49 proteins related to cytoskeletal protein binding. C2 and C6 demonstrated opposite Rho family GTPase and Rho GDI signalling. Three proteins (MZB1, SND1 and AGL) identified in C6 increased the classification of active renal disease although this did not reach statistical significance (p = 0.0617).

Conclusions

Unsupervised proteomic analysis identifies clusters of patients with active SLE, that are associated with clinical and serological features, which may facilitate biomarker discovery. The observed proteomic heterogeneity further supports the need for a personalised approach to treatment in SLE.

Introduction

Systemic lupus erythematosus (SLE) is an inflammatory autoimmune rheumatic disease with significant morbidity and mortality [1]; it is heterogeneous in both clinical features and treatment response. Although several new treatments for SLE have been developed, there is an important unmet need for disease biomarkers to measure active disease and predict treatment response.

The clinical heterogeneity of SLE may reflect different underlying cellular and molecular processes. Despite a better understanding of mechanisms that contribute to disease such as increased B cell activity and type 1 interferon production, which have led to more targeted novel therapies, the overall response rates to these therapies in clinical trials are typically around 40–60% [2, 3]. A more personalised approach to therapy may increase response rates. For example, a post hoc analysis of randomised controlled trials using belimumab (BEL), an anti-BAFF monoclonal antibody, identified increased efficacy in patients with elevated anti-dsDNA antibody or low C3/4 complement, suggesting different patient subgroups may have different treatment response [4]. Common biomarkers have variable sensitivity and specificity for active disease (approximately 62% and 93% for anti-dsDNA antibodies, 75% and 71% for C3 complements, and 48% and 71% for C4 complements) [5]. The UK Medical Research Council (MRC) Precision Medicine Consortium ‘Maximising SLE Therapeutic Potential by the Application of Novel and Stratified Approaches’ (MASTERPLANS) aimed to identify novel markers to predict treatment response. An important first step was to identify biomarkers which may associate with specific clinical features of SLE.

Proteomic analysis using mass-spectrometry (MS) has been applied in numerous immune-mediated inflammatory diseases including Rheumatoid arthritis (RA) [6], Sjogren’s syndrome [7] and Systemic sclerosis [8]. Although proteomic MS studies in SLE to date have been modest in size, a systematic review by Nicolaou et al. identified 241 potential biomarkers from 25 studies using gel electrophoresis or liquid chromatography tandem MS across numerous sample types [9, 10] within different manifestations of SLE such as nephritis, neuropsychiatric and cutaneous disease. Of note, Annexin A2 which was observed in three studies included in the review, was not detected at a high level in our study and thus excluded from analysis.

Sequential window acquisition of all theoretical mass spectra (SWATH-MS) is a data independent acquisition (DIA) MS technique. It has advantages over data-dependant acquisition (DDA) and targeted acquisition (TA) methods as it employs non-selective analysis of peptides and their fragments. These data are then compared to a spectral library of protein fragments to deconvolve the complex signal into relative quantities of peptides and proteins across hundreds of samples [11]. DIA has high reproducibility and can detect peptides in the order of tens of thousands compared to TA, where typically hundreds of peptides are measured in a directed non-discovery approach [12]. Due to its unbiased nature and ability to identify and quantify peptides at the proteome-scale; SWATH-MS is a powerful method for biomarker discovery.

The aim of the study was to use SWATH-MS to determine whether analysis of the plasma proteome can identify discrete subgroups in a large cohort of SLE patients with highly active disease.

Methods

Study population

We included patients with SLE fulfilling the 1997 Updated American College of Rheumatology (ACR) or the Systemic Lupus International Collaborating Clinics (SLICC) 2012 classification criteria registered with the BILAG-Biologics Registry (BILAG-BR) [13]. All had active disease and due to commence rituximab (RTX), belimumab (BEL) or mycophenolate mofetil (MMF). Patients commencing RTX or BEL needed to have sufficiently active disease to satisfy the NHS England 2013 Interim Clinical Commissioning Policy and National Institute for Health and Care Excellence (NICE) criteria respectively [14, 15].

Baseline plasma samples were obtained from study patients within a window of maximum 30 days prior to and a maximum of five days after, receiving escalating treatment with BEL, RTX or MMF.

Baseline disease activity was measured using BILAG-2004 index [16] and SLEDAI-2000 (SLEDAI-2 K) [17]. Active disease in each domain was defined as BILAG-2004 A or B score. Lupus-related damage was measured using the ACR/SLICC damage index (SDI) [18]. Measurement of C3 and C4 levels were conducted at recruiting sites with low C3/C4 status determined by local laboratory reference ranges. Proteinuria was measured at the local site by spot urine protein-creatinine ratio (uPCR) or albumin-creatinine ratio (uACR). Values for uPCR were converted to uACR using the method described by Sumida et al. [19].

Antibody status

All sera were tested for autoantibodies at the same centre by immunoprecipitation of proteins from radiolabelled cell lines, followed by PAGE separation and identification of recognised autoantigens (e.g. Ro60 [SS-A], U1RNP/Sm, La [SS-B]) by autoradiography as described elsewhere [20]. Anti-Ro52 was measured using the ABNOVA SS-A 52 Ab ELISA Kit and anti-dsDNA using the Inova Diagnostics Quanta Lite® dsDNA SC ELISA kit.

Sample preparation

Plasma analysis was performed using Sequential Window Acquisition of All Theoretical Mass Spectra (SWATH-MS). Briefly, 10uL of plasma was depleted of highly abundant proteins (albumin, IgG, transferrin, fibrinogen, IgA, α-2-macroglobulin, α-1-antitrypsin, IgM, haptoglobin, α1-acid glycoprotein, apolipoprotein A-I and apolipoprotein A-II), then concentrated through centrifugal filtration and added to digestion buffer of ammonium bicarbonate, yielding a final volume of 80–100 uL. Proteins were reduced and solubilised with dithiothreitol and sodium deoxycholate then alkylated with iodoacetamide and digested with trypsin. The sample was recovered through centrifugation and peptides were lyophilised from recovered supernatant by vacuum centrifugation (MiVac Quattro Concentrator, SP Scientific US) and stored at − 80 °C until use (details in Additional file 1). Samples were analysed by SWATH-MS with a micro-flow LC–MS system comprising an Eksigent nanoLC 400 autosampler and an Eksigent nanoLC 425 pump coupled to a Sciex 6600 Triple-TOF mass spectrometer with a DuoSpray Ion Source. The system was controlled by Analyst software v1.7.1 and eksigent control software v4.2. This spectral library was generated on the basis of a previously published plasma protein library [21] but updated to be compatible with a 100 variable window acquisition method employed in the Stoller Centre. Full MS methodology is included in the Additional file 1.

SWATH maps were aligned with TRIC (msproteomicstools version 0.4.3) feature alignment algorithm. The aligned openSWATH maps were processed with MSstats to infer protein-level quantification based on aligned transition-level quantitative information. With this technique, proteins in low quantities, below the threshold of detection within the plasma, were designated as absent. Full methodology for alignment is included in the Additional file 1.

Data analysis

Hierarchical clustering was used to cluster patients based on their proteomic profile. The Average linkage method was applied. The optimal number of clusters was determined using the R package, nbclust [22]. The elbow method was used to visually confirm the number of clusters, and the final number of clusters resolved by majority rule of 30 different validation indices (see Additional file 1). Initial data visualisation was performed on untransformed data using t-distributed stochastic neighbour embedding (t-SNE) with a perplexity of 11 (approximately 5% of the study population) [23]. For heat map visualisation, protein expression levels were standardised to Z scores (where if expression level = x, Z score calculated as (x – mean)/standard deviation).

Differential protein levels between clusters were performed with the proteome of a single cluster compared to the remaining proteome as a “combined cluster”, i.e., Cluster 1 vs all remaining. The comparison was performed using a linear model with multiple t-tests (R package limma) and an adjusted-P < 0.05 (Benjamini-Hochberg) was considered statistically significant. Canonical pathways and functional protein association networks were visualised using String (v.11.0, URL: https://string-db.org/) [24] and Qiagen Ingenuity Pathway Analysis (IPA) [25]. Pathways with Z score of < − 1 or > 1 was considered significant.

Clinical variables across clusters were analysed using non-parametric tests; Chi square with Fisher’s exact test and Kruskal–Wallis as appropriate. Those features which are considered increased/or decreased between clusters are reported based on descriptive numerical differences. In the regression models, finite mixture models were used to allocate proteins with a clear bimodal distribution into undetectable/low/high groups. Other proteins were considered binary (undetectable/detectable). An adaptive lasso regression model (with sample split into training and validation datasets at a ratio of 3:1) was used to identify proteins with a non-zero coefficient with lambda selected via cross-validation. Analyses were performed using R v.4.0.3, STATA v16.0 SE and SPSS v.26.

Results

Baseline patient characteristics

Plasma samples were analysed from 223 patients, 198 (88.8%) were females with median (IQR) age and disease duration of 40 (30, 51) and 10 (6, 17) years respectively. At baseline, 204 (91.5%) had at least one BILAG A and/or two BILAG B scores. Of these, active disease (BILAG 2004 A or B score) was predominantly in mucocutaneous (107, 51.2%), musculoskeletal (95, 45.5%) and renal (79, 37.4%) domains. Almost 40% had elevated anti-dsDNA antibodies (87/223 [39%]) and/or low C3 or C4 complement (100/223 [44.8%]).

Regarding treatment, 136/223 (61%) were taking regular oral corticosteroids with a median (IQR) dose of 10 (5, 14) mg/day. Almost half were taking an anti-malarial (AM) at the time of plasma acquisition (110/223 [49.3%]) although 202/223 (90.6%) had ever taken an AM and 77 (34.5%) were taking mycophenolate mofetil. The cohort demographics are shown in Table 1.

Table 1 Patient characteristics

Cluster analysis

A total of 894 proteins were quantified by SWATH-MS. After removing proteins which were only detectable in < 25% of all samples, 588 remained. Seven more were removed due to high relative levels (> 20) with low variance (< 1), resulting in 581 proteins for analysis.

The plurality (11/30) of clustering algorithms suggested the data could be split into six clusters. Clusters were created using hierarchical clustering and visualised with t-SNE (Fig. 1). The largest cluster comprised 65 patients (29.1% of the patient cohort; cluster 4, C4) followed by 27 patients (12.1% of the cohort, cluster 1, C1), 27 patients (12.1% of the cohort, cluster 5, C5), 36 patients (16.1%, both cluster 2 and 3, C2 and C3) and the smallest comprised 24 patients (10.7% of the cohort; cluster 6, C6).

Fig. 1
figure 1

Cluster analysis of proteins in 223 patients with active SLE. A t-distributed stochastic neighbour embedding (t-SNE) plot of the 6 clusters following hierarchal clustering. Each point represents a single patient allocated to one of n = 6 clusters based on the plasma proteome alone. B Heatmap of the 6 clusters showing only those proteins which were significantly different in at least 1 cluster. Colour shows the Z-score with increased levels in red and decreased levels in blue. C Canonical pathways that are predicted to be activated or supressed in more than one cluster. The arrows show the direction (up = activation, down = suppression)

Antibody status and age significantly differs between clusters

The youngest patients were in C6 (median age 30.5 [19, 40.25] years) and the oldest in C3 (45 [39.25, 50.5] years) (p = 0.012 between clusters). C6 had the least proportion of Caucasian patients (7/24 [29.2%]); for further description of ethnic groups see Additional file 1. Between clusters, there was no statistical difference in disease duration or SDI scores (Table 2).

Table 2 Comparison of patient characteristics between the 6 proteomic clusters

There was a significant difference in anti-SSA/Ro antibody positivity between clusters (p = 0.017), C4 had the highest frequency of anti-SSA/Ro (29/65 [44.6%]) and the lowest frequency was in C2 (4/36 [11.1%]). Similarly, anti-Ro60 was highest in C4, (28/65 [43.1%]) and lowest in C2 (3/36 [8.3%]) (p = 0.008). In contrast, whilst there was no significant difference between clusters for anti-Ro52 (K-Wallis, p = 0.059), a direct comparison of C4 (19/63 [30.2%]) with the remaining clusters combined (23/152 [15.1%)) was statistically significant (p = 0.014). There was no significant difference in other autoantibodies or presence of low C3/C4 complement. Concomitant immunosuppressant or corticosteroid use did not differ across clusters.

Differences in disease activity between clusters

Marked variations in the proportion of patients with active musculoskeletal disease (BILAG A or B score) were found between clusters (p = 0.002); ranging from 19/27 (70.4%) in C1 to 5/24 (20.8%) in C6. Conversely, there was no significant difference between clusters (p = 0.177) in the number of patients with inflammatory arthritis fulfilling the 1997 ACR criterion.

Similarly, BILAG 2004 scores in renal disease was highest in C6 (15/24 [62.5%]) and lowest in C2 (9/36 [25%]) although the proportion of patients with renal disease was not statistically significant between clusters (Chi-2, p = 0.051). The comparison of C6 with the other clusters combined, identified a significant difference (15/23 ([65.2%]) vs. 64/188 [41.2%] respectively, p = 0.005). Although baseline creatinine was significantly different between clusters, it was numerically equal lowest in C6 (60 [52,72] and C3 (60 [53, 70] and highest in C2 (75 [58, 94]) (K-Wallis between clusters p = 0.002).

In terms of ACR criteria, there was also a significant difference in patient numbers that satisfied the renal domain; numerically greatest in C6, (16/24 [66.7%]) and lowest in C2, (9/36 [25%]) (p = 0.017). Oral ulcers were numerically higher in C1 (20/27 [70.1%]) and lower in C5 (14/35 [40%]) (p = 0.036).

Canonical pathways and network analysis

To investigate relevant protein networks and pathways that underpin each cluster, we identified proteins that were significantly different (higher or lower) in each cluster compared to all other clusters combined, using a linear model adjusted for multiple comparisons. The number of proteins with an adjusted p-value of < 0.05 was: 118 in C1, 67 in C2, 110 in C3, 24 in C4, 49 in C5 and 14 in C6.

We performed pathways analysis using each of the 6 sets of proteins in turn. Pathways with Z score ≥ 1 or Z ≤ − 1 were considered relevant. C1, C2, C3 and C6 each had unique over-represented pathways which were not observed in other clusters (Table 3). The number of these unique pathways varied between clusters and was greatest in C1 (17 unique pathways) and lowest in C6 (one pathway). A total of 18 canonical pathways were common to two or more clusters (Fig. 1C). Notably, the RhoGDI signalling pathway was over-represented in 4 clusters but it was decreased in C2, C4 and C5 and increased in C6.

Table 3 Unique canonical pathways in each cluster

As C1 and C6 were associated with increased frequency of musculoskeletal and renal disease respectively, we performed further analyses on proteins present within these clusters.

Further stratification of Cluster 1

As MSK disease was over-represented in C1, we aimed to identify which individual proteins were increased/decreased in C1 and whether we could identify key biological pathways. We identified a total of 118 proteins which were significantly different in C1 compared to the other 5 clusters (Fig. 2A), and network analysis of these 118 proteins identified a number of central nodes (Fig. 2B). Furthermore, within the t-SNE plot (Fig. 1A), the C1 cluster appeared to form 2 distinct subclusters which we arbitrarily designated C1A as the cluster closest to C4 (purple) and the other as C1B (Fig. 1B). On evaluation of the heatmap of C1, a subgroup of 49 proteins within C1 was identified as being different between sub-clusters 1A and 1B (Fig. 3A). Of these, 47/49 (96.0%) were numerically higher in C1B, driving this group towards C3 and C6 where they were also increased (the 2 numerically lower proteins were transketolase [TKT] and coagulation factor XII). In GO analysis, the top five pathways which were differentially represented in C1A and C1B were “actin binding”, “cytoskeletal protein binding”, “calcium ion binding” and “Receptor for Advanced Glycation End products (RAGE) receptor binding” and “protein binding”. Network analysis (Fig. 3B) suggested that Cofilin-1 (CFL1), was a central node in the proteins which defined C1B compared to C1A. Seven of the 49 proteins were S100 proteins including S100A8, S100A9, S100A11 and S100A12 (see Additional file 1). After correction for multiple testing (t-test with Holm-Sidak correction) 2 proteins were significantly higher in C1B than C1A: S100A4 and Transaldolase (TALDO1). We found no differences in other disease features, medications, or serology between clusters C1A and C1B (data not shown).

Fig. 2
figure 2

Proteins in C1. A Volcano plot showing the individual proteins that are significantly different in C1 compared to the other clusters. The x-axis shows log-fold change and the y-axis − Log10 adjusted P value. Proteins in red are those with a log fold change of < − 1 or > 1 and adjusted P value < 0.05. B Network analysis of the proteins significantly different in cluster 1 compared to other clusters

Fig. 3
figure 3

Subcluster analysis of C1A and C1B. A Heatmap of proteins significantly different between C1 and the remaining clusters identifies 49 proteins that are different between C1A and C1B. B Network analysis of these 49 proteins demonstrating CFL1, LCP1 and CAP1 as central nodes

Association between plasma proteins and renal disease

As C6 had over-representation of patients with renal disease, we aimed to identify whether the proteins driving C6 belonged to one or more biological pathways. Of the 14 significantly different proteins in renal disease enriched cluster, C6, 3 were higher and 11 were lower compared to other clusters (Fig. 4A), network analysis did not identify a central node nor a distinct relationship between several nodes (Fig. 4B). Most canonical pathways related to C6 were predicted to be downregulated except for “Rho GDI signalling” and “SLE in T cell signalling pathway”. The only canonical pathway unique to C6 was “Integrin signalling” (for details of the pathway analysis related to C6 see the Additional file 1). Several canonical pathways, including Rac/Rho/Cdc42 pathways, were shared between C2 and C6 but in opposing directions.

Fig. 4
figure 4

Proteins in C6. A Volcano plot showing the individual proteins that are significantly different in C6 compared to the other clusters. Proteins in red are those with a log fold change of < − 1 or > 1 and adjusted P value < 0.05. B Network analysis of the proteins significantly different in cluster 1 compared to other clusters

We then wanted to know whether any of these 14 proteins were associated with renal disease beyond those patients in C6 after accounting for important confounders. We used an adaptive lasso regression model to select those proteins from C6 which were associated with active renal disease (BILAG A or B) across the whole cohort. In the model, three proteins (Staphylococcal nuclease domain-containing protein 1 [SND1], glycogen debranching enzyme [AGL] and marginal zone B- and B1-cell-specific protein [MZB1]) were selected at lambda 57; higher levels of AGL and lower levels of SND1 and MZB1 were associated with active renal disease. In these models, MZB1 was considered binary (detectable/undetectable) whilst SND1 and AGL were ordinal (high/low/undetectable). Multivariable logistic regression models of active renal disease were constructed with age, gender, ethnicity, low C3 and/or C4 complement and high anti-dsDNA as covariates. The AUC for this model was 0.7346, increasing to 0.7784 when the three proteins were added as covariates (p = 0.0617). Similarly, the AUC for a model of biopsy-proven lupus nephritis increased from 0.8115 to 0.86 (p = 0.0830) (Table 4). Adding the three proteins to a model of proteinuria did not improve the AUC.

Table 4 Logistic regression model for active renal disease across the whole cohort

Discussion

Using an unbiased approach, we identified 6 proteomic endotypes in a cohort of patients with active SLE (Fig. 5). These clusters were associated with some clinical or serological features, notably inflammatory arthritis, renal disease and anti-Ro/SSA antibodies. It should be noted however, that the tests for statistical significance compare values across all 6 clusters and our reporting of over or under-represented disease features is descriptive based on numerical values. We also identified 3 proteins, (Staphylococcal nuclease domain-containing protein 1 (SND1), glycogen debranching enzyme (AGL) and marginal zone B- and B1-cell-specific protein (MZB1) that were associated with the presence of active renal disease.

Fig. 5
figure 5

Summary of the 6 proteomic clusters. Key clinical and serological differences in each of the 6 clusters, ns = not statistically significant across all clusters

In a previous study by Idborg et al. [26], 281 proteins were measured by antibody suspension bead array with the proteins selected based on existing published candidate biomarkers, microarray data and LC–MS data. Using generalised linear models, higher levels of interferon regulating factor 5 (IRF5), solute carrier family 22 member 2 (SLC22A2) and S100 calcium binding protein A12 (S100A12) were identified in SLE patients compared to healthy matched controls; in our data the S100A12 pathway was increased in our C1B subcluster. Using unsupervised clustering, Idborg identified 3 clusters of patients characterised by rheumatoid factor (RF)-IgM, and high or low levels of IRF5; the low IRF5 group closely resembled healthy controls. As might be expected, the RF-high group had increased frequency of anti-Ro/SSA and anti-La/SSB antibodies and a reduced frequency of nephritis. Associations between these molecular subgroups and other clinical features of SLE are not reported.

In our study there was over-representation of active musculoskeletal disease in C1, and a reciprocal under-representation in C6. Interestingly, there was no difference in number of patients fulfilling the 1997 ACR arthritis criterion suggesting that the protein signature is related to active MSK disease, or possibly non-arthritis MSK disease (e.g. myositis). The over-represented pathways in C1 included intracellular proteins associated with regulation of cell cytoskeleton. The P21 activated kinase (PAK) pathway was predicted to be increased in C1 and decreased in C6. Increased PAK signalling has been observed in fibroblast-like synoviocytes (FLS) and associated with joint damage in RA patients [27].

Data visualisation suggested that C1 comprised two distinct sub clusters (1A and 1B) suggesting that 2 molecular subtypes may exist within patients with active MSK disease. The splitting of C1 into these sub-clusters appeared to be due to differences in 49 proteins which have roles in actin binding, cytoskeletal protein binding, calcium ion binding, RAGE receptor binding and protein binding. Cofilin-1 (CFL1) was a central node in the network analysis and has roles in actin and cytoskeletal protein binding. CFL1, which is stimulated by TNFα and GM-CSF, disassembles actin filaments during cellular replication, facilitating FLS migration in RA patients [28]. In a small study by Ooka et al., anti-CFL-1 antibodies were found in 6.3% of patients with SLE but also patients with RA, Behcet’s disease and myositis [29] suggesting that CFL-1 may be an autoantigen common to several inflammatory diseases. Proteins in the S100 family also differed between C1A and C1B. These proteins act as damage-associated molecular patterns (DAMPs) and plasma levels are increased in both adult and childhood-onset SLE, especially those with active disease [30, 31] Whilst increased S100 proteins are associated with lupus nephritis [32], our study suggests that they may also be relevant to patients with lupus arthritis, supporting observations in other forms of inflammatory arthritis [33]. S100A4 was significantly increased in C1B and in RA, S100A4 is expressed in synovium and induces the expression of matrix metalloproteinases [34]. Transaldolase was also increased in cluster C1B. In RA, monocytes have increased expression of Transaldolase which has been proposed to protect RA monocytes from apoptosis, increasing the pool of activated monocytes in inflamed synovium [35].

Cluster 6 had the greatest proportion of renal disease (lowest in C2); these patients were younger and more likely to be non-Caucasian, representing the typical demographic of lupus nephritis [36]. In our data, the integrin signalling pathway was predicted to be reduced in C6, although the role of integrins in SLE remains to be elucidated. Mutations in the Integrin Subunit Alpha M (ITGAM) gene which encodes the CD11b chain of the Mac-1 integrin is a risk factor for SLE [37], the exact mechanisms by which changes in CD11b drives inflammation but has been proposed to be through TLR inhibition of cytokine production [38]. In our pathway analysis, the “integrin signalling” pathway contained the proteins integrin αM, β1 and β3, but not the β2 integrin, which is associated with the development of SLE [39] Although these studies do not explore the clinical variation, one study in 2009 by Yang et al., observed a strong association in lupus nephritis with the mutation, and of note, also observed a higher incidence of arthritis in the absence of the mutation [40]. More research to identify the roles of these integrin subunits in lupus nephritis is required.

Kwon et al. [41], performed MS on urinary samples of SLE patients with and without lupus nephritis compared to HC. They identified an increase in 143 and 67 proteins in patients with SLE without nephritis and with nephritis respectively. They did not perform clustering analysis on their cohort; however, identified 23 common proteins between the 2 SLE groups compared to HC, five of these (ORM1, antithrombin‐III [SERPINC1], ceruloplasmin, haemoglobin subunit beta [HBB] and delta [HBD]) were significantly upregulated in patients with lupus nephritis. We did not detect ORM1 in our data, but the other 4 proteins were not significantly increased in C6 compared to the rest of the cohort, and thus were not included in our lasso model. In a small study, serum Annexin 2 as measured by ELISA, was associated with proliferative lupus nephritis but not membranous disease [42]. In our study, Annexin 2 was detected in fewer than 25% of samples and thus excluded from further analysis. In our data we do not have details of the subtypes of lupus renal disease which limited any subgroup analyses of patients with lupus nephritis.

Only three of the significantly different proteins in C6 had increased levels: filamin-C (FLNC), tyrosine-protein kinase CSK (CSK) and glycogen debranching enzyme (AGL). CSK is associated with autoimmunity as higher levels in early stages of B cell maturation increase the number auto-reactive B cells and autoantibody production [43] and contrary to our study, was significantly downregulated in a pilot proteomic study by Zhou et al. [44], which had a cohort of SLE mostly enriched with cutaneous disease, the observed reduced CSK expression may reflect the differing treatment paradigms of skin versus systemic disease but may also reflect the variation in disease pathogenesis of SLE. Among the proteins with reduced expression, marginal zone B- and B1-cell-specific (MZB1) and platelet endothelial cell adhesion molecule (PECAM-1) are associated with SLE.

PECAM-1 is an important regulator of B cell development and B cell receptor activation. In murine models, PECAM-1 deficiency leads to B cell hyper-responsiveness and autoantibody formation [45]. It has been noted to be elevated in both the urine and serum of patients with SLE [46, 47]. Increased PECAM-1 in the serum of patients with SLE may also be modulated by a greater proportion of metabolic syndrome risk factors including increased age, Body Mass Index (BMI) and waist circumference which may increase PECAM-1 levels. Plasma levels of PECAM-1 were not measured, but leakage of PECAM-1 from plasma into the urine in patients with proteinuria may explain the lower levels that we observed in our study. Further studies with contemporaneous measurement of PECAM-1 in plasma and urine should be considered in patients with lupus nephritis.

MZB1 is implicated in B cell antibody production with elevated levels reported in lymph nodes of lupus patients [48]. MBZ1 is also expressed in plasmocytic dendritic cells and regulates IFNα production, a key cytokine in SLE [49]. In our study, plasma levels of MZB1 were paradoxically lower in C6 although this may again be due to increased excretion via the kidney. To identify whether these biomarkers may have clinical utility, and to control for important confounders such as ethnicity, we developed models of active renal disease using proteins identified from C6. A combination of three proteins improved accuracy of the model to identify patients with active nephritis. Although replication of these findings in an independent cohort is needed, this confirms that data reduction using cluster analysis followed by variable selection is a valid approach for biomarker discovery in patients with active SLE.

The Rac/Rho/Cdc42 pathways, which are associated with cell motility, were differentially modulated in C2 and C6, and the role of these proteins in plasma warrants further investigation. The Rho/Rho kinase pathway is implicated in the pathogenesis of lupus as Rho kinase inhibitors ameliorate SLE disease activity in murine models [50, 51]. The RhoA-Rho kinase pathway is also implicated in B cell activation and survival [52]. If dysregulation of this pathway, reduces B cells activation or survival, this could explain the lower levels of autoantibodies in C2. Between the clusters, there was also a significant difference in number of patients with anti-SSA/Ro antibodies; highest in C4 and lowest in C2. This observation was clearer for Ro60 isotype than Ro52, although this may reflect insufficient power to detect differences between groups. Interestingly, there was no increase in clinical features often associated with anti-SSA/Ro antibodies including mucocutaneous, MSK, neurological or pulmonary disease in C4 compared to other clusters (nor reduced frequency of these features in C2) [53].

A strength of SWATH-MS is the compilation of large spectral libraries containing all known peptide fragments with good reproducibility [12]. Given this, SWATH-MS has been utilised in drug discovery and biomarker identification in gastroenterology, oncology and cardiology [54,55,56,57]. SWATH-MS has been used to identify biomarkers in other rheumatic diseases including SS [58] and osteoarthritis [59]. In the study of SS by Cecchettini et al., differentially expressed inflammatory proteins in the saliva of patients with primary SS and HC, were used perform gene ontology analysis which identified some biological processes which were also noted in our study including gluconeogenesis (increased in C1) and protein kinase A signalling (increased in C2) [58].

A strength this study design is the timing of patient plasma collection, which allowed us to capture the proteome profile of high disease activity patients. We found no differences in drug treatment between clusters, suggesting the protein signatures are the result of the disease, rather than drug effect, although longitudinal studies are important to validate these observations.

An important limitation is that only active SLE patients were included in this study, and therefore we cannot determine how the plasma proteome would compare to patients with inactive SLE or HC. As this is also a cross sectional study, it would also be important to perform a longitudinal study with HC to further assess the clinical associations of identified proteins. A study in 2022 by Zhang et al. explored the metabolic profile of 21 HC, 52 SLE patients and 43 LN patients using Ultra high-performance liquid/gas chromatography-tandem mass spectrometry (UPL/GC–MS/MS) [60]. This study discovered 28 differential metabolites, five of which were discriminatory for LN from SLE and significantly associated with urea, creatinine, Cystatin C and C1q, not observed in healthy controls and reiterating the importance of a control group for validation.

Furthermore, as most patients recruited into BILAG-BR, are patients with refractory disease requiring biological therapy, patients with early or naïve disease are not represented in this study, noting that the activity scores used to stratify patients are for purposes of disease monitoring and making treatment decisions, but may not reflect disease biology. The lack of significant differences in SDI and disease duration between clusters at least implies that the patients are relatively homogeneous in terms active and severe disease.

As each of the 6 clusters were relatively small, this study may lack power to identify differences in some clinical or serological features. Similarly, as the most frequent active organ domains were mucocutaneous, MSK and renal, we may lack power to identify differences in other domains such as neuropsychiatric disease. Importantly, SWATH-MS is not suited to detecting low level proteins such as cytokines and chemokines and future studies including multiplex cytokine/chemokine data should be integrated into proteomic analysis. Interestingly, in our data there was a lower-than-expected proportion of patients taking AM, which may reflect the refractory stage of disease in many patients within the BILAG-BR or incomplete data capture; the proportion of ever used AM was more than 90%. Importantly, we saw no difference in AM use between clusters considering ‘current’, or ‘ever use’ was considered.

Conclusion

In conclusion, SWATH-MS is a valid method for identifying proteomic differences in patients with SLE and can identify proteins which may be useful biomarkers for features of active disease, notably MSK and renal involvement. The pathways and proteins identified may serve as potential biomarkers and/or therapeutic targets and investigation of their role in SLE pathogenesis is warranted.

Availability of data and materials

The datasets used during the current study may be available from the corresponding author on reasonable request providing anonymisation can be maintained.

Abbreviations

ACR:

American College of Rheumatology

AGL:

Glycogen debranching enzyme

AM:

Anti-malarial

BEL:

Belimumab

BILAG-BR:

British Isles Lupus Assessment Group-Biologics Registry

BMI:

Body Mass Index

C1:

Cluster 1

C2:

Cluster 2

C3:

Cluster 3

C4:

Cluster 4

C5:

Cluster 5

C6:

Cluster 6

CFL-1:

Cofilin1

CSK:

Tyrosine-protein kinase

DAMPs:

Damage-associated molecular patterns

DDA:

Data-dependant acquisition

FLNC:

Filamin-C

FLS:

Fibroblast-like synoviocytes

HC:

Healthy Control

IRF5:

Interferon regulating factor 5 (IRF5)

ITGAM:

Integrin subunit alpha M

LC-MS:

Liquid chromatography mass spectrometry

MALDI-TOF MS:

Matrix assisted laser desorption ionization-time of flight mass spectrometry

MMF:

Mycophenolate Mofetil

MS:

Mass-spectrometry

MZB1:

Marginal zone B- and B1-cell-specific protein

NICE:

National Institute for Health and Care Excellence

PAK:

P21-activated kinases

PECAM-1:

Platelet endothelial cell adhesion molecule

PLC:

Phospholipase C

RA:

Rheumatoid arthritis

RAGE:

Receptor for advanced glycation end products

RF:

Rheumatoid factor

RTX:

Rituximab

S100A12:

S100 calcium binding protein A12

SS:

Sjogren’s Syndrome

SLC22A2:

Solute carrier family 22 member 2

SLE:

Systemic lupus erythematosus

SND1:

Staphylococcal nuclease domain-containing protein 1

SWATH-MS:

Sequential window acquisition of all theoretical mass spectra mass spectrometry

TA:

Targeted acquisition

t-SNE:

T-distributed stochastic neighbour embedding

uACR:

Albumin-creatinine ratio

uPCR:

Urine protein-creatinine ratio

References

  1. Bultink IEM, de Vries F, van Vollenhoven RF, Lalmohamed A. Mortality, causes of death and influence of medication use in patients with systemic lupus erythematosus vs matched controls. Rheumatology. 2020;60(1):207–16.

    Article  PubMed Central  Google Scholar 

  2. Furie R, Rovin BH, Houssiau F, Malvar A, Teng YKO, Contreras G, et al. Two-year, randomized, controlled trial of belimumab in lupus nephritis. N Engl J Med. 2020;383(12):1117–28.

    Article  CAS  PubMed  Google Scholar 

  3. Morand EF, Furie R, Tanaka Y, Bruce IN, Askanase AD, Richez C, et al. Trial of anifrolumab in active systemic lupus erythematosus. N Engl J Med. 2019;382(3):211–21.

    Article  PubMed  Google Scholar 

  4. Navarra SV, Guzman RM, Gallacher AE, Hall S, Levy RA, Jimenez RE, et al. Efficacy and safety of belimumab in patients with active systemic lupus erythematosus: a randomised, placebo-controlled, phase 3 trial. Lancet. 2011;377(9767):721–31.

    Article  CAS  PubMed  Google Scholar 

  5. Soliman S, Mohan C. Lupus nephritis biomarkers. Clin Immunol. 2017;185:10–20.

    Article  CAS  PubMed  Google Scholar 

  6. Ferreira HA-O, Melo T, Paiva A, Domingues MA-O. Insights in the role of lipids, oxidative stress and inflammation in rheumatoid arthritis unveiled by new trends in lipidomic investigations. Antioxidants. 2021. https://doi.org/10.3390/antiox10010045.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Xu T, Guo Y, Lu J, Shan J, Lin L, Qian W, et al. Untargeted serum metabolomics and potential biomarkers for Sjogren’s syndrome. Clin Exp Rheumatol. 2021;133(39 Suppl 6):23–9.

    Article  Google Scholar 

  8. Balanescu P, Balanescu A, Balanescu E, Baicus C. Candidate proteomic biomarkers in systemic sclerosis discovered using mass-spectrometry: an update of a systematic review (2014–2020). Rom J Intern Med. 2021;59(2):101–11.

    PubMed  Google Scholar 

  9. Nicolaou O, Kousios A, Hadjisavvas A, Lauwerys B, Sokratous K, Kyriacou K. Biomarkers of systemic lupus erythematosus identified using mass spectrometry-based proteomics: a systematic review. J Cell Mol Med. 2017;21(5):993–1012.

    Article  PubMed  Google Scholar 

  10. Anjo SI, Santa C, Manadas B. SWATH-MS as a tool for biomarker discovery: from basic research to clinical applications. Proteomics. 2017. https://doi.org/10.1002/pmic.201600278.

    Article  PubMed  Google Scholar 

  11. Krasny L, Huang PH. Data-independent acquisition mass spectrometry (DIA-MS) for proteomic applications in oncology. Mol Omics. 2021;17(1):29–42.

    Article  CAS  PubMed  Google Scholar 

  12. Ludwig C, Gillet L, Rosenberger G, Amon S, Collins BC, Aebersold R. Data-independent acquisition-based SWATH-MS for quantitative proteomics: a tutorial. Mol Syst Biol. 2018;14(8): e8126.

    Article  PubMed  PubMed Central  Google Scholar 

  13. McCarthy EM, Sutton E, Nesbit S, White J, Parker B, Jayne D, et al. Short-term efficacy and safety of rituximab therapy in refractory systemic lupus erythematosus: results from the British Isles Lupus Assessment Group Biologics Register. Rheumatology. 2018;57(3):470–9.

    Article  CAS  PubMed  Google Scholar 

  14. Watson SRC. Interim Clinical Commissioning Policy Statement: Rituximab 2013 02/02/0201:[01–11 pp.]. Available from: https://www.england.nhs.uk/wp-content/uploads/2018/07/Rituximab-for-the-treatment-of-systemic-lupus-erythematosus-in-adults.pdf.

  15. (NICE) NIoCE. Belimumab for treating active autoantibody-positive systemic lupus erythematosus (Technology appraisal guidance [TA397]) [Clinical Guidance]. 2016. Available from: https://www.nice.org.uk/guidance/ta397.

  16. Isenberg DA, Rahman A, Allen E, Farewell V, Akil M, Bruce IN, et al. BILAG 2004. Development and initial validation of an updated version of the British Isles Lupus Assessment Group’s disease activity index for patients with systemic lupus erythematosus. Rheumatology. 2005;44(7):902–6.

    Article  CAS  PubMed  Google Scholar 

  17. Gladman DD, Ibanez D, Urowitz MB. Systemic lupus erythematosus disease activity index 2000. J Rheumatol. 2002;29(2):288–91.

    PubMed  Google Scholar 

  18. Gladman DD, Goldsmith CH, Urowitz MB, Bacon P, Fortin P, Ginzler E, et al. The Systemic Lupus International Collaborating Clinics/American College of Rheumatology (SLICC/ACR) damage index for systemic lupus erythematosus international comparison. J Rheumatol. 2000;27(2):373–6.

    CAS  PubMed  Google Scholar 

  19. Sumida K, Nadkarni GN, Grams ME, Sang Y, Ballew SH, Coresh J, et al. Conversion of urine protein-creatinine ratio or urine dipstick protein to urine albumin-creatinine ratio for use in chronic kidney disease screening and prognosis: an individual participant-based meta-analysis. Ann Intern Med. 2020;173(6):426–35.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Betteridge Z, Gunawardena H, North J, Slinn J, McHugh N. Anti-synthetase syndrome: a new autoantibody to phenylalanyl transfer RNA synthetase (anti-Zo) associated with polymyositis and interstitial pneumonia. Rheumatology. 2007;46(6):1005–8.

    Article  CAS  PubMed  Google Scholar 

  21. Liu Y, Buil A, Collins BC, Gillet LC, Blum LC, Cheng LY, et al. Quantitative variability of 342 plasma proteins in a human twin population. Mol Syst Biol. 2015. https://doi.org/10.15252/msb.20145728

  22. Charrad M, Ghazzali N, Boiteau V, Niknafs A. NbClust: AnRPackage for determining the relevant number of clusters in a data set. J Stat Softw. 2014;61(6):1–36.

    Article  Google Scholar 

  23. van der Maaten L, Hinton G. Viualizing data using t-SNE. J Mach Learn Res. 2008;9:2579–605.

    Google Scholar 

  24. Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2018;47(D1):D607–13.

    Article  PubMed Central  Google Scholar 

  25. Krämer A, Green J, Pollard J Jr, Tugendreich S. Causal analysis approaches in ingenuity pathway analysis. Bioinformatics. 2014. https://doi.org/10.1093/bioinformatics/btt703.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Idborg H, Zandian A, Ossipova E, Wigren E, Preger C, Mobarrez F, et al. Circulating levels of interferon regulatory factor-5 associates with subgroups of systemic lupus erythematosus patients. Front Immunol. 2019;10:1029.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Fu D, Yang Y, Xiao Y, Lin H, Ye Y, Zhan Z, et al. Role of p21-activated kinase 1 in regulating the migration and invasion of fibroblast-like synoviocytes from rheumatoid arthritis patients. Rheumatology. 2012;51(7):1170–80.

    Article  CAS  PubMed  Google Scholar 

  28. Kutsuna H, Suzuki K, Kamata N, Kato T, Hato F, Mizuno K, et al. Actin reorganization and morphological changes in human neutrophils stimulated by TNF, GM-CSF, and G-CSF: the role of MAP kinases. Am J Physiol Cell Physiol. 2004;286(1):C55-64.

    Article  CAS  PubMed  Google Scholar 

  29. Ooka S, Nakano H, Matsuda T, Okamoto K, Suematsu N, Kurokawa MS, et al. Proteomic surveillance of autoantigens in patients with Behcet’s disease by a proteomic approach. Microbiol Immunol. 2010;54(6):354–61.

    Article  CAS  PubMed  Google Scholar 

  30. Sumova B, Cerezo LA, Szczukova L, Nekvindova L, Uher M, Hulejova H, et al. Circulating S100 proteins effectively discriminate SLE patients from healthy controls: a cross-sectional study. Rheumatol Int. 2019;39(3):469–78.

    Article  PubMed  Google Scholar 

  31. Donohue SJ, Midgley A, Davies JC, Wright RD, Bruce I, Beresford MW, et al. Differential analysis of serum and urine S100 proteins in juvenile-onset systemic lupus erythematosus (jSLE). Clin Immunol. 2020;214(1521–7035 (Electronic)): 108375.

    Article  CAS  PubMed  Google Scholar 

  32. Kim JW, Jung JY, Lee SW, Baek WY, Kim HA, Suh CH. S100A8 in Serum, Urine, and Saliva as a Potential Biomarker for Systemic Lupus Erythematosus. Front Immunol. 2022;13: 886209.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Foell D, Roth J. Proinflammatory S100 proteins in arthritis and autoimmune disease. Arthritis Rheum. 2004;50(12):3762–71.

    Article  CAS  PubMed  Google Scholar 

  34. Senolt L, Grigorian M, Lukanidin E, Simmen B, Michel BA, Pavelka K, Gay RE, et al. S100A4 is expressed at site of invasion in rheumatoid arthritis synovium and modulates production of matrix metalloproteinases. Ann Rheum Dis. 2006. https://doi.org/10.1136/ard.2005.047704.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Stuhlmuller B, Ungethum U, Scholze S, Martinez L, Backhaus M, Kraetsch HG, et al. Identification of known and novel genes in activated monocytes from patients with rheumatoid arthritis. Arthritis Rheum. 2000;43(4):775–90.

    Article  CAS  PubMed  Google Scholar 

  36. Patel M, Clarke AM, Bruce IN, Symmons DP. The prevalence and incidence of biopsy-proven lupus nephritis in the UK: Evidence of an ethnic gradient. Arthritis Rheum. 2006;54(9):2963–9.

    Article  PubMed  Google Scholar 

  37. Fagerholm SC, MacPherson M, James MJ, Sevier-Guy C, Lau CS. The CD11b-integrin (ITGAM) and systemic lupus erythematosus. Lupus. 2013;22(7):657–63.

    Article  CAS  PubMed  Google Scholar 

  38. Yang M, Xu W, Wang Y, Jiang X, Li Y, Yang Y, et al. CD11b-activated Src signal attenuates neuroinflammatory pain by orchestrating inflammatory and anti-inflammatory cytokines in microglia. Mol Pain. 2018;14(1744–8069):1744806918808150(Electronic).

    Google Scholar 

  39. Ramirez-Bello J, Sun C, Valencia-Pacheco G, Singh B, Barbosa-Cobos RE, Saavedra MA, et al. ITGAM is a risk factor to systemic lupus erythematosus and possibly a protection factor to rheumatoid arthritis in patients from Mexico. PLoS ONE. 2019;14(11): e0224543.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Yang W, Zhao M, Hirankarn N, Lau CS, Mok CC, Chan TM, et al. ITGAM is associated with disease susceptibility and renal nephritis of systemic lupus erythematosus in Hong Kong Chinese and Thai. Hum Mol Genet. 2009;18(11):2063–70.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Kwon OC, Lee EJ, Yeom J, Hong S, Lee CK, Yoo B, et al. Discovery of urine biomarkers for lupus nephritis via quantitative and comparative proteome analysis. Clin Transl Med. 2021;11(11): e638.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Caster DJ, Korte EA, Merchant ML, Klein JB, Wilkey DW, Rovin BH, et al. Autoantibodies targeting glomerular annexin A2 identify patients with proliferative lupus nephritis. Proteomics Clin Appl. 2015;9(11–12):1012–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Manjarrez-Orduño N, Marasco E, Chung SA, Katz MS, Katz MS, Kiridly JF, Simpfendorfer KR, Freudenberg J, et al. CSK regulatory polymorphism is associated with systemic lupus erythematosus and influences B-cell signaling and activation. Nat Genet. 2012. https://doi.org/10.1038/ng.2439.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Zhou D, Luo N, Wu Q, You Y, Zhai Z, Mou Z, et al. Transcellular distribution heterogeneity of Annexin A5 represents a protective response to lupus-related thrombophilia: a pilot proteomics-based study. Biochem Biophys Res Commun. 2012;420(2):357–63.

    Article  CAS  PubMed  Google Scholar 

  45. Wilkinson R, Lyons AB, Roberts D, Wong M-X, Bartley PA, Jackson DE. Platelet endothelial cell adhesion molecule-1 (PECAM-1/CD31) acts as a regulator of B-cell development, B-cell antigen receptor (BCR)-mediated activation, and autoimmune disease. Blood. 2002. https://doi.org/10.1182/blood-2002-01-0027.

    Article  PubMed  Google Scholar 

  46. da Rosa Franchi Santos LF, Stadtlober NP, Costa Dall’Aqua LG, Scavuzzi BM, Guimaraes PM, Flauzino T, et al. Increased adhesion molecule levels in systemic lupus erythematosus: relationships with severity of illness, autoimmunity, metabolic syndrome and cortisol levels. Lupus. 2018;27(3):380–8.

    Article  CAS  PubMed  Google Scholar 

  47. Ding H, Lin C, Cai J, Guo Q, Dai M, Mohan C, et al. Urinary activated leukocyte cell adhesion molecule as a novel biomarker of lupus nephritis histology. Arthritis Res Ther. 2020;22(1):122.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Miyagawa-Hayashino A, Yoshifuji H, Kitagori K, Ito S, Oku T, Hirayama Y, et al. Increase of MZB1 in B cells in systemic lupus erythematosus: proteomic analysis of biopsied lymph nodes. Arthritis Res Ther. 2018;20(1):13.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Kapoor T, Corrado M, Pearce EL, Pearce EJ, Grosschedl R. MZB1 enables efficient interferon alpha secretion in stimulated plasmacytoid dendritic cells. Sci Rep. 2020;10(1):21626.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Stirzaker RA, Biswas PS, Gupta S, Song L, Bhagat G, Pernis AB. Administration of fasudil, a ROCK inhibitor, attenuates disease in lupus-prone NZB/W F1 female mice. Lupus. 2012;21(6):656–61.

    Article  CAS  PubMed  Google Scholar 

  51. Dulak-Lis M, Bujak A, Gala K, Banach M, Kedzierska U, Miszkiel J, et al. A novel JAK/ROCK inhibitor, CPL409116, demonstrates potent efficacy in the mouse model of systemic lupus erythematosus. J Pharmacol Sci. 2021;145(4):340–8.

    Article  CAS  PubMed  Google Scholar 

  52. Zhang S, Zhou X, Lang RA, Guo F. RhoA of the Rho family small GTPases is essential for B lymphocyte development. PLoS ONE. 2012. https://doi.org/10.1371/journal.pone.0033773.

    Article  PubMed  Google Scholar 

  53. Yoshimi R, Ueda A, Ozato K, Ishigatsubo Y. Clinical and pathological roles of Ro/SSA autoantibody system. Clin Dev Immunol. 2012;2012: 606195.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Salie MT, Yang J, Ramirez Medina CR, Zuhlke LJ, Chishala C, Ntsekhe M, et al. Data-independent acquisition mass spectrometry in severe rheumatic heart disease (RHD) identifies a proteomic signature showing ongoing inflammation and effectively classifying RHD cases. Clin Proteomics. 2022;19(1):7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Muazzam A, Spick M, Cexus ONF, Geary B, Azhar F, Pandha H, et al. A novel blood proteomic signature for prostate cancer. Cancers (Basel). 2023;15(4):1051.

    Article  CAS  PubMed  Google Scholar 

  56. Geary B, Walker MJ, Snow JT, Lee DCH, Pernemalm M, Maleki-Dizaji S, et al. Identification of a biomarker panel for early detection of lung cancer patients. J Proteome Res. 2019;18(9):3369–82.

    Article  CAS  PubMed  Google Scholar 

  57. Min L, Zhu S, Wei R, Zhao Y, Liu S, Li P, et al. Integrating SWATH-MS proteomics and transcriptome analysis identifies CHI3L1 as a plasma biomarker for early gastric cancer. Mol Ther Oncolytics. 2020. https://doi.org/10.1016/j.omto.2020.03.020.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Cecchettini A, Finamore F, Ucciferri N, Donati V, Mattii L, Polizzi E, et al. Phenotyping multiple subsets in Sjogren’s syndrome: a salivary proteomic SWATH-MS approach towards precision medicine. Clin Proteomics. 2019;16:26.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Liao W, Li Z, Li T, Zhang Q, Zhang H, Wang X. Proteomic analysis of synovial fluid in osteoarthritis using SWATH-mass spectrometry. Mol Med Rep. 2018;17(2):2827–36.

    CAS  PubMed  Google Scholar 

  60. Zhang Y, Gan L, Tang J, Liu D, Chen G, Xu B. Metabolic profiling reveals new serum signatures to discriminate lupus nephritis from systemic lupus erythematosus. Front Immunol. 2022. https://doi.org/10.3389/fimmu.2022.967371.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors would also like to acknowledge the work of Patrick Doherty (MASTERPLANS data architect), Gillian Armitt (MASTERPLANS project manager) and the MASTERPLANS Patient Partners.

Funding

The MASTERPLANS Consortium was funded by a grant from the Medical Research Council (MR/M01665X/1). BILAG BR has been funded by unrestricted educational donations from Roche, GSK and LUPUS UK. Professor Bruce is a National Institute for Health Research (NIHR) Emeritus Senior Investigator and is funded by the NIHR Manchester Biomedical Research Centre. Equipment used in the Stoller Biomarker Discovery Centre is funded by a donation received from the Stoller Charitable Trust and a research grant awarded by the Medical Research Council (MR/M008959/1). MRC/EPSRC Molecular Pathology Node provided additional financial support by a grant from the Medical Research Council and Engineering & Physical Sciences Research Council (MR/N00583X/1).

Author information

Authors and Affiliations

Authors

Consortia

Contributions

KYCS: Conceptualization, formal analysis, methodology, project administration, software, validation, visualisation, writing—original draft, writing—review and edit. JAR: Conceptualization, formal analysis, methodology, project administration, validation, visualization, supervision, software, writing—review and edit. RR: Investigation, Data Curation, Software. RDS: Investigation, Data Curation, Software. JK: Investigation, Data Curation, Software. IB-J: Investigation, Data Curation, Software. DL: Investigation, Data Curation, Software. ADW: Investigation, Data Curation, Software, writing—review and edit. NG: Investigation, writing—review and edit. NMH: Investigation, Data Curation, Software, writing—review and edit. INB: Investigation, Data Curation, Software, writing—review and edit, Funding acquisition (As per: https://credit.niso.org/).

Corresponding author

Correspondence to John A. Reynolds.

Ethics declarations

Ethics approval and consent to participate

Ethical approval was granted by the NRES Committee North West Greater Manchester West (REC: 09/H1014/64) and the local Research and Development departments at participant sites. All patients provided written informed consent at the time of study registration.

Consent for publication

Not applicable.

Competing interests

Kevin Y C Su: Has received honoraria from Novartis pharmaceuticals for educational consulting. John A Reynolds: None to disclose. Rachel Reed: None to disclose. Rachael Da Silva: None to disclose. Janet Kelsall: None to disclose. Ivona Baricevic-Jones: None to disclose. David Lee: None to disclose. Anthony D Whetton: None to disclose. Nophar Geifman: None to disclose. Neil McHugh: None to disclose. Ian N Bruce has received grant/research support from GlaxoSmithKline; received consulting fees from GSK, UCB, Eli Lilly, BMS, Merck Serono, Aurinia and Astra Zeneca; and was a speaker for AstraZeneca, GlaxoSmithKline, and UCB.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

‘Elbow’ Methods of determining optimal cluster number with the RStudio package ‘nbclust’. Table S1. Detailed ethnicity data for the study population. Table S2. Details of ethnicity between clusters. Table S3. List of Proteins in Cluster 1 with Significant FC and P value. Table S4. Cluster 1 Proteins associated with subcluster 1A and 1B. Figure S2. Dissimilarity matrices for each of the 6 clusters. Table S5. Proteins and canonical pathways relating to the clusters 1A and 1B. Figure S3. Gene Ontology of all proteins in cluster 1 utilising CLUEgo, a Cytoscape plugin for biological interpretation (4). Table S6. Pathway analysis of the proteins which are increased or decreased in C6 compared to the rest of the cohort.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Su, K.Y.C., Reynolds, J.A., Reed, R. et al. Proteomic analysis identifies subgroups of patients with active systemic lupus erythematosus. Clin Proteom 20, 29 (2023). https://doi.org/10.1186/s12014-023-09420-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12014-023-09420-1

Keywords