Biomolecular condensates form spatially inhomogeneous network fluids

Dar, Furqan; Cohen, Samuel R.; Mitrea, Diana M.; Phillips, Aaron H.; Nagy, Gergely; Leite, Wellington C.; Stanley, Christopher B.; Choi, Jeong-Mo; Kriwacki, Richard W.; Pappu, Rohit V.

doi:10.1038/s41467-024-47602-z

Article
Open access
Published: 22 April 2024

Nature Communications volume 15, Article number: 3413 (2024)

Abstract

The functions of biomolecular condensates are thought to be influenced by their material properties, and these will be determined by the internal organization of molecules within condensates. However, structural characterizations of condensates are challenging, and rarely reported. Here, we deploy a combination of small angle neutron scattering, fluorescence recovery after photobleaching, and coarse-grained molecular dynamics simulations to provide structural descriptions of model condensates that are formed by macromolecules from nucleolar granular components (GCs). We show that these minimal facsimiles of GCs form condensates that are network fluids featuring spatial inhomogeneities across different length scales that reflect the contributions of distinct protein and peptide domains. The network-like inhomogeneous organization is characterized by a coexistence of liquid- and gas-like macromolecular densities that engenders bimodality of internal molecular dynamics. These insights suggest that condensates formed by multivalent proteins share features with network fluids formed by systems such as patchy or hairy colloids.

Introduction

Biomolecular condensates are compositionally distinct membraneless bodies that enable spatiotemporal organization and control over a range of biochemical reactions in cells^1,2,3,4,5,6. Condensates are often thought of as spatially homogeneous liquids that form via liquid-liquid phase separation^7,8. However, a more nuanced view is emerging, driven by the realization that the multivalence of associative motifs and domains is crucial for driving condensation^9,10,11. Multivalent macromolecules form reversible physical crosslinks, giving rise to networked molecules that underlie the viscoelasticity of condensates^{12,13,14,15,16,17,18,19,20,21}. Condensates form via coupled associative and segregative phase transitions of associative macromolecules^22,23.

Proteins that are exemplars of associative macromolecules have distinct molecular features, typically encompassing oligomerization domains (OD), ligand binding domains, and intrinsically disordered regions (IDRs) with distinctive sequence characteristics^{11,22,24,25,26,27,28}. The coupling of associative and segregative phase transitions, referred to as COAST²², and the driving forces for these transitions derive from the molecular features of associative macromolecules^{10,20,29,30,31,32,33,34,35,36,37,38}. Complex coacervation is a clear illustration of the coupling between associative and segregative phase transitions^{11,28,39,40,41,42,43,44,45}. Here, the complexation of polyelectrolytes is driven by a combination of enthalpically favorable associations and entropically favored release of counterions^39,42,44,46. If the complexes are higher-order clusters of undefined stoichiometry, then we arrive at the Ogston limit where size and hydration details lower the solubility, and the complexes undergo a segregative transition to generate coexisting dilute and dense phases^22,47,48. The dilute phase will comprise dissociated polyions, complexed polyions that form higher-order oligomers^49,50, and pre-percolation clusters^51,52, all of which must be electroneutral and hence will involve different degrees of ion associations. The dense phase will be a percolated network of polyions, whereby each polyion has multiple partners, and formally this will be limited by the number of uncompensated charges on each polyion in the dense phase. Accordingly, complex coacervation involves electrostatically-driven associations, which can give rise to higher-order complexes, and segregation into dense and dilute phases driven by a combination of counterion release and the lower solubility, due to altered hydration profiles, of higher-order complexes. Other instantiations of COAST-like processes include the coupling of percolation and phase separation^9,10,53,54. 
Percolation, specifically bond percolation, also known as physical gelation^25,37,54,55, is a continuous associative phase transition whereby motifs or domains form reversible physical crosslinks to enable the formation of sequence- and architecture-specific networks that span the length scale of the system of interest^34,53. As the networks grow, phase separation can be driven by the balance of inter-macromolecule, macromolecule-solvent, and solvent-solvent interactions, controlled largely by the sequence, structural, and solubility characteristics of spacers, which are regions outside the associative domains and motifs^47,56.

COAST-like processes give rise to condensates with network-like internal organization^30,57,58,59. The networks will be defined by the architectures of the constituent molecules and the extent of crosslinking among the molecules^34,53,57,60. The internal viscosity of condensates and the elasticity of the networks will be governed by the interplay between the timescales for molecular transport within and into/out of condensates and the timescales for making and breaking physical crosslinks. Furthermore, the network-like internal organization will engender spatial inhomogeneities of physically crosslinked macromolecules^22,29,30,58.

Viscoelastic materials have time-dependent properties and network structures contribute directly to viscoelastic moduli of condensates⁶¹. Even if condensates are dominantly viscous fluids as opposed to elastic solids, there will be timescales where the materials are dominantly elastic⁵⁸. Condensates can age, and if they undergo equilibrium fluid-to-solid transitions, they transform from dominantly viscous to dominantly elastic viscoelastic materials⁵⁸. Alternatively, some aged condensates can behave like viscoelastic network glasses^62,63. While the network-like organization within condensates has been inferred from viscoelastic measurements and validated by the reproduction of measured moduli using computed network structures⁵⁸, there is a paucity of measurements that directly test the hypothesis of network structures within condensates.

The approach we pursue here is rooted in its historical use in the study of simple and complex fluids, and is based on scattering measurements, specifically small-angle neutron scattering (SANS)^64,65,66,67. A key advantage of SANS is that one can investigate the presence of spatial inhomogeneities that range from a few angstroms to hundreds of nanometers⁶⁸. Here, we investigate the structures of condensates that are mimics of nucleolar sub-phases. The nucleolus is a spatially organized condensate featuring at least three coexisting sub-phases. The GC, which is the outermost layer, is scaffolded by nucleophosmin (NPM1)^22,69,70. Condensates formed by complexation of NPM1 and arginine-rich (R-rich) peptides and proteins such as rpL5 and SURF6 have been used to test postulates of the molecular handoff model for ribosomal subunit assembly within the nucleolar GC^{71,72,73,74,75,76}. Measurements of internal structure within condensates that are based on SANS were first reported by Mitrea et al.⁷¹. They studied condensates formed via heterotypic interactions of cationic arginine-rich peptides (rpL5) and N130, the N-terminal 130 residues of NPM1. The N130 construct includes the OD and at least three short regions that are rich in acidic residues⁷⁷.

Here, we revisit the SANS data collected by Mitrea et al.⁷¹, updating these with new measurements and combining these with simulations to answer the following question: how might descriptors from theories of simple and complex fluids be adapted for describing condensates? To answer this question, we adapt approaches that integrate scattering data with computer simulations^{78,79,80,81,82,83,84,85}. We combine traditional approaches based on pair distribution functions with graph-theoretic methods to arrive at descriptions of network structures of condensates formed by N130 and rpL5. The simulations we use are based on bespoke, sequence-specific coarse-grained (CG) models. The latter were developed using a machine-learning approach that is bootstrapped against atomistic simulations⁸⁶.

Results

N130 and rpL5 form condensates via complexation

Following the work of Mitrea et al.⁷¹, the N130 construct corresponds to residues 1–130 of mouse NPM1 that includes the OD interspersed by short, disordered regions that encompass acidic residues (Fig. 1a). Previous work showed that N130 forms condensates with R-rich peptides. Two regions within N130 are enriched in acidic residues. One encompasses a flexible loop (residues 35–44, termed A1). The other is located at the C-terminus (residues 120–130, termed A2). These regions were shown to mediate interactions with R-rich peptides and promote condensate formation (Fig. 1a)⁷⁷. There is also an N-terminal acidic region (residues 1–16, termed A0) that we discuss below.

**Fig. 1: Complexation between acidic regions within N130 and R-motifs of rpL5 is required for condensation.**

We performed atomistic simulations using the ABSINTH implicit solvation model and forcefield paradigm⁸⁷. In these simulations, the pentamerized OD was modeled as a rigid domain, the conformations adopted by the IDRs were sampled using Monte Carlo (MC) moves, and the simulations were performed at low salt concentrations of 20 mM. From the simulations, we obtained an overall structure of the N130 pentamer and an ensemble of conformations formed by N130 complexed rpL5 (Fig. 1b)⁷¹. The rpL5 peptide was taken from the ribosomal protein L5 and its sequence corresponds to the region that has been shown via experiments to interact with NPM1⁷⁷. Simulations show that rpL5 adopts ensembles of expanded conformations that maximize the favorable solvation of Arg and Lys residues⁸⁸.

Titrating in rpL5 at a fixed N130 concentration of 100 μM in the absence of crowders gives a threshold rpL5 concentration for phase separation that is between 250 and 300 µM in 150 mM NaCl (Fig. 1c). The phase boundary (Fig. 1d) is consistent with previous experimental studies^71,89. Next, we performed SANS measurements to probe the molecular organization within condensates formed by N130 complexed with rpL5 (Fig. 1e). The SANS intensity is a convolution of the form factor and structure factor. The former quantifies scattering that results from the average shapes of the scatterers, whereas the latter quantifies how the particles scatter neutrons due to spatial correlations caused by intra- and intermolecular interactions. Specifically, the structure factor measures density correlations in reciprocal space, whereas the form factor is the Fourier transform of the density distribution⁹⁰.

The importance of complexation as a driver of internal organization is made clear by the lack of peaks in the scattering profile for N130 pentamers when rpL5 is absent from the solution (also see Supplementary Fig. 1). The inhomogeneities in spatial densities that are evident in the scattering profile (shown by the arrows in Fig. 1e) are indicative of order on specific length scales. The multi-peak fitting analysis, developed by Mitrea et al.⁷¹, combined with analysis of derivatives of the scattering profile, shows that the most reliable, high signal-to-noise peaks correspond to length scales of ~55 Å and ~77 Å (Fig. 1e). In the derivative analysis, the signal-to-noise is found to decrease at low q values, and this makes the assignment of peaks beyond the second one unreliable (Supplementary Fig. 2).
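As a rough illustration of how a peak position in a scattering profile maps to a real-space length scale, the sketch below assumes the common Bragg-type relation d = 2π/q. The source does not state which conversion was used, so both the relation and the function names are assumptions made for illustration only.

```python
import math

def length_scale_from_q(q_inv_angstrom):
    """Real-space length scale d (in Å) for a peak at wavevector q (in Å^-1).

    Assumes d = 2*pi/q, which is a common convention and not a statement
    about the analysis used in the source.
    """
    if q_inv_angstrom <= 0:
        raise ValueError("q must be positive")
    return 2.0 * math.pi / q_inv_angstrom

def q_from_length_scale(d_angstrom):
    """Inverse of length_scale_from_q under the same assumed relation."""
    if d_angstrom <= 0:
        raise ValueError("d must be positive")
    return 2.0 * math.pi / d_angstrom

# Length scales quoted in the text, converted to the q values at which
# the corresponding peaks would appear under the assumed relation.
for d in (55.0, 77.0):
    print(f"d = {d:.0f} A  <->  q = {q_from_length_scale(d):.4f} A^-1")
```

Under this assumption, longer length scales sit at lower q, which is the region where the text notes that the signal-to-noise decreases.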

The molecular diameter of the pentamerized OD (~53 Å) is a useful ruler for calibrating the different length scales. To further characterize the nature of the ordering and the interactions that contribute to ordering, we turned to computational approaches.

Systematic CG and predictions

To investigate the internal structure of fluid-like condensates, we performed CG simulations of the N130 + rpL5 condensates. In the CG model, the pentamerized OD of the N130 pentamer, referred to hereafter as PD, was modeled as a single, spherical bead defined by excluded-volume interactions. We used a single-bead-per-residue representation for residues in the IDRs of N130. Accordingly, in addition to the acidic regions, A1 and A2, we also explicitly modeled the N-terminal region of N130 (termed A0). The architectures of the CG N130 molecules are reminiscent of hairy colloids⁹¹, featuring disordered, acidic regions that protrude from one side of the sphere that mimics the PD. Hairy colloids are known to form network fluids through anisotropic interactions engendered by the architectures of the constituent molecules^{92,93,94,95,96}. All residues in rpL5 were modeled as single beads.

The systematic CG procedure was initiated by bootstrapping against information generated using atomistic simulations based on the ABSINTH implicit solvation model and forcefield paradigm⁸⁷ (Fig. 2a). Having prescribed the resolution for the CG model, we then used ensembles from atomistic simulations of N130 pentamers with 15 copies of rpL5 to generate forcefield parameters for the CG model. For this, we used the CAMELOT algorithm^30,86,97, which combines a Gaussian Process Bayesian Optimization⁹⁸ module with an appropriate architecture and CG model. The parameters of the CG model minimize the difference between the atomistic conformational ensembles and the CG representation. This combines the computational efficiency of the CG model with the sequence-specific effects learned via the CAMELOT algorithm. Using the CG representation, we simulated a dense phase with 108 copies of N130 and 1620 copies of the rpL5 peptide.

**Fig. 2: Coarse-grained simulations of N130 + rpL5 condensates highlight the importance of an N-terminal acidic region (A0) within N130.**

Results from the CG simulations of dense phases were used to compute inter-residue contact maps between the disordered regions of N130 and the rpL5 peptide (Fig. 2b). The A1 and A2 regions make favorable contacts with the basic residues in rpL5. We also observed that the A0 region makes contacts with the basic residues in rpL5. The frequency of contacts suggests that this region forms stronger interactions with rpL5 than A1. The contacts involve acidic residues within A0. The contact maps derived from the CG simulations suggest a rank ordering of interactions between acidic regions and rpL5, with A2 being the most favorable and A1 the least.

Our predictions motivated the generation of a new mutant construct where we replaced A0 with the residues from A2, in reverse order, to increase the linear charge density (see the sequence of the new A0 region in Fig. 2c). We refer to this construct as N130^+A2. It has more acidic residues than the wild type. We hypothesized, based on simulations, that the +A2 mutant should form condensates with a lower threshold concentration of rpL5 for a given N130 concentration. Indeed, titrating in rpL5 at a fixed concentration of N130^+A2 leads to a lower rpL5 threshold concentration (Fig. 2d) when compared to the threshold concentration that is required for condensation with wild-type N130 (Fig. 1c). Increasing the strength of the electrostatic interactions in A0 reduces the threshold concentration for rpL5 from 400 µM to below 350 µM at 100 µM N130. Note that the designs were chosen to ensure that the stoichiometric ratio required for condensation does not change.

Next, we investigated the impact of the +A2 mutant using SANS (Fig. 2e). We observed similar pairs of peaks at intermediate q values for both N130 + rpL5 and the +A2 mutant + rpL5. Small shifts in the locations of the peaks are likely a combination of inherent noise and a contribution from electrostatic repulsions in the disordered N- and C-termini of N130 emanating from the same face of the PD⁷⁷. The C-terminus of the wild-type protein contains nine negatively charged residues corresponding to A2, and the +A2 mutant increases the net charge on the pentamer by 25.

We also measured the impact of the +A2 mutant on the internal dynamics of N130 + rpL5. For this, we performed measurements of fluorescence recovery after photobleaching (FRAP) on the condensates (Fig. 2f). The FRAP curve for N130 + rpL5 indicates dynamical exchange with the bulk solution with the recovery time constant being 53 ± 2 s. Increasing the total charge on N130 via the +A2 mutant decreases the overall extent of FRAP, resulting in a longer recovery time of 103 ± 8 s. Similarly, we observe that N130^+A2 + rpL5 displays slower overall dynamics at shorter timescales, and the dynamics of the two systems approach one another at longer times. The average recovery times were obtained by fitting the data, for both constructs, to a single species model. This ignores the prospect of there being an immobile fraction. However, since FRAP data are a convolution of contributions from physical crosslinks and molecular transport, we chose a parsimonious, single-species model to avoid over-fitting and over-interpretations of the data.
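To make the idea of a single-species fit concrete, the following sketch fits a single-exponential recovery, I(t) = A(1 − exp(−t/τ)), to synthetic data. The functional form, the grid search over τ, and all names are assumptions chosen for illustration. The source states only that a single-species model was used and does not give its equation or fitting procedure.

```python
import math

def frap_model(t, amplitude, tau):
    """Assumed single-exponential recovery; not taken from the source."""
    return amplitude * (1.0 - math.exp(-t / tau))

def fit_single_species(times, intensities, tau_grid):
    """Grid search over tau; the amplitude is solved by linear least squares."""
    best = None
    for tau in tau_grid:
        basis = [1.0 - math.exp(-t / tau) for t in times]
        denom = sum(b * b for b in basis)
        if denom == 0.0:
            continue
        amplitude = sum(b * y for b, y in zip(basis, intensities)) / denom
        sse = sum((amplitude * b - y) ** 2 for b, y in zip(basis, intensities))
        if best is None or sse < best[0]:
            best = (sse, amplitude, tau)
    _, amplitude, tau = best
    return amplitude, tau

# Synthetic, noise-free recovery curve with a 53 s time constant (illustrative).
times = [float(t) for t in range(0, 301, 5)]
synthetic = [frap_model(t, 0.8, 53.0) for t in times]
tau_grid = [float(tau) for tau in range(1, 301)]
amplitude, tau = fit_single_species(times, synthetic, tau_grid)
print(f"fitted amplitude = {amplitude:.3f}, fitted tau = {tau:.0f} s")
```

A model with one time constant cannot separate an immobile fraction from slow recovery, which is the trade-off the text accepts in exchange for avoiding over-fitting.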

Condensates formed by N130 + rpL5 are network fluids

As observed in the SANS data (Fig. 1e), N130 + rpL5 condensates display correlations at length scales that are consistent with dimensions of the PD of N130. Therefore, we focus our analysis on the spatial correlations formed by N130 within condensates. Obtaining the experimental structure factor by deconvolution of the SANS spectrum would require modeling the form factor. This becomes intractable given the geometry of the molecules⁹⁹. Instead of solving an inverse problem, we computed pairwise correlations via the radial distribution function (RDF) g(r). This is the real-space analog of the experimentally measured structure factor¹⁰⁰. It describes how spatial densities change as a function of distance from an arbitrary reference particle. Normalized to an ideal gas, where the distance between particle pairs is completely uncorrelated, g(r) is the standard descriptor of liquid structure in theory, experiment, and simulations.
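The normalization to an ideal gas can be sketched as follows for point particles in a cubic periodic box. This is a minimal, standard-library illustration under stated assumptions (cubic box, minimum-image convention, a simple O(N²) pair loop). It is not the analysis code used for the simulations, and the function name is hypothetical.

```python
import math
import random

def radial_distribution(positions, box_length, r_max, n_bins):
    """Return (bin_centers, g) for particles in a cubic periodic box."""
    if r_max > box_length / 2.0:
        raise ValueError("r_max must not exceed half the box length")
    n = len(positions)
    dr = r_max / n_bins
    counts = [0] * n_bins
    for i in range(n - 1):
        for j in range(i + 1, n):
            r_sq = 0.0
            for a, b in zip(positions[i], positions[j]):
                d = a - b
                d -= box_length * round(d / box_length)  # minimum-image convention
                r_sq += d * d
            r = math.sqrt(r_sq)
            if r < r_max:
                # Each pair contributes to the neighbor count of both particles.
                counts[min(int(r / dr), n_bins - 1)] += 2
    density = n / box_length ** 3
    centers, g = [], []
    for k in range(n_bins):
        r_lo, r_hi = k * dr, (k + 1) * dr
        shell_volume = 4.0 / 3.0 * math.pi * (r_hi ** 3 - r_lo ** 3)
        centers.append(0.5 * (r_lo + r_hi))
        # Observed neighbors per particle divided by the ideal-gas expectation.
        g.append(counts[k] / (n * density * shell_volume))
    return centers, g

# Uncorrelated (ideal-gas-like) points should give g(r) close to 1.
random.seed(0)
box = 10.0
ideal_gas = [tuple(random.uniform(0.0, box) for _ in range(3)) for _ in range(400)]
centers, g = radial_distribution(ideal_gas, box, r_max=5.0, n_bins=10)
print([round(value, 2) for value in g])
```

For a structured fluid, deviations of g(r) from unity at short distances report on the short-range order that the text describes.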

There are accounts of condensates being akin to simple liquids⁷. However, in the physical literature, the term “simple liquids” refers to fluids formed by Lennard-Jones (LJ) particles. Accordingly, we calibrated our expectations regarding the organization of N130 and rpL5 molecules within condensates, using the RDF, g(r), for the LJ fluid as a touchstone (Supplementary Fig. 3). In an LJ fluid, structure is defined purely by packing considerations¹⁰¹.

In any g(r), the first peak corresponds to the nearest neighbors in the vicinity of the reference particle of diameter σ, and the additional peaks correspond to higher-order neighbors in surrounding shells. As a measure of the density correlations, g(r) quantifies how the average density at a separation r from the center of any particle varies with respect to the average density of the fluid. The density correlations are large in the vicinity of the reference particle, and the relative probability, vis-à-vis the ideal gas, decays as a function of distance until the density becomes indistinguishable from the average density of the fluid¹⁰². Structure can be further characterized by the volume integral over g(r) up to defined positions such as the first minimum. This quantifies the nearest-neighbor coordination number¹⁰⁰. For the LJ fluid, the coordination number is ~12–13 due to optimal packing of the spherical particles. In contrast, complex fluids have open, network-like organization due in part to less efficient packing.
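The volume integral that defines the coordination number, n = 4πρ ∫ r² g(r) dr taken up to the first minimum, can be sketched with a histogrammed g(r). The code below is illustrative: the helper names are hypothetical, and locating the first peak as the global maximum is a simplifying assumption that holds for typical liquid-like profiles but is not guaranteed in general.

```python
import math

def first_minimum_index(g_values):
    """Index of the first local minimum after the tallest peak.

    Assumes the first peak is also the global maximum of g(r).
    """
    peak = max(range(len(g_values)), key=lambda k: g_values[k])
    for k in range(peak + 1, len(g_values) - 1):
        if g_values[k] <= g_values[k - 1] and g_values[k] < g_values[k + 1]:
            return k
    return len(g_values) - 1

def coordination_number(bin_edges, g_values, density, cutoff_index):
    """Sum density * g * shell volume over bins 0..cutoff_index (inclusive)."""
    total = 0.0
    for k in range(cutoff_index + 1):
        shell_volume = 4.0 / 3.0 * math.pi * (bin_edges[k + 1] ** 3 - bin_edges[k] ** 3)
        total += density * g_values[k] * shell_volume
    return total

# Toy profile with a peak in bin 2 and its first minimum in bin 4.
toy_g = [0.0, 0.5, 2.5, 1.2, 0.6, 0.9, 1.1, 1.0]
toy_edges = [0.5 * k for k in range(len(toy_g) + 1)]
cut = first_minimum_index(toy_g)
print(cut, round(coordination_number(toy_edges, toy_g, density=0.1, cutoff_index=cut), 3))
```

Using exact shell volumes rather than r² times the bin width means that an ideal gas, with g(r) = 1 everywhere, recovers the expected count ρ·(4/3)πr³ without discretization error.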

From the CG simulations, we computed g_PD-PD(r), which quantifies spatial correlations between pairs of PDs of different N130 molecules (Fig. 3a). The profile for g_PD-PD(r) is consistent with liquid-like organization, featuring short-range order and long-range disorder with g_PD-PD(r) approaching unity at large distances. Here, disorder, which refers to the length scale at which g_PD-PD(r) approaches unity, is evident beyond an inter-PD distance of 3σ, where σ ≈ 53 Å is the diameter of an N130 pentamer. Integration of g_PD-PD(r) up to the first minimum yields a coordination number of approximately four. This suggests that the average structure of the fluid, as interrogated from the vantage point of N130, is not determined purely by packing considerations, as would be the case for an LJ fluid. Instead, rather like liquid water, which has a coordination number of approximately four, defined by networks of hydrogen bonds, the N130 molecules make up a network fluid.

**Fig. 3: Radial distribution functions point to network fluid structure of N130 + rpL5 condensates.**

The peaks in g_PD-PD(r) occur at 53 Å, 95 Å, and 144 Å. The second and third peaks correspond to ordering beyond the molecular length scale. The ratios of the computed peaks to those estimated based on SANS measurements are 0.96 and 1.25 for the first and second peaks, respectively. Note that the estimates of higher-order peaks from SANS data are less reliable given lower signal-to-noise as quantified using analysis of the derivatives (Supplementary Fig. 2). Further, the parameters of the CG model, especially the parameters for van der Waals interactions, which are governed by the inter-residue and inter-domain distances, will depend on the screening length and ion-mediated correlations in atomistic simulations. The ABSINTH model includes explicit representations of solution ions, and these simulations were performed at low ionic strength, with the salt concentration set at 20 mM given the explicit representations of ions and large droplet sizes. The inclusion of explicit representations of ions leads to exponential increases in simulation time because of the way electrostatic interactions are handled in the ABSINTH model⁸⁷. In the SANS measurements, the salt concentrations were 150 mM. Therefore, given the parameterization of the CG model using atomistic simulations, the differences in peak positions that correspond to intermediate and longer-range ordering are due to differences in effective Debye lengths between the simulations and SANS measurements. Because the van der Waals parameters are learned from atomistic simulations, one cannot achieve perfect congruence by simply changing Debye lengths in the CG simulations. Instead, we need salt-concentration-dependent parameters within the CG model. This requires a model for how the salt-dependent interactions change at different length scales. The remainder of the discussion focuses on insights we can glean from the CG simulations. In doing so, we presume semi-quantitative congruence with SANS experiments.
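To give a sense of how different the screening lengths are at the two salt concentrations, the sketch below evaluates the textbook Debye length for a monovalent salt in water. The temperature (298.15 K) and relative permittivity (78.4) are assumed values that are not given in the source, and the calculation ignores the macromolecules and their counterions, so the numbers are indicative only.

```python
import math

# Physical constants (SI units).
VACUUM_PERMITTIVITY = 8.8541878128e-12   # F/m
BOLTZMANN = 1.380649e-23                 # J/K
AVOGADRO = 6.02214076e23                 # 1/mol
ELEMENTARY_CHARGE = 1.602176634e-19      # C

def debye_length_angstrom(ionic_strength_molar,
                          temperature_kelvin=298.15,   # assumed, not from the source
                          relative_permittivity=78.4): # assumed value for water
    """Debye length in Å for a 1:1 electrolyte of the given ionic strength."""
    ionic_strength_si = ionic_strength_molar * 1000.0  # mol/L -> mol/m^3
    numerator = VACUUM_PERMITTIVITY * relative_permittivity * BOLTZMANN * temperature_kelvin
    denominator = 2.0 * AVOGADRO * ELEMENTARY_CHARGE ** 2 * ionic_strength_si
    return math.sqrt(numerator / denominator) * 1e10   # m -> Å

for salt_molar in (0.020, 0.150):
    print(f"{salt_molar * 1000:.0f} mM -> {debye_length_angstrom(salt_molar):.1f} A")
```

Under these assumptions the screening length at 20 mM is roughly 2.7 times longer than at 150 mM, since the Debye length scales as the inverse square root of the ionic strength.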

Next, we computed the g(r) between basic residues in rpL5 and acidic residues in the three different regions of N130 (Fig. 3b). Note that these g(r) profiles were computed as a linear superposition of pair distributions between all the basic residues in the peptide and all the acidic residues in a specific region. Each of these g(r) profiles has distinct peak positions and heights for the first maximum. The heights of the peaks, realized in a range of r < 50 Å, are highest for A2 and lowest for A1. These trends are observed in the corresponding potentials of mean force (Supplementary Fig. 4). The most favorable interactions in the distance range of r < 50 Å are realized between A2 and rpL5. The hierarchy of interactions encoded in the different acidic regions of N130 agrees with the contact maps (Fig. 2b).
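The link between a pair distribution and a potential of mean force can be sketched with the standard relation W(r) = −k_BT ln g(r). The source does not spell out how its potentials of mean force were computed, so this relation and the function name are assumptions made for illustration.

```python
import math

def potential_of_mean_force(g_values):
    """Return W(r) in units of k_B*T, assuming W = -ln g.

    Bins where g(r) = 0 were never sampled and are reported as +infinity.
    """
    return [-math.log(g) if g > 0.0 else math.inf for g in g_values]

# A taller peak in g(r) corresponds to a deeper (more favorable) well in W(r).
toy_g = [0.0, 0.5, 1.0, 2.0, 4.0]
print([round(w, 3) for w in potential_of_mean_force(toy_g)])
```

Under this relation, the ordering of first-peak heights in the g(r) profiles translates directly into the ordering of well depths in the corresponding potentials of mean force.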

Interactions between acidic regions and rpL5

Next, we investigated the effects of in silico mutations where we neutralized the charges within each acidic region while keeping the density of the simulated dense phases fixed to that of the wild-type N130 + rpL5 condensates. These simulations were designed to assess how mutations that affect electrostatic interactions mediated by one region affect the totality of the network structure. We computed g(r) between pairs of PDs and between the acidic regions on N130 and basic residues of rpL5. Neutralizing the acidic residues on any of the three regions leads to a reduction in the first maximum of g_PD-PD(r) (Fig. 4a). The magnitude of the reduction in the first maximum is greatest for the A2 mutant, followed by the A0 and A1 mutants, for which the values of the peak heights are statistically similar within error (Supplementary Fig. 5). This indicates that mutations to A2 affect the overall structure more than mutations to the other regions. The potentials of mean force (Supplementary Fig. 6) corroborate this inference, showing that the least favorable interactions involve the N130 PD of the A2 mutant. The interactions between basic residues of rpL5 and acidic residues within A0, A1, and A2 are modular. This is clear from the g_AX-rpL5(r) profiles (Fig. 4b–d), where X = 0, 1, or 2, which we compute from simulations where one of A0, A1, or A2 is neutralized. The g_AX-rpL5(r) profile deviates from that of the wild type only for the region in which the charges are neutralized. Otherwise, the profiles remain roughly equivalent to those obtained from the wild-type N130 + rpL5 condensates. This suggests that the acidic regions make modular and seemingly independent interactions with rpL5 peptides (Fig. 4b–d).

Graph-theoretic analyses of network structures of condensates

The simulation results suggest that N130 + rpL5 condensates are network fluids as opposed to simple liquids. To put the network fluid concept on a quantitative footing, we turned to graph-theoretic analysis. These approaches have been used to analyze network fluids such as hydrogen-bonding networks^{103,104,105,106,107,108} and network glasses¹⁰⁹.

Adapting precedents from work on molecular fluids¹¹⁰, we construct unweighted graphs in which two molecules are considered adjacent if any of the constituent beads are within the cutoff distance defined by the first minimum in the corresponding g(r). Using this criterion, we constructed adjacency matrices via block summations (Fig. 5). We then analyzed the network structure formed by the molecular neighbors for the set of beads considered.
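The adjacency criterion described above can be sketched as follows: bead-level contacts are summed over the block of bead pairs belonging to each pair of molecules, and two molecules are adjacent when that block sum is nonzero. This is a minimal illustration with hypothetical names. It omits periodic boundary conditions for brevity and is not the analysis code used for the simulations.

```python
import math

def molecular_adjacency(bead_positions, bead_to_molecule, n_molecules, cutoff):
    """Unweighted molecule-level adjacency matrix from bead-level contacts."""
    n_beads = len(bead_positions)
    # block_sums[a][b] counts bead-bead contacts between molecules a and b.
    block_sums = [[0] * n_molecules for _ in range(n_molecules)]
    for i in range(n_beads - 1):
        for j in range(i + 1, n_beads):
            a, b = bead_to_molecule[i], bead_to_molecule[j]
            if a == b:
                continue  # contacts within one molecule do not create edges
            if math.dist(bead_positions[i], bead_positions[j]) <= cutoff:
                block_sums[a][b] += 1
                block_sums[b][a] += 1
    # Any nonzero block sum makes the two molecules adjacent.
    return [[1 if block_sums[a][b] > 0 else 0 for b in range(n_molecules)]
            for a in range(n_molecules)]

def degrees(adjacency):
    """Degree of each node: the number of edges emanating from it."""
    return [sum(row) for row in adjacency]

# Toy system: three molecules with two beads each, placed along the x axis.
positions = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0),
             (1.5, 0.0, 0.0), (5.0, 0.0, 0.0),
             (10.0, 0.0, 0.0), (11.0, 0.0, 0.0)]
owners = [0, 0, 1, 1, 2, 2]
adjacency = molecular_adjacency(positions, owners, n_molecules=3, cutoff=1.0)
print(adjacency, degrees(adjacency))
```

In practice the cutoff would be set to the first minimum of the relevant g(r), and a histogram of the resulting degrees gives the degree distribution.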

**Fig. 5: Flowchart describing the graph-theoretic analysis of simulations of dense phases of N130 and rpL5.**

To provide a suitable prior of a non-networked fluid where structure is dominated by packing considerations alone, we performed graph-theoretic analyses on systems of LJ particles. For this, we quantified the degree distributions for the vapor, liquid, and solid phases of spherical particles interacting via LJ potentials (Fig. 6a). The degree reflects the number of connections or edges emanating from a node. Here, a node is an individual LJ particle. For an ideal gas, the degree is zero. However, since LJ particles have finite size and there are attractive dispersion interactions, the vapor phase is not ideal. Instead, the degree distribution is skewed to the right. For the LJ liquid, we observe a broad distribution that is roughly symmetrical about a mean degree value of 13. As the density is further increased to obtain a solid, we see that the degree distribution shows a sharp peak at twelve, corresponding to the number of neighbors expected for a 3D hexagonal close-packed lattice¹¹¹. The locally inhomogeneous nature of a liquid allows for interactions with more neighbors than the true ground-state number seen in the solid phase.

**Fig. 6: Each acidic region of N130 in the N130 + rpL5 dense phase imparts a different network structure onto the system.**

Next, we constructed graphs using acidic residues in N130 and the basic residues in rpL5 as nodes. In contrast to the LJ systems, the computed degree distributions are bimodal (Fig. 6b), and this is suggestive of a bipartite network structure. The multimeric nature of the N130 pentamer allows the acidic regions to interact with multiple rpL5 peptides, as seen in the broad second peaks in the degree distributions. Consistent with the RDFs for N130 + rpL5, we also observe a hierarchy of degrees, with A2-rpL5 having the largest degree and A1-rpL5 having the smallest. However, for the first peaks near k = 0, which correspond to the smaller rpL5, we see that the different acidic regions do not show appreciable differences. Comparison to the LJ system suggests that the N130 + rpL5 system features both liquid- and gas-like interactions. Here, the term “gas” refers to the presence of unassociated, freely diffusing rpL5 molecules that coexist with a liquid comprising associated rpL5 molecules.

Dynamics within network fluids show two distinct regimes

In spatially inhomogeneous systems, there can be regions that are locally dense or dilute. This is made clear in the graph-theoretic analysis, which shows two interaction modalities. Similar results have been reported for fluids formed by patchy particles, especially near the liquid-gas coexistence region^92,93,94. We reasoned that the coexistence of liquid- and gas-like organization within the condensates should have dynamical consequences. To test for this, we analyzed the simulations to compute mean square displacements (MSDs) of the PDs. The MSD is calculated as a function of lag time. This involves a double average, where the inner average is a cumulative sum along the time axis, starting from zero, and progressing in increments of t + ∆, where the MSD is computed over times t and t + ∆ and averaged over the motions of individual molecules. The outer average is over all molecules. A characteristic timescale corresponds to the time it takes for the PD to diffuse across a distance corresponding to its diameter. We rescaled the abscissa by t_D, which is the timescale over which the motion of the PD fits best to a purely diffusive model with MSD being proportional to t. We find that there is a timescale below t_D where the motion is super-diffusive with an exponent greater than one, and a timescale above t_D where the motion of the PD is sub-diffusive with an exponent less than one (Fig. 7a). Based on the observed length scales, the super-diffusive motion reflects the contributions of short-range steric repulsions among the PDs and the electrostatic repulsions between acidic residues. Conversely, the sub-diffusive motions reflect contributions from physical crosslinks between acidic residues and rpL5 peptides. Histograms of the exponents that we compute for the MSDs show a bimodal distribution (Fig. 7b). 
The distribution of sub-diffusive exponents is broader and reflects the heterogeneities of motions impacted by associative interactions between acidic regions and rpL5 peptides. The MSDs calculated for the PD and for charged residues in each of the acidic regions and the basic residues in the rpL5 peptides contrast with the MSDs computed in terms of the PD alone (Fig. 7c). The acidic regions and the peptides show sub-diffusive motions on all timescales, reflecting the fact that these moieties are influenced mainly by associative intermolecular interactions.
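The double average behind the MSD, and the exponent used to classify motion as super- or sub-diffusive, can be sketched as follows. The inner average runs over time origins for a given lag and the outer average runs over molecules. The exponent is estimated here as the slope of log(MSD) against log(lag), which is one common choice and an assumption on our part. All names are hypothetical.

```python
import math

def mean_square_displacement(trajectories, lag):
    """MSD at a given lag, averaged over time origins and over molecules."""
    total, count = 0.0, 0
    for trajectory in trajectories:                 # average over molecules
        for start in range(len(trajectory) - lag):  # average over time origins
            later, earlier = trajectory[start + lag], trajectory[start]
            total += sum((a - b) ** 2 for a, b in zip(later, earlier))
            count += 1
    return total / count

def scaling_exponent(lags, msds):
    """Least-squares slope of log(MSD) versus log(lag)."""
    xs = [math.log(lag) for lag in lags]
    ys = [math.log(msd) for msd in msds]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    covariance = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    variance = sum((x - x_mean) ** 2 for x in xs)
    return covariance / variance

# Toy check: straight-line (ballistic) motion has MSD proportional to lag^2.
ballistic = [[(0.5 * t, 0.0, 0.0) for t in range(50)],
             [(0.0, 0.2 * t, 0.0) for t in range(50)]]
lags = [1, 2, 4, 8]
msds = [mean_square_displacement(ballistic, lag) for lag in lags]
alpha = scaling_exponent(lags, msds)
print(round(alpha, 3))
```

An exponent of one corresponds to purely diffusive motion. Values above one indicate super-diffusive motion and values below one indicate sub-diffusive motion, which is the classification used in the text.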

**Fig. 7: Motions within dense phases of N130 + rpL5 show bimodality.**

Discussion

Condensates have been referred to as simple liquids⁷ or as structureless entities characterized by non-specific interactions¹¹². Systems of LJ particles form simple liquids, and macromolecules that drive condensation are not LJ particles. Further, liquids are not structureless entities. Ad hoc criteria are often used to define liquids in the condensate literature⁸. Instead, structure in liquids is characterized by short-range order and long-range disorder. The order parameter for describing liquid-state structure is the RDF. The extent and range of order that can be quantified using RDFs are directly connected to the molecular architectures and to the spatial range, types, and strengths of intermolecular interactions¹⁰². SANS measurements are particularly useful for gleaning quantitative insights regarding RDFs.

Here, we deployed a combination of experimental and computational techniques to demonstrate that condensates formed by N130 and rpL5 are network fluids. This was established by observing peaks in SANS curves of condensates that are indicative of molecular order on the length scale of the N130 pentamers. The SANS data and computations show short- and intermediate-range ordering versus long-range disorder. From the computed g_PD-PD(r) profiles, we find that N130 pentamers have four nearest neighbors on average. Complexation between the acidic regions and the rpL5 peptides also contributes to the overall structure of the condensates. The acidic regions of N130 function as independent interaction modules. This explains why the valence of cohesive motifs is an important driver of condensation and material properties^{34,35,36,37,71}.

The sequence-specific CG model allowed us to identify a new acidic region, termed A0, in the N-terminal end of N130. We find that A0 interacts more strongly with the disordered peptide rpL5 than was hitherto appreciated. Mutations that increase the charge in A0 help lower the threshold concentration of rpL5 that is needed to observe condensation driven by heterotypic interactions with N130.

We find that there are two types of sub-graphs that underlie the structure of the N130 + rpL5 condensates. One of the sub-graphs corresponds to gas-like organization, and the other corresponds to that of a liquid. Note that “gas-like” implies that there are regions within condensates where the concentrations of macromolecules are ultra-dilute, and hence solvent filled. This is akin to the empty liquid concept¹¹³ reported for patchy colloids. Conversely, what we refer to as “liquid-like” refers to regions that are dense in macromolecules. The bipartite graphs also have dynamical fingerprints, which are manifest as the bimodality we observe for the MSDs of the PDs. Super- and sub-diffusive behaviors that we report here have been observed in MSDs computed from simulations of oligomer-grafted nanoparticles¹¹⁴. They are also consistent with data from nuclear magnetic resonance experiments where Gibbs et al. found that the PDs of NPM1 form an immobilized scaffold in NPM1 + p14ARF mixtures¹¹⁵. Taken together, our findings place the N130 + rpL5 system, and other such systems, in the same category as patchy and/or hairy colloids^{92,93,94,96,113,114,116,117}.

Here, we focused mainly on the effects of heterotypic interactions between N130 and R-rich rpL5 peptides on the network structure of the N130 + rpL5 condensates. Previous work has shown that the homotypic interactions within NPM1, uncovered in the presence of crowders, can also affect both the phase behavior and the mesoscopic structure of condensates formed with SURF6N^73,74. A new method that leverages the Edmond-Ogston formalism¹¹⁸ allows for the intrinsic strengths of homotypic interactions to be uncovered using crowder titrations⁴⁸. Knowledge of the strengths of homotypic interactions, and the relative interplay with heterotypic interactions, will allow for simulations of binary and higher-order mixtures that mimic nucleolar GCs. An application of graph-theoretic analysis, guided by SANS measurements, to condensates that form under the competing interplay of homotypic and heterotypic interactions should be feasible. The interplay between network structures defined by the whole range of homotypic and heterotypic interactions should illuminate the relationship between rheological properties and network structure⁵⁸, help map intra-condensate spatial organizational preferences²⁹, and inform dynamical control over compositional identities of protein-RNA condensates²⁸.

Since the nucleolus is a multicomponent and multiphasic condensate⁷⁰, we expect that varying the stoichiometries of different components will affect the overall structural properties of nucleoli. Future studies that apply the combination of experimental, computational, and analytical techniques deployed here to more complex systems will enrich our understanding of the relationship between the spatial organization of condensed systems and the network properties.

Methods

Cloning

All N130 constructs were subcloned into a pET28b plasmid vector, in frame with an N-terminal 6x His tag, followed by a TEV protease recognition sequence from synthetic double-stranded DNA (Integrated DNA Technologies, Coralville, IA, USA).

Protein expression and purification

The plasmid constructs were used to transform E. coli strain BL21(DE3) (Millipore Sigma, Burlington, MA, USA), followed by incubation with shaking at 37 °C. When bacterial cultures reached an optical density at 600 nm of ~0.8, the temperature was reduced to 20 °C and protein expression was induced with the addition of IPTG (GoldBio, St. Louis, MO, USA) to a final concentration of 1 mM, and further incubated with shaking overnight. Cells were harvested by centrifugation and lysed by sonication in buffer A (25 mM Tris, 300 mM NaCl, 5 mM $\beta$-mercaptoethanol, pH 7.5). The soluble fraction was further separated by centrifugation for 30 min at 30,000 × g and loaded on a Ni-NTA column, pre-equilibrated in buffer A. Bound protein was eluted with a gradient of buffer B (25 mM Tris, 300 mM NaCl, 500 mM Imidazole, 5 mM $\beta$-mercaptoethanol, pH 7.5). The fractions containing the protein of interest were identified by SDS-PAGE and pooled, and the 6x His affinity tag was removed by proteolytic cleavage, in the presence of TEV protease, while dialyzing against 4 L of 10 mM Tris, 200 mM NaCl, 5 mM $\beta$-mercaptoethanol, pH 7.5. To remove the cleaved affinity tag and any un-cleaved material, the protein was applied to an orthogonal Ni-NTA column, and the flow-through was loaded on a C4 HPLC column, in 0.1% Trifluoroacetic acid, and eluted with a linear gradient of 0.1% Trifluoroacetic acid in acetonitrile. The fractions containing the proteins of interest were identified by SDS-PAGE, pooled and lyophilized. Lyophilized N130 and N130^+A2 proteins were resuspended in 6 M Guanidine hydrochloride, 25 mM Tris, pH 7.5 and reduced by the addition of 10 mM dithiothreitol. The proteins were refolded by dialysis, using 3 exchanges of 10 mM Tris, 150 mM NaCl, 2 mM dithiothreitol, pH 7.5, at 4 °C. The protein concentration during refolding was maintained at or below 100 µM N130 monomer. 
Protein identities were verified by determining their molecular weight using mass spectrometry in the Center for Proteomics and Metabolomics at St. Jude Children’s Research Hospital.

Fluorescence labeling of proteins

N130 and N130^+A2 were labeled with Alexa-488 (ThermoFisher, Waltham, MA, USA) at Cys104 by incubating a molar excess of Alexa-488 maleimide with freshly reduced N130 proteins overnight at 4 °C with oscillation. Excess dye was removed by successive rounds of dialysis against 10 mM Tris, pH 7.5, 150 mM NaCl, and 2 mM DTT. Labeled proteins were then unfolded in the presence of 10 mM Tris, pH 7.5, 2 mM DTT, and 6 M GdmHCl and combined with unlabeled protein to a final concentration of 10% labeled protein and refolded by successive rounds of dialysis against 10 mM Tris, pH 7.5, 150 mM NaCl, and 2 mM DTT.

Fluorescence microscopy measurements

Microscopy plates (Greiner Bio, Kremsmünster, Austria) and slides (Grace BioLabs, Bend, OR, USA) were coated with PlusOne Repel Silane ES (GE Healthcare, Pittsburgh, PA, USA) and Pluronic F-127 (Sigma-Aldrich, St. Louis, MO, USA) and washed with water before the transfer of protein solutions. Fluorescence microscopy experiments were performed using a 3i Marianas system (Intelligent Imaging Innovations Inc., Denver, CO, USA) configured with a Yokogawa CSU-W spinning disk confocal microscope utilizing a ×100/1.45 N.A. Zeiss objective (Zeiss Jena, Germany), a Photometric Prime 95B camera (Teledyne, Seattle, Washington, USA), a ×1.5 additional magnification optovar providing 70 nm pixels, and appropriate excitation and emission band pass filters. With a peak emission wavelength of 550 nm, the Rayleigh resolution for this instrument is 213 nm. Acquisition hardware was controlled using the Slidebook 6 software from 3i. The phase diagram depicted in Fig. 2d was generated by averaging the index of dispersion over five fluorescence microscopy images per well. The threshold for positive phase separation was set to 10% of the maximum value. FRAP experiments were performed by bleaching a circular area with a diameter of 1 µm in the center of droplets ($n=12$) to ~50% of initial fluorescence intensity. The observed fluorescence intensities were then normalized to global photobleaching during data acquisition and fitted as a group to determine recovery times according to Eq. (1)¹¹⁹:

$${I}_{t}=\frac{{I}_{0}+{I}_{\infty }\frac{t}{{t}_{1/2}}}{1+\frac{t}{{t}_{1/2}}}$$

(1)

Here, I₀ is the pre-bleach intensity, I_∞ is the steady-state, post-bleach intensity, t is the time at which FRAP is measured, and t_1/2 is the time elapsed before half the pre-bleach intensity has been recovered following photobleaching. The uncertainty in the reported half-lives represents the standard error of the fit to the data. FRAP experiments were performed 1 hour after the mixing of components.
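As a minimal sketch, Eq. (1) can be evaluated directly. The function name `frap_intensity` and all numerical values below are hypothetical and are not taken from the measurements. The sketch only illustrates the shape of the curve: it starts at I₀ at t = 0, lies midway between I₀ and I_∞ at t = t_1/2, and approaches I_∞ at long times.

```python
def frap_intensity(t, i0, i_inf, t_half):
    """Evaluate Eq. (1): I(t) = (I0 + I_inf * t/t_half) / (1 + t/t_half)."""
    x = t / t_half
    return (i0 + i_inf * x) / (1.0 + x)


# Illustrative (hypothetical) parameters, in normalized intensity units.
i0, i_inf, t_half = 0.5, 0.9, 20.0

start = frap_intensity(0.0, i0, i_inf, t_half)        # value at t = 0
midpoint = frap_intensity(t_half, i0, i_inf, t_half)  # value at t = t_half
plateau = frap_intensity(1e9, i0, i_inf, t_half)      # long-time limit
```

For fixed t_1/2, the expression is linear in I₀ and I_∞, so a fit over a group of curves only needs a one-dimensional search over t_1/2.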

Peptides used in the study

The rpL5 peptide was synthesized in the Macromolecular Synthesis lab at the Hartwell Center, St. Jude Children’s Research Hospital. The lyophilized powder was directly reconstituted in buffer, and the pH was adjusted to 7.5 using 1 M Tris base.

SANS measurements

N130 and N130^+A2 were buffer exchanged into 10 mM Tris, 150 mM NaCl, 2 mM DTT, in D₂O (measured pH, 7.5). Lyophilized rpL5 peptides were resuspended in dialysis buffer. Monodisperse samples of protein only and phase-separated samples with rpL5 were prepared in the dialysis buffer. SANS experiments were performed on the extended q-range small-angle neutron scattering (EQ-SANS, BL-6) beam line at the Spallation Neutron Source (SNS) located at Oak Ridge National Laboratory (ORNL). In 30 Hz operation mode, a 4 m sample-to-detector distance with 2.5–6.1 and 9.8–13.4 Å wavelength bands was used¹²⁰ covering a combined scattering vector range of 0.006 < q < 0.44 Å⁻¹. q = 4π sin(θ)/λ, where 2θ is the scattering angle, and λ is the neutron wavelength. Samples were loaded into 1 or 2 mm pathlength circular-shaped quartz cuvettes (Hellma USA, Plainville, NY, USA) and sealed. SANS measurements were performed at 25 °C using the EQ-SANS rotating tumbler sample environment to counteract condensate settling. Data reduction followed standard procedures using MantidPlot¹²¹ and drtsans¹²². The measured scattering intensity was corrected for the detector sensitivity and scattering contribution from the solvent and empty cells, and then placed on absolute scale using a calibrated standard¹²³. Additional information regarding the data collection and analysis is given in Supplementary Table 1.
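The definition of q given above can be written as a short helper. This is a sketch under stated assumptions: the function names are hypothetical, and the conversion d ≈ 2π/q is a commonly used rule of thumb for relating a scattering vector to a real-space length scale, not a statement made in the source.

```python
import math


def q_from_angle(two_theta_deg, wavelength_angstrom):
    """q = 4*pi*sin(theta)/lambda, where 2*theta is the scattering angle."""
    theta = math.radians(two_theta_deg) / 2.0
    return 4.0 * math.pi * math.sin(theta) / wavelength_angstrom


def length_scale(q):
    """Approximate real-space length (in Angstrom) probed at q (in 1/Angstrom)."""
    return 2.0 * math.pi / q


# Length scales corresponding to the ends of the reported q-range.
d_low_q = length_scale(0.006)
d_high_q = length_scale(0.44)
```

Under this rule of thumb, the reported q-range spans lengths from roughly 10³ Å down to roughly 14 Å.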

Atomistic MC simulations using the ABSINTH model

For the first step of systematic CG, we performed atomistic MC simulations to obtain a robust description of the conformational ensembles of N130. For this, we employed the ABSINTH implicit solvation model and forcefield paradigm⁸⁷. In this model, all polypeptide atoms and solution ions are modeled explicitly, and the degrees of freedom are the backbone and sidechain dihedral angles as well as the translational motions of the solution ions, which are spheres. All simulations were performed using version 2.0 of the CAMPARI modeling package (http://campari.sourceforge.net) and the abs_opls_3.2.prm parameter set. The initial structure of N130 was modelled as a pentamer, where the structure of the ODs is based on the coordinates deposited in the protein data bank (PDB ID: 4N8M). The structures of each disordered N-terminal tail (residues 1–18, GSHMEDSMDMDMSPL) and disordered A2 tract (residues 124–133, EDAESEDEDE) were built using CAMPARI. The degrees of freedom internal to the ODs were held fixed during the ABSINTH simulations, reflecting the fact that the domains are well folded and tightly bound to each other. The system was placed in a soft-wall spherical potential with radius 70 Å. We included sodium and chloride ions to mimic the salt concentration of ~20 mM, in addition to neutralizing ions. The simulation temperature was set to 300 K.

For efficient sampling of the conformational ensemble, we first performed simulations based on the so-called excluded volume or EV limit. In this limit, all terms other than the steric repulsions and any dihedral angle terms in the potential functions are switched off. Note that the ABSINTH model uses fixed bond lengths and bond angles. These initializing simulations were performed for 10⁸ MC steps. We sampled 100 different structures from the EV limit simulations and used each of them as the initial structure for simulations based on the full potential. Each simulation consists of 10⁸ MC steps, and the structural information was stored every 5000 steps. Hence, we collected 20,000 snapshots per trajectory, from 100 independent trajectories of atomistic simulations. Next, we performed ABSINTH-based MC simulations for 2.1 × 10⁷ MC steps with sampling frequency of (5000 steps)⁻¹, where the first 10⁶ MC steps were discarded as equilibration.

Computations of scattering profiles using CAMPARI

We computed the scattering form factor P(q) from snapshots generated using the ABSINTH-based simulations. We excised the conformations of N130 pentamers, and computed Kratky profiles using the scattercalc functionality within the CAMPARI package (http://campari.sourceforge.net). In these calculations, the scattering cross-sections of all atoms are set to unity. The results we obtain for P(q) are plotted as log(P(q)) versus log(q) in Supplementary Fig. 1.

Systematic CG

Our approach to developing CG models involves three steps: First, we choose the resolution for the CG model. Second, we choose the form for the potential functions that describe interactions among pairs of CG sites. And third, we use a Gaussian process Bayesian optimization (GPBO) module to parameterize the model to ensure that the CG model recaptures conformational statistics of the atomistic simulations. In our choice of the model, each residue is modelled as a bead, except for the OD of N130, which by itself forms a large bead with excluded volume. The mass of each bead is determined by the total mass belonging to the specific bead. For example, the OD bead has mass of 47544.8 amu. The position of each bead is set equal to the position of the center of mass in its atomistic representation.

The potential function used for the simulations is decomposed into five different terms in Eq. (2):

$$W={W}_{{{\mbox{LJ}}}}+{W}_{{el}}+{W}_{b}+{W}_{\theta }+{W}_{\phi }$$

(2)

Each term contains several interaction parameters, which were parameterized using the CAMELOT algorithm⁸⁶. This uses a GPBO module to minimize an objective function, which is defined as the difference between the site-to-site distance distributions generated by atomistic MC simulations based on the ABSINTH model and by CG MD simulations. Within the CAMELOT algorithm, we change the parameters of the potential function for the CG model, perform CG MD simulations to obtain conformational statistics, specifically inter-site distance distributions, quantify the objective function, and iterate until a stationary state is reached for the objective function. In this work, we used conformational statistics derived from the atomistic simulations of structural ensembles of N130 and R-rich peptides as the reference against which the CG model was parameterized. The optimized parameters are given in Supplementary Tables 2–9. The bead types and corresponding amino acid residues are given in Supplementary Fig. 7.

To reduce the computational cost of scanning the parameter space, we grouped several amino acid types into one bead type, following previous work. For the rpL5 peptide, we grouped different amino acids into three different bead types: charged (K, R, E, D), large (V, F, M, I, L, Y, Q, N, W, H), and small (A, P, G, S, T, C). Each group has its own ${\epsilon }_{i}$ value to be determined. For each residue in either the large or small group, we set ${\sigma }_{i}$ to twice the radius of gyration of the specific residue in the atomistic simulations (consequently, ${\sigma }_{i}$ is not identical for all residues in the same group; it is residue-dependent). For charged residues, ${\sigma }_{i}$ was left as a free parameter to be determined by the optimization module. Hence, we have four unknown parameters: three ${\epsilon }_{i}$ values for the charged, large, and small bead types plus one ${\sigma }_{i}$ value for the charged bead type.

Coarse-grained model for MD simulations

The CG model is summarized in Fig. 1, and the potential function used for the simulations can be decomposed into five different terms given in Eq. (2). Here, $W_{\mathrm{LJ}}$, given in Eq. (3),

$${W}_{{LJ},{ij}}=4{\epsilon }_{{ij}}\,\left[\,{\left(\frac{{\sigma }_{{ij}}}{{r}_{{ij}}}\right)}^{12}-{\left(\frac{{\sigma }_{{ij}}}{{r}_{{ij}}}\right)}^{6}\,\right],\, {r}_{{ij}} \, < \, {r}_{c}$$

(3)

is the standard LJ potential with cutoff ${r}_{c}=2.5\sigma$. We decomposed the two-body interaction parameters into one-body parameters: ${\sigma }_{{ij}}=({\sigma }_{i}+{\sigma }_{j})/2$ and ${\epsilon }_{{ij}}=\sqrt{{\epsilon }_{i}{\epsilon }_{j}}$. All energies have units of kcal/mol.
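The combination rules and Eq. (3) can be sketched as follows. The function names and the example one-body parameters are hypothetical; the cutoff is applied here as a plain truncation without an energy shift, which is an assumption about how the truncated potential is treated.

```python
import math


def mix(sigma_i, sigma_j, eps_i, eps_j):
    """Arithmetic mean for sigma_ij and geometric mean for eps_ij."""
    return 0.5 * (sigma_i + sigma_j), math.sqrt(eps_i * eps_j)


def w_lj(r, sigma_ij, eps_ij, cutoff_factor=2.5):
    """Eq. (3): 12-6 LJ energy for r < r_c = cutoff_factor * sigma_ij, else 0."""
    if r >= cutoff_factor * sigma_ij:
        return 0.0
    sr6 = (sigma_ij / r) ** 6
    return 4.0 * eps_ij * (sr6 * sr6 - sr6)


# Hypothetical one-body parameters (Angstrom, kcal/mol) for two bead types.
sigma_ij, eps_ij = mix(4.0, 6.0, 0.1, 0.4)
r_min = 2.0 ** (1.0 / 6.0) * sigma_ij   # location of the LJ minimum
well_depth = w_lj(r_min, sigma_ij, eps_ij)
```

The energy vanishes at r = σ_ij and reaches its minimum value of −ε_ij at r = 2^(1/6) σ_ij.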

The electrostatic interactions were modeled using a Debye-Hückel potential given by Eq. (4):

$${W}_{{el},{ij}}=C\frac{{q}_{i}{q}_{j}}{\epsilon {r}_{{ij}}}\exp \left(-\kappa \,{r}_{{ij}}\right),\, {r}_{{ij}} \, < \, {r}_{c}$$

(4)

Here, $C$ is a constant, $q_i$ and $q_j$ are the charges, $\epsilon$ is the dielectric constant, and $\kappa$ is the inverse Debye length. The potential was implemented with the lj/cut/coul/debye pair-style in LAMMPS¹²⁴ with cutoffs ${\sigma }_{1}=24.1$ and ${\sigma }_{2}=15.0$ Å. In this work, we used $\epsilon=80.0$ and $\kappa=0.1$ Å⁻¹. The charges were assigned manually; beads for R and K have +1, beads for D and E have −1, and other beads (including the PD bead) have 0.
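A minimal sketch of Eq. (4) follows. Two assumptions are made here and are not stated in the source: the constant C is taken to be the Coulomb conversion factor for energies in kcal/mol with distances in Å and charges in units of e (about 332.06), and the cutoff passed to the function is an illustrative value. The function name is hypothetical.

```python
import math

# Assumed value of C: Coulomb conversion factor in kcal*Angstrom/(mol*e^2).
COULOMB_C = 332.06371


def w_el(r, q_i, q_j, eps=80.0, kappa=0.1, r_cut=15.0):
    """Eq. (4): screened Coulomb (Debye-Hueckel) energy for r < r_cut, else 0."""
    if r >= r_cut:
        return 0.0
    return COULOMB_C * q_i * q_j / (eps * r) * math.exp(-kappa * r)


debye_length = 1.0 / 0.1                  # Angstrom, for kappa = 0.1 per Angstrom
attraction = w_el(10.0, +1.0, -1.0)       # R/K bead with D/E bead at 10 Angstrom
repulsion = w_el(10.0, -1.0, -1.0)        # two acidic beads at 10 Angstrom
```

With κ = 0.1 Å⁻¹, the screening length is 10 Å, so the interaction between unit charges at that separation is already reduced by a factor of e relative to the unscreened value.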

The bond and angle terms are modeled as harmonic potentials as in Eqs. (5) and (6),

$${W}_{b,i}={K}_{i}{\left({b}_{i}-{b}_{0i}\right)}^{2}$$

(5)

Equation (5) is a quadratic bonded potential implemented as the harmonic bond-style in LAMMPS¹²⁴.

$${W}_{\theta,i}={K}_{i}{\left({\theta }_{i}-{\theta }_{0i}\right)}^{2}$$

(6)

Equation (6) shows a quadratic angular term implemented as the harmonic angle-style in LAMMPS¹²⁴. The bond parameters ${K}_{b,i}$ and ${b}_{0i}$ were obtained by fitting the normal distribution to the distribution of the distance between two adjacent residues. The angle parameters ${K}_{a,i}$ and ${\theta }_{0i}$ were also obtained by fitting the normal distribution to the distribution of the angle between three adjacent residues. For the OD bead, we assigned arbitrarily high values for the energy parameters: ${K}_{b,i}$ = 60,000 kcal/mol-Å² and ${K}_{a,i}$ = 60,00,000 kcal/mol-radian², keeping the bond extremely rigid. The equilibrium length and angle were determined by the PDB structure.
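The source does not state how the fitted normal distribution is converted into a force constant. One common convention, shown here purely as an assumption, is Boltzmann inversion: if the sampled coordinate follows P(b) ∝ exp(−W_b/k_BT) with W_b = K(b − b₀)², then the variance s² of the fitted Gaussian gives K = k_BT/(2s²). The function name and the sample values are hypothetical.

```python
import statistics

KB = 0.0019872041  # Boltzmann constant in kcal/(mol*K)


def harmonic_from_samples(samples, temperature=300.0):
    """Fit a normal distribution by its mean and variance, then apply
    Boltzmann inversion for W = K*(x - x0)**2 (assumed convention)."""
    x0 = statistics.fmean(samples)
    variance = statistics.pvariance(samples, mu=x0)
    k = KB * temperature / (2.0 * variance)
    return k, x0


# Hypothetical bond-length samples in Angstrom.
k_wide, b0_wide = harmonic_from_samples([3.70, 3.80, 3.90])
k_narrow, b0_narrow = harmonic_from_samples([3.75, 3.80, 3.85])
```

Halving the width of the distribution quadruples the inferred force constant, which is why a narrow distribution corresponds to a stiff bond or angle.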

Except for the PD bead, the dihedral term is given by a Fourier series potential shown in Eq. (7),

$${W}_{\phi,i}=\mathop{\sum }\limits_{n=1}^{3}{K}_{{ni}}(1-\cos (n\,{\phi }_{0i}-{\phi }_{{ni}}))$$

(7)

This is a Fourier series dihedral term implemented as part of the class2 dihedral style in LAMMPS. The parameters corresponding to the potentials, derived from CAMELOT, for the different systems considered are in Supplementary Tables 2–9. Lastly, ${W}_{\phi,i,{{\mbox{quad}}}}={K}_{i}{\left({\phi }_{i}-{\phi }_{0i}\right)}^{2}$ is used to constrain the five arms of N130, with an arbitrarily high value of ${K}_{d,i}$ = 60,00,000 kcal/mol-radian² and experimentally determined ${\phi }_{0i}$.

Simulations of the N130 wild type and rpL5 peptides

Given the initial configuration generated using CAMELOT⁸⁶, we used the replication command in LAMMPS to generate 108 copies of N130 pentamers and 1620 copies of the rpL5 peptide. Following the replication, deform and nve/limit fixes were used to reduce the box sizes for the simulations to 250 nm. The final configuration served as the initial condition for NPT simulations to prepare systems at the correct intrinsic density. NPT simulations were run for $1\times {10}^{7}$ steps with a timestep of 1 fs. A Nose-Hoover thermostat and barostat were used, with damping constants of 100 and 1000 fs, respectively. The final configurations from these NPT simulations served as the starting configurations for NVT simulations, which were run for $1\times {10}^{8}$ steps with a timestep of 0.1 fs and a Nose-Hoover thermostat with a damping constant of 10 fs. The velocities were randomized. Trajectory snapshots were output every 50,000 steps, and we only considered the last 1000 frames for the different analyses. Supplementary Fig. 8 shows that the production runs were equilibrated. Five independent NVT replicates were run for each condition, and the standard error of the mean between replicates is used as the measure of uncertainty. The system setup is summarized in Table 1.

Table 1 System setup for NVT MD simulations of N130 and rpL5 peptides


Simulations of the N130 mutants and rpL5 peptides

Starting with the initial configuration generated using CAMELOT⁸⁶, we used the replication command in LAMMPS as before and the deform and nve/limit fixes to reduce the box sizes to approximately the same dimensions as the simulations of the N130 wild type and rpL5 peptides (Table 1). The charges within each acidic region were neutralized for these simulations. Keeping the charges neutralized, we then mixed the species using the indent fix and ran NVT simulations in LAMMPS for $2\times {10}^{8}$ steps. We used a timestep of 0.1 fs and a Nose-Hoover thermostat with a damping constant of 10 fs, as before. The velocities were randomized, as before. Trajectory snapshots were output every 50,000 steps in the last $1\times {10}^{8}$ steps, and only the last 1000 frames were considered for analyses. Supplementary Figs. 9–11 show that the production runs were equilibrated. Five independent NVT replicates were run for each condition, and the standard error of the mean between replicates was used as the measure of uncertainty.

Simulations of the LJ systems

To understand the network structure of different LJ phases, we performed NVT simulations in LAMMPS, with 10,000 LJ particles. We used the NIST parameters for a pure LJ gas, a pure LJ fluid, and a pure LJ solid to access different pure phases. In reduced units, the densities and temperatures for the different systems are given in Table 2.

Table 2 Density and temperature for different LJ phases in reduced units


With a timestep ${dt}=0.005\tau$, where $\tau$ is the dimensionless LJ time unit, an initial 50,000 steps were run to let the systems settle, after which $1\times {10}^{7}$ steps were run for data production. A Nose-Hoover thermostat was used with a damping constant of 0.5$\tau$. A non-bonded cutoff of 2.5$\sigma$ was used. The velocities were randomized, as with the N130 + rpL5 simulations. Trajectory snapshots were output every 10,000 steps. The last 500 frames of the production run were used to calculate the RDFs and degree distributions. Supplementary Figs. 12–14 show that the production runs were equilibrated. Three independent replicates were run for each condition, and the standard error of the mean between replicates was used as the measure of uncertainty.

Calculation of the RDFs $g(r)$

We used VMD¹²⁵ to calculate $g\left(r\right)$ for the different sets of beads considered in this work. To calculate the $g\left(r\right)$ for the N130 + rpL5 system, we used a bin size, ${dr}$, of 0.5 Å. Note that the g(r) profiles for the N130 mutants were computed from the pair distributions between all the acidic residues in the select acidic region, and all basic residues in the peptide. For the LJ fluid, we used a bin size of 0.025 ${dr}/\sigma$. As with our analysis of the N130 + rpL5 system, we averaged over all replicates.
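The RDFs in this work were computed with VMD. For readers who want to see what an inter-set g(r) involves, the following from-scratch sketch histograms distances between two distinct sets of beads in a cubic periodic box and normalizes by the ideal-gas expectation. The function name, the cubic-box assumption, and the minimum-image treatment are illustrative choices and do not describe the VMD implementation.

```python
import math


def inter_set_rdf(set_a, set_b, box, dr, r_max):
    """g(r) between two distinct sets of points in a cubic periodic box.

    r_max should not exceed box / 2 for the minimum-image convention."""
    n_bins = int(r_max / dr)
    counts = [0] * n_bins
    for a in set_a:
        for b in set_b:
            d2 = 0.0
            for xa, xb in zip(a, b):
                d = xa - xb
                d -= box * round(d / box)      # minimum-image convention
                d2 += d * d
            k = int(math.sqrt(d2) / dr)
            if k < n_bins:
                counts[k] += 1
    rho_b = len(set_b) / box ** 3              # number density of set B
    centers, g = [], []
    for k, c in enumerate(counts):
        shell = 4.0 / 3.0 * math.pi * (((k + 1) * dr) ** 3 - (k * dr) ** 3)
        centers.append((k + 0.5) * dr)
        g.append(c / (len(set_a) * rho_b * shell))
    return centers, g


# Two hypothetical beads 1.2 Angstrom apart, bin size 0.5 Angstrom.
r_vals, g_vals = inter_set_rdf([(0.0, 0.0, 0.0)], [(1.2, 0.0, 0.0)],
                               box=10.0, dr=0.5, r_max=5.0)
```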

Graph-building methods

First, we identify the regions on different molecules that contribute to the network structure of the fluid we wish to investigate. We focused our analysis on the network formed by acidic residues in a particular acidic region in N130 and the basic residues in rpL5. This defines two sets of residues. Given the two sets of residues/beads, we calculate the inter-set g(r) as explained in the methods. For each g(r) we compute the location of the first minimum. The locations of these minima serve as the cutoff radii for defining the presence of an edge between beads.
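Locating the first minimum of a tabulated g(r) can be sketched with a simple scan. The source reports using signal.find_peaks from SciPy for this step; the dependency-free loop below is a stand-in that assumes a reasonably smooth g(r), and the function name and data are hypothetical.

```python
def first_minimum(r, g):
    """Return r at the first local minimum of g, i.e. the first point that
    is lower than its left neighbor and not higher than its right neighbor.
    Returns None if no such point exists."""
    for k in range(1, len(g) - 1):
        if g[k] < g[k - 1] and g[k] <= g[k + 1]:
            return r[k]
    return None


# Hypothetical tabulated RDF: excluded region, first peak, then first minimum.
r_grid = [0.5 * k for k in range(8)]
g_grid = [0.0, 0.0, 1.0, 3.0, 2.0, 1.0, 1.5, 1.2]
cutoff_radius = first_minimum(r_grid, g_grid)
```

For noisy RDFs, smoothing before the scan or a prominence criterion would be needed to avoid picking up spurious minima.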

Given two sets of beads, a particular trajectory snapshot, and the computed cutoff radius, we generate a bead adjacency matrix. A basic bead is considered adjacent to an acidic bead if the distance between the selected acidic and basic beads is within the cutoff radius. This calculation is performed for all pairs defined by the two sets of beads that are chosen for the analysis. This generates a bead adjacency matrix where an edge is drawn between beads in the chosen sets if the inter-bead distances are within the computed cutoff radius. We can either consider the total bead adjacency matrix where every bead in the system is included and where all the bead types not in the initially chosen set are non-adjacent by construction, or we can generate a bead adjacency matrix where only the considered beads are included. We choose the latter.

In more detail, suppose that set-1 has $m$ bead types and set-2 has $n$ bead types. Then, given $\alpha$ N130 molecules, set-1 has $\alpha \cdot m$ beads in total. Similarly, given $\beta$ rpL5 molecules, set-2 has $\beta \cdot n$ beads in total. Therefore, the bead adjacency matrix will be an $\left(\alpha \cdot {m}+\beta \cdot {n}\right)\times \left(\alpha \cdot {m}+\beta \cdot {n}\right)$ matrix. As an example, suppose we have one N130 molecule and one rpL5 molecule. This would then give us an $\left(m+n\right)\times \left(m+n\right)$ matrix. Furthermore, suppose that the beads are ordered such that the first $m$ rows correspond to the N130 beads and the next $n$ rows to the rpL5 beads (since adjacency matrices are symmetric, the first $m$ columns would be for N130 and the next $n$ columns would be for rpL5). To go from this bead adjacency matrix to a molecular adjacency matrix, we look at blocks of the bead adjacency matrix. Let the bead adjacency matrix be $\hat{B}$. Using slice indexing, B[0:m, 0:m] corresponds to the sub-graph of N130 adjacent beads. In our case, no beads will be adjacent since the graph is intentionally constructed between the acidic residues in N130 and the basic residues in rpL5. Moving on, B[0:m, m:m + n] (or B[m:m + n, 0:m] due to symmetry) corresponds to the sub-graph between the N130 beads and the beads of the first rpL5 molecule. Similarly, if we had more than one rpL5 molecule, B[0:m, m + i*n:m + (i + 1)*n] gives us the sub-graph between the N130 beads and the beads of the $i$-th rpL5 molecule. To generate the molecular adjacency matrix, $\hat{A}$, we check whether any of the sub-graphs from the bead adjacency matrix are non-empty, that is, whether there is at least one edge in that graph or, equivalently, a 1 in the B[0:m, m + i*n:m + (i + 1)*n] block. By construction, A[0,0] = 0 in our case, meaning that molecule-0 is not adjacent to itself.

Returning to the more general case, we have the bead adjacency matrix that corresponds to a matrix of size $\left(\alpha \cdot m+\beta \cdot n\right)\times \left(\alpha \cdot m+\beta \cdot n\right)$ where the first $\alpha \cdot m$ rows (or columns) correspond to the beads in N130 molecules, and the last $\beta \cdot n$ rows (or columns) correspond to the beads in rpL5 molecules. We check all the blocks of the matrix, which correspond to the beads between the different molecules in the system. These correspond to the sub-graphs between the beads of different molecules. For any sub-graph or block that is non-empty, we consider the two corresponding molecules to be adjacent. This gives the molecular adjacency matrix, which should have the shape $\left(\alpha+\beta \right)\times \left(\alpha+\beta \right)$. This is the final graph that is analyzed. Graph properties are calculated per-snapshot and then averaged over the total set of frames considered.

The molecular adjacency graph is constructed by considering the adjacency between any of the beads from the initially selected set. Since we only care about the acidic and basic beads, by construction this graph avoids self-loops. Furthermore, since we only care about specific blocks of the bead adjacency matrix being non-empty, we only obtain an unweighted graph. If, however, we wanted to obtain the weighted graph, we would simply take the sum of the number of edges in a particular sub-graph or take the sum of that block from the bead adjacency matrix.
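The block reduction described above can be sketched with nested lists. The function name, the `block_sizes` argument (number of selected beads per molecule, in matrix order), and the toy matrix are hypothetical. Setting `weighted=True` returns the block sums, which corresponds to the weighted graph mentioned above.

```python
def molecular_adjacency_from_blocks(B, block_sizes, weighted=False):
    """Reduce a bead adjacency matrix B to a molecular adjacency matrix.

    block_sizes[i] is the number of rows (and columns) of B that belong
    to molecule i. Diagonal blocks are skipped, so there are no self-loops."""
    starts = [0]
    for size in block_sizes:
        starts.append(starts[-1] + size)
    n_mol = len(block_sizes)
    A = [[0] * n_mol for _ in range(n_mol)]
    for i in range(n_mol):
        for j in range(n_mol):
            if i == j:
                continue
            total = sum(B[p][q]
                        for p in range(starts[i], starts[i + 1])
                        for q in range(starts[j], starts[j + 1]))
            A[i][j] = total if weighted else int(total > 0)
    return A


# Toy system: one N130 (m = 2 beads) and two rpL5 molecules (n = 2 beads each).
B = [[0] * 6 for _ in range(6)]
for p, q in [(0, 2), (0, 3), (1, 2)]:   # bead-bead contacts, made symmetric
    B[p][q] = B[q][p] = 1
A_unweighted = molecular_adjacency_from_blocks(B, [2, 2, 2])
A_weighted = molecular_adjacency_from_blocks(B, [2, 2, 2], weighted=True)
```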

On an implementation level, we can skip most of the block reductions by simply asking the following: given the two sets of beads and the cutoff radius, which pairs of beads are adjacent. Then, from the pairs of adjacent beads, we ask which molecules the beads in the pairs come from. Specifically, we identify the molecule-ID of each bead. From this set of molecule-ID pairs, we find the unique pairs. Given these unique pairs of adjacent molecule-IDs, we construct the molecular adjacency matrix since the molecule-IDs directly correspond to the indices in the adjacency matrix. We set those elements of the $\left(\alpha+\beta \right)\times \left(\alpha+\beta \right)$ matrix to one, and we obtain our molecular adjacency matrix.
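The shortcut described above, going from adjacent bead pairs to unique molecule-ID pairs, can be sketched as follows. This is an illustrative stand-in rather than the analysis code used for the paper: the function names and coordinates are hypothetical, and periodic boundary conditions are ignored for brevity.

```python
import math


def adjacent_molecule_pairs(acidic, basic, cutoff):
    """acidic and basic are lists of (molecule_id, (x, y, z)) tuples.
    Returns the set of unique (acidic molecule, basic molecule) ID pairs
    that have at least one bead pair within the cutoff."""
    pairs = set()
    for mol_a, pos_a in acidic:
        for mol_b, pos_b in basic:
            if math.dist(pos_a, pos_b) <= cutoff:
                pairs.add((mol_a, mol_b))
    return pairs


def adjacency_from_pairs(pairs, n_molecules):
    """Fill a symmetric, unweighted molecular adjacency matrix."""
    A = [[0] * n_molecules for _ in range(n_molecules)]
    for i, j in pairs:
        A[i][j] = A[j][i] = 1
    return A


# Molecule 0 carries the acidic beads; molecules 1 and 2 carry basic beads.
acidic_beads = [(0, (0.0, 0.0, 0.0)), (0, (1.0, 0.0, 0.0))]
basic_beads = [(1, (0.5, 0.0, 0.0)), (2, (10.0, 0.0, 0.0))]
mol_pairs = adjacent_molecule_pairs(acidic_beads, basic_beads, cutoff=1.0)
A = adjacency_from_pairs(mol_pairs, n_molecules=3)
```

Using a set removes duplicate molecule pairs automatically, which is what makes the resulting graph unweighted.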

Degree distributions

From a given trajectory snapshot, we generate molecular graphs, $G(V,E)$, where individual molecules are represented as nodes, $V$. To calculate the edges between nodes in a generalizable way, we use the first minima from the RDFs of the given sets of beads. We use signal.find_peaks from the SciPy package¹²⁶ to find these minima. We generated RDFs for the acidic/negatively charged beads in N130 from a particular acidic region, and the basic residues in the rpL5 peptide. Given the RDFs, we then use the first minimum as the cutoff radius for the definition of an edge. An edge, $E$, is therefore drawn between two nodes if any of the beads from the considered sets are within the distance corresponding to the first minimum of the g(r) of interest. We use MDAnalysis¹²⁷ to analyze the trajectories to find adjacent molecules. Given these adjacency matrices, we then calculate the degree of each node by calculating the total number of edges for each node, or by calculating the sum of each row (or column) of the adjacency matrix. The Python package NumPy¹²⁸ is used to generate these degrees. From the degrees, we calculate the degree distribution. The degree distributions from the last 10³ frames are averaged, and the average over the five simulation replicates is reported here.
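The degree calculation and the averaging over frames can be sketched in a few lines. The source uses NumPy for this step; the version below uses only the standard library, and the function names and toy matrices are hypothetical.

```python
from collections import Counter


def degrees(A):
    """Degree of each node as the row sum of the adjacency matrix."""
    return [sum(row) for row in A]


def degree_distribution(A):
    """Fraction of nodes with each degree, as a dict {degree: probability}."""
    deg = degrees(A)
    counts = Counter(deg)
    return {k: counts[k] / len(deg) for k in sorted(counts)}


def average_distributions(dists):
    """Average per-frame distributions, treating missing degrees as zero."""
    keys = sorted(set().union(*dists))
    return {k: sum(d.get(k, 0.0) for d in dists) / len(dists) for k in keys}


# Toy adjacency matrix: node 0 is bonded to nodes 1 and 2.
A_frame = [[0, 1, 1], [1, 0, 0], [1, 0, 0]]
dist_frame = degree_distribution(A_frame)
dist_mean = average_distributions([{1: 1.0}, {1: 0.5, 2: 0.5}])
```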

Mean square displacements

The MSD was calculated by averaging the displacements in particle positions over all windows of length $m$ and over all particles $N$ as shown in Eq. (8):

$${MSD}\left(m\right)=\frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}\frac{1}{N-m}\mathop{\sum }\limits_{k=0}^{N-m-1}{\left({\vec{r}}_{i}\left(k+m\right)-{\vec{r}}_{i}(k)\right)}^{2}$$

(8)

for $m=1,\ldots,{N}-1$. For the acidic regions and rpL5, the MSD was calculated with respect to all acidic and basic residues, respectively. For all MSDs, we fit the first 30 ps and everything past 400 ps using single exponents to describe the two different regimes. For the MSD with respect to the PD, the region from ~80–120 ps was found to fit best to a simple diffusion model with ${R}^{2}$ = 0.9987. The crossover t_D between the super- and sub-diffusive regimes is the median value within the diffusive region of the MSD with respect to the PD. In all cases, the MSD was averaged over five replicates. To calculate the histograms of the exponents of the individual molecules, we modified Eq. (8) to calculate the MSD with respect to each PD rather than averaging over all PDs. We then fit each MSD as before.
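The windowed average in Eq. (8) and the extraction of a scaling exponent can be sketched as follows. This is a minimal illustration, not the analysis code used here: the function names are hypothetical, the number of frames is taken from each trajectory so that it stays distinct from the number of particles, and the exponent is obtained from a least-squares line through log(MSD) versus log(lag), which is one common way to fit a single power law.

```python
import math


def msd(trajectories, m):
    """Windowed MSD at lag m, averaged over windows and then over particles.

    trajectories is a list with one entry per particle; each entry is a
    list of (x, y, z) positions, one per frame."""
    total = 0.0
    for traj in trajectories:
        n_frames = len(traj)
        acc = 0.0
        for k in range(n_frames - m):
            acc += sum((a - b) ** 2 for a, b in zip(traj[k + m], traj[k]))
        total += acc / (n_frames - m)
    return total / len(trajectories)


def loglog_exponent(lags, values):
    """Slope of log(values) versus log(lags) by ordinary least squares."""
    xs = [math.log(t) for t in lags]
    ys = [math.log(v) for v in values]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den


# Toy check: a particle moving at constant velocity gives MSD(m) = m**2.
ballistic = [[(float(k), 0.0, 0.0) for k in range(20)]]
lags = [1, 2, 3, 4, 5]
exponent = loglog_exponent(lags, [msd(ballistic, m) for m in lags])
```

An exponent above one indicates super-diffusive motion and an exponent below one indicates sub-diffusive motion. Passing a single-particle list to `msd` gives the per-molecule curves needed for a histogram of exponents.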

Plotting

To generate the plots, we use Matplotlib¹²⁹ along with Seaborn¹³⁰. Adobe Illustrator® is used to generate the final figures shown.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Source data are provided as a Source Data file with this paper and via the GitHub repository of the Pappu lab at https://github.com/Pappulab/n130-liquid-structure/. Input files for the simulations and coordinate files of the final outputs are available via Zenodo at https://zenodo.org/doi/10.5281/zenodo.10823199¹³¹. PDB 4N8M is available from the Protein Data Bank.

Code availability

All custom code for the analyses can be found in the GitHub repository of the Pappu lab at https://github.com/Pappulab/n130-liquid-structure/. Python (v3.9), VMD (v1.9.3), and MATLAB (R2021b) were used for data analysis. All CAMPARI simulations were performed using version 2.0, available at http://campari.sourceforge.net. All CAMELOT simulations were performed using version 0.1.2. All MD simulations were performed in LAMMPS (16 Dec. 2013).

References

Banani, S. F., Lee, H. O., Hyman, A. A. & Rosen, M. K. Biomolecular condensates: organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol. 18, 285–298 (2017).
Brangwynne, C. P. et al. Germline P granules are liquid droplets that localize by controlled dissolution/condensation. Science 324, 1729–1732 (2009).
Brangwynne, C. P., Mitchison, T. J. & Hyman, A. A. Active liquid-like behavior of nucleoli determines their size and shape in Xenopus laevis oocytes. PNAS 108, 4334–4339 (2011).
Feric, M. et al. Coexisting liquid phases underlie nucleolar subcompartments. Cell 165, 1686–1697 (2016).
Shin, Y. et al. Spatiotemporal control of intracellular phase transitions using light-activated optoDroplets. Cell 168, 159–171.e114 (2017).
Shin, Y. & Brangwynne, C. P. Liquid phase condensation in cell physiology and disease. Science 357, eaaf4382 (2017).
Taylor, N. et al. Biophysical characterization of organelle-based RNA/protein liquid phases using microfluidics. Soft Matter 12, 9142–9150 (2016).
Hyman, A. A., Weber, C. A. & Jülicher, F. Liquid-liquid phase separation in biology. Annu. Rev. Cell Dev. Biol. 30, 39–58 (2014).
Li, P. et al. Phase transitions in the assembly of multivalent signalling proteins. Nature 483, 336–340 (2012).
Mittag, T. & Pappu, R. V. A conceptual framework for understanding phase separation and addressing open questions and challenges. Mol. Cell 82, 2201–2214 (2022).
King, M. R. et al. Macromolecular condensation organizes nucleolar sub-phases to set up a pH gradient. Cell 187, 1–18 (2024).
Alshareedah, I., Moosa, M. M., Pham, M., Potoyan, D. A. & Banerjee, P. R. Programmable viscoelasticity in protein-RNA condensates with disordered sticker-spacer polypeptides. Nat. Commun. 12, 6620 (2021).
Keizer, V. I. P. et al. Live-cell micromanipulation of a genomic locus reveals interphase chromatin mechanics. Science 377, 489–495 (2022).
Feric, M. et al. Mesoscale structure–function relationships in mitochondrial transcriptional condensates. PNAS 119, e2207303119 (2022).
Böddeker, T. J. et al. Non-specific adhesive forces between filaments and membraneless organelles. Nat. Phys. 18, 571–578 (2022).
Zhou, H.-X. Viscoelasticity of biomolecular condensates conforms to the Jeffreys model. J. Chem. Phys. 154, 041103 (2021).
Ghosh, A., Kota, D. & Zhou, H.-X. Shear relaxation governs fusion dynamics of biomolecular condensates. Nat. Commun. 12, 5995 (2021).
Bergeron-Sandoval, L. P. et al. Endocytic proteins with prion-like domains form viscoelastic condensates that enable membrane remodeling. PNAS 118, e2113789118 (2021).
Alshareedah, I., Kaur, T. & Banerjee, P. R. Methods for characterizing the material properties of biomolecular condensates. Methods Enzymol. 646, 143–183 (2021).
Roberts, S. et al. Injectable tissue integrating networks from recombinant polypeptides with tunable order. Nat. Mater. 17, 1154–1163 (2018).
Berry, J., Brangwynne, C. P. & Haataja, M. Physical principles of intracellular organization via active and passive phase transitions. Rep. Prog. Phys. 81, 046601 (2018).
Pappu, R. V., Cohen, S. R., Dar, F., Farag, M. & Kar, M. Phase transitions of associative biomacromolecules. Chem. Rev. 123, 8945–8987 (2023).
Zhang, Z., Chen, Q. & Colby, R. H. Dynamics of associative polymers. Soft Matter 14, 2961–2977 (2018).
Choi, J. M., Holehouse, A. S. & Pappu, R. V. Physical principles underlying the complex biology of intracellular phase transitions. Annu. Rev. Biophys. 49, 107–133 (2020).
Choi, J. M., Dar, F. & Pappu, R. V. LASSI: a lattice model for simulating phase transitions of multivalent proteins. PLoS Comput. Biol. 15, e1007028 (2019).
Powers, S. K. et al. Nucleo-cytoplasmic partitioning of ARF proteins controls auxin responses in Arabidopsis thaliana. Mol. Cell 76, 177–190.e175 (2019).
Sanders, D. W. et al. Competing protein-RNA interaction networks control multiphase intracellular organization. Cell 181, 306–324.e328 (2020).
Lin, A. Z. et al. Dynamical control enables the formation of demixed biomolecular condensates. Nat. Commun. 14, 7678 (2023).
Farag, M., Borcherds, W. M., Bremer, A., Mittag, T. & Pappu, R. V. Phase separation of protein mixtures is driven by the interplay of homotypic and heterotypic interactions. Nat. Commun. 14, 5527 (2023).
Farag, M. et al. Condensates formed by prion-like low-complexity domains have small-world network structures and interfaces defined by expanded conformations. Nat. Commun. 13, 7722 (2022).
Bremer, A. et al. Deciphering how naturally occurring sequence features impact the phase behaviours of disordered prion-like domains. Nat. Chem. 14, 196–207 (2022).
Zeng, X., Holehouse, A. S., Chilkoti, A., Mittag, T. & Pappu, R. V. Connecting coil-to-globule transitions to full phase diagrams for intrinsically disordered proteins. Biophys. J. 119, 402–418 (2020).
Yang, P. et al. G3BP1 is a tunable switch that triggers phase separation to assemble stress granules. Cell 181, 325–345.e328 (2020).
Schmit, J. D., Bouchard, J. J., Martin, E. W. & Mittag, T. Protein network structure enables switching between liquid and gel states. J. Am. Chem. Soc. 142, 874–883 (2020).
Martin, E. W. et al. Valence and patterning of aromatic residues determine the phase behavior of prion-like domains. Science 367, 694–699 (2020).
Wang, J. et al. A molecular grammar governing the driving forces for phase separation of prion-like RNA binding proteins. Cell 174, 688–699.e616 (2018).
Choi, J. M., Hyman, A. A. & Pappu, R. V. Generalized models for bond percolation transitions of associative polymers. Phys. Rev. E 102, 042403 (2020).
Guillen-Boixet, J. et al. RNA-induced conformational switching and clustering of G3BP drive stress granule assembly by condensation. Cell 181, 346–361.e317 (2020).
Pak, C. W. et al. Sequence determinants of intracellular phase separation by complex coacervation of a disordered protein. Mol. Cell 63, 72–85 (2016).
Priftis, D., Megley, K., Laugel, N. & Tirrell, M. Complex coacervation of poly(ethylene-imine)/polypeptide aqueous solutions: Thermodynamic and rheological characterization. J. Colloid Interface Sci. 398, 39–50 (2013).
Neitzel, A. E. et al. Polyelectrolyte complex coacervation across a broad range of charge densities. Macromolecules 54, 6878–6890 (2021).
Sing, C. E. & Perry, S. L. Recent progress in the science of complex coacervation. Soft Matter 16, 2885–2914 (2020).
Adhikari, S., Leaf, M. A. & Muthukumar, M. Polyelectrolyte complex coacervation by electrostatic dipolar interactions. J. Chem. Phys. 149, 163308 (2018).
Galvanetto, N. et al. Extreme dynamics in a biomolecular condensate. Nature 619, 876–883 (2023).
Margossian, K. O., Brown, M. U., Emrick, T. & Muthukumar, M. Coacervation in polyzwitterion-polyelectrolyte systems and their potential applications for gastrointestinal drug delivery platforms. Nat. Commun. 13, 2250 (2022).
Brangwynne, C. P., Tompa, P. & Pappu, R. V. Polymer physics of intracellular phase transitions. Nat. Phys. 11, 899–904 (2015).
Ogston, A. G. On the interaction of solute molecules with porous networks. J. Phys. Chem. 74, 668–669 (1970).
Chauhan, G., Bremer, A., Dar, F., Mittag, T. & Pappu, R. V. Crowder titrations enable the quantification of driving forces for macromolecular phase separation. Biophys. J. https://doi.org/10.1016/j.bpj.2023.09.006 (2023).
Chowdhury, A. et al. Driving forces of the complex formation between highly charged disordered proteins. PNAS 120, e2304036120 (2023).
Veis, A. A review of the early development of the thermodynamics of the complex coacervation phase separation. Adv. Colloid Interface Sci. 167, 2–11 (2011).
Kar, M. et al. Phase separating RNA binding proteins form heterogeneous distributions of clusters in subsaturated solutions. PNAS 119, e2202222119 (2022).
Lan, C. et al. Quantitative real-time in-cell imaging reveals heterogeneous clusters of proteins prior to condensation. Nat. Commun. 14, 4831 (2023).
Harmon, T. S., Holehouse, A. S., Rosen, M. K. & Pappu, R. V. Intrinsically disordered linkers determine the interplay between phase separation and gelation in multivalent proteins. eLife 6, e30294 (2017).
Semenov, A. N. & Rubinstein, M. Thermoreversible gelation in solutions of associative polymers. 1. Statics. Macromolecules 31, 1373–1385 (1998).
Flory, P. J. Molecular size distribution in three dimensional polymers. I. Gelation. J. Am. Chem. Soc. 63, 3083–3090 (1941).
Flory, P. J. Thermodynamics of high polymer solutions. J. Chem. Phys. 10, 51–61 (1942).
Shillcock, J. C., Lagisquet, C., Alexandre, J., Vuillon, L. & Ipsen, J. H. Model biomolecular condensates have heterogeneous structure quantitatively dependent on the interaction profile of their constituent macromolecules. Soft Matter 18, 6674–6693 (2022).
Alshareedah, I. et al. Sequence-encoded grammars determine material properties and physical aging of protein condensates. bioRxiv https://www.biorxiv.org/content/10.1101/2023.04.06.535902v1 (2023).
Vilgis, T. A. 8 – Polymer networks. Compr. Polym. Sci. Suppl. 8, 227–279 (1989).
Bhandari, K., Cotten, M. A., Kim, J., Rosen, M. K. & Schmit, J. D. Structure–function properties in disordered condensates. J. Phys. Chem. B 125, 467–476 (2021).
Wróbel, J. K., Cortez, R. & Fauci, L. Modeling viscoelastic networks in Stokes flow. Phys. Fluids 26, 113102 (2014).
Jawerth, L. et al. Protein condensates as aging Maxwell fluids. Science 370, 1317–1323 (2020).
Jabbari-Farouji, S. et al. High-bandwidth viscoelastic properties of aging colloidal glasses and gels. Phys. Rev. E 78, 061402 (2008).
Elstone, N. S. et al. Understanding the liquid structure in mixtures of ionic liquids with semiperfluoroalkyl or alkyl chains. J. Phys. Chem. B 127, 7394–7407 (2023).
Hirosawa, K. et al. SANS study on the solvated structure and molecular interactions of a thermo-responsive polymer in a room temperature ionic liquid. Phys. Chem. Chem. Phys. 18, 17881–17889 (2016).
Tanaka, H., Tong, H., Shi, R. & Russo, J. Revealing key structural features hidden in liquids and glasses. Nat. Rev. Phys. 1, 333–348 (2019).
Malenkov, G. G. Structure and dynamics of liquid water. J. Struct. Chem. 47, S1–S31 (2006).
Mühlbauer, S. et al. Magnetic small-angle neutron scattering. Rev. Mod. Phys. 91, 015004 (2019).
Pederson, T. The nucleolus. Cold Spring Harb. Perspect. Biol. 3, a000638 (2011).
Lafontaine, D. L. J., Riback, J. A., Bascetin, R. & Brangwynne, C. P. The nucleolus as a multiphase liquid condensate. Nat. Rev. Mol. Cell Biol. 22, 165–182 (2021).
Mitrea, D. M. et al. Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA. eLife 5, e13571 (2016).
Mitrea, D. M. & Kriwacki, R. W. Phase separation in biology; functional organization of a higher order. Cell Commun. Signal. 14, 1 (2016).
Ferrolino, M. C., Mitrea, D. M., Michael, J. R. & Kriwacki, R. W. Compositional adaptability in NPM1-SURF6 scaffolding networks enabled by dynamic switching of phase separation mechanisms. Nat. Commun. 9, 5064 (2018).
Mitrea, D. M. et al. Self-interaction of NPM1 modulates multiple mechanisms of liquid-liquid phase separation. Nat. Commun. 9, 842 (2018).
Riback, J. A. et al. Composition-dependent thermodynamics of intracellular phase separation. Nature 581, 209–214 (2020).
Riback, J. A. et al. Viscoelasticity and advective flow of RNA underlies nucleolar form and function. Mol. Cell 83, 3095–3107.e3099 (2023).
Mitrea, D. M. et al. Structural polymorphism in the N-terminal oligomerization domain of NPM1. PNAS 111, 4466–4471 (2014).
Clark, G. N. I., Hura, G. L., Teixeira, J., Soper, A. K. & Head-Gordon, T. Small-angle scattering and the structure of ambient liquid water. PNAS 107, 14003–14007 (2010).
Borodin, O. et al. Liquid structure with nano-heterogeneity promotes cationic transport in concentrated electrolytes. ACS Nano 11, 10462–10471 (2017).
Maier, E. E. et al. Liquid like order of charged rodlike particle solutions. Macromolecules 25, 1125–1133 (1992).
Londono, J. D., Annis, B. K., Turner, J. Z. & Soper, A. K. The intermolecular hydrogen–hydrogen structure of chain–molecule liquids from neutron diffraction. J. Chem. Phys. 101, 7868–7872 (1994).
Cousin, F., Gummel, J., Ung, D. & Boué, F. Polyelectrolyte−protein complexes: structure and conformation of each specie revealed by SANS. Langmuir 21, 9675–9688 (2005).
Fujii, K., Kumai, T., Takamuku, T., Umebayashi, Y. & Ishiguro, S.-i Liquid structure and preferential solvation of metal ions in solvent mixtures of N,N-dimethylformamide and N-methylformamide. J. Phys. Chem. A 110, 1798–1804 (2006).
Troitzsch, R. Z., Martyna, G. J., McLain, S. E., Soper, A. K. & Crain, J. Structure of aqueous proline via parallel tempering molecular dynamics and neutron diffraction. J. Phys. Chem. B 111, 8210–8222 (2007).
Schöttl, S. et al. Combined molecular dynamics (MD) and small angle scattering (SAS) analysis of organization on a nanometer-scale in ternary solvent solutions containing a hydrotrope. J. Colloid Interface Sci. 540, 623–633 (2019).
Ruff, K. M., Harmon, T. S. & Pappu, R. V. CAMELOT: A machine learning approach for coarse-grained simulations of aggregation of block-copolymeric protein sequences. J. Chem. Phys. 143, 243123 (2015).
Vitalis, A. & Pappu, R. V. ABSINTH: a new continuum solvation model for simulations of polypeptides in aqueous solutions. J. Comput. Chem. 30, 673–699 (2009).
Fossat, M. J., Zeng, X. & Pappu, R. V. Uncovering differences in hydration free energies and structures for model compound mimics of charged side chains of amino acids. J. Phys. Chem. B 125, 4148–4161 (2021).
Banerjee, P. R., Milin, A. N., Moosa, M. M., Onuchic, P. L. & Deniz, A. A. Reentrant phase transition drives dynamic substructure formation in ribonucleoprotein droplets. Angew. Chem. Int. Ed. 56, 11354–11359 (2017).
Guinier, A. & Fournet, G. Small-angle scattering of X-rays (Wiley, 1955).
Chen, Y. M. Shaped hairy polymer nanoobjects. Macromolecules 45, 2619–2631 (2012).
de las Heras, D., Tavares, J. M. & Telo da Gama, M. M. Phase diagrams of binary mixtures of patchy colloids with distinct numbers of patches: the network fluid regime. Soft Matter 7, 5615–5626 (2011).
Dias, C. S., Araújo, N. A. M. & Telo da Gama, M. M. Dynamics of network fluids. Adv. Colloid Interface Sci. 247, 258–263 (2017).
Dias, C. S., Tavares, J. M., Araújo, N. A. M. & Telo da Gama, M. M. Dynamics of a network fluid within the liquid–gas coexistence region. Soft Matter 14, 2744–2750 (2018).
Speedy, R. J. & Debenedetti, P. G. Persistence time for bonds in a tetravalent network fluid. Mol. Phys. 86, 1375–1386 (1995).
Espinosa, J. R. et al. Liquid network connectivity regulates the stability and composition of biomolecular condensates with many components. PNAS 117, 13238–13247 (2020).
Bai, W., Sargent, C. J., Choi, J.-M., Pappu, R. V. & Zhang, F. Covalently-assembled single-chain protein nanostructures with ultra-high stability. Nat. Commun. 10, 3317 (2019).
Seeger, M. Gaussian processes for machine learning. Int. J. Neural Syst. 14, 69–106 (2004).
Pedersen, J. S. Analysis of small-angle scattering data from colloids and polymer solutions: modeling and least-squares fitting. Adv. Colloid Interface Sci. 70, 171–210 (1997).
Hansen, J.-P. & McDonald, I. R. Theory of simple liquids: with applications to soft matter 4th edn (Elsevier, 2013).
Chandler, D., Weeks, J. D. & Andersen, H. C. Van der Waals picture of liquids, solids, and phase transformations. Science 220, 787–794 (1983).
Widom, B. Intermolecular forces and the nature of the liquid state: liquids reflect in their bulk properties the attractions and repulsions of their constituent molecules. Science 157, 375–382 (1967).
Choi, J. H., Lee, H., Choi, H. R. & Cho, M. Graph theory and ion and molecular aggregation in aqueous solutions. Annu. Rev. Phys. Chem. 69, 125–149 (2018).
Bako, I., Pusztai, L. & Pothoczki, S. Topological descriptors and Laplace spectra in simple hydrogen bonded systems. J. Mol. Liq. 363, 119860 (2022).
Pusztai, L., Bako, I. & Pothoczki, S. Connecting diffraction experiments and network analysis tools for the study of hydrogen-bonded networks. J. Phys. Chem. B 127, 3109–3118 (2023).
Agayan, G. M., Balabaev, N. K. & Rodnikova, M. N. Description of mixed networks of H-bonds in a water-ethylene glycol system by methods of graph theory and Delaunay simplices. Russ. J. Phys. Chem. A 95, 1283–1290 (2021).
Faccio, C., Benzi, M., Zanetti-Polzi, L. & Daidone, I. Low- and high-density forms of liquid water revealed by a new medium-range order descriptor. J. Mol. Liq. 355, 118922 (2022).
de Oliveira, P. M. C., de Souza, J. I. R., da Silva, J. A. B. & Longo, R. L. Temperature dependence of hydrogen bond networks of liquid water: thermodynamic properties and structural heterogeneity from topological descriptors. J. Phys. Chem. B 127, 2250–2257 (2023).
Tan, A. R., Urata, S., Yamada, M. & Gomez-Bombarelli, R. Graph theory-based structural analysis on density anomaly of silica glass. Comp. Mater. Sci. 225, 112190 (2023).
Choi, J. H. & Cho, M. Ion aggregation in high salt solutions. II. Spectral graph analysis of water hydrogen-bonding network and ion aggregate structures. J. Chem. Phys. 141, 154502 (2014).
Kihara, T. & Koba, S. Crystal structures and intermolecular forces of rare gases. J. Phys. Soc. Jpn 7, 348–354 (1952).
Musacchio, A. On the role of phase separation in the biogenesis of membraneless compartments. EMBO J. 41, e109952 (2022).
Russo, J., Leoni, F., Martelli, F. & Sciortino, F. The physics of empty liquids: from patchy particles to water. Rep. Prog. Phys. 85, 016601 (2022).
Chremos, A., Panagiotopoulos, A. Z. & Koch, D. L. Dynamics of solvent-free grafted nanoparticles. J. Chem. Phys. 136, 044902 (2012).
Gibbs, E., Perrone, B., Hassan, A., Kummerle, R. & Kriwacki, R. NPM1 exhibits structural and dynamic heterogeneity upon phase separation with the p14ARF tumor suppressor. J. Magn. Reson. 310, 106646 (2020).
Bianchi, E., Largo, J., Tartaglia, P., Zaccarelli, E. & Sciortino, F. Phase diagram of patchy colloids: towards empty liquids. Phys. Rev. Lett. 97, 168301 (2006).
Sciortino, F. & Zaccarelli, E. Reversible gels of patchy particles. Curr. Opin. Solid State Mater. Sci. 15, 246–253 (2011).
Edmond, E. & Ogston, A. G. An approach to the study of phase separation in ternary aqueous systems. Biochem. J. 109, 569–576 (1968).
Feder, T. J., Brust-Mascher, I., Slattery, J. P., Baird, B. & Webb, W. W. Constrained diffusion or immobile fraction on cell surfaces: a new interpretation. Biophys. J. 70, 2767–2773 (1996).
Heller, W. T. et al. The suite of small-angle neutron scattering instruments at Oak Ridge National Laboratory. J. Appl. Crystallogr. 51, 242–248 (2018).
Arnold, O. et al. Mantid-Data analysis and visualization package for neutron scattering and mu SR experiments. Nucl. Instrum. Meth. A 764, 156–166 (2014).
Heller, W. T. et al. drtsans: the data reduction toolkit for small-angle neutron scattering at Oak Ridge National Laboratory. SoftwareX 19, 101101 (2022).
Wignall, G. D. & Bates, F. S. Absolute calibration of small-angle neutron-scattering data. J. Appl. Crystallogr. 20, 28–40 (1987).
Thompson, A. P. et al. LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comput. Phys. Commun. 271, 108171 (2022).
Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics. J. Mol. Graph Model. 14, 33–38 (1996).
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Michaud-Agrawal, N., Denning, E. J., Woolf, T. B. & Beckstein, O. Software news and updates MDAnalysis: a toolkit for the analysis of molecular dynamics simulations. J. Comput. Chem. 32, 2319–2327 (2011).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362 (2020).
Hunter, J. D. Matplotlib: a 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
Waskom, M. seaborn: statistical data visualization. J. Open Source Softw. 6, 3021 (2021).
Dar, F. et al. Biomolecular condensates form spatially inhomogeneous network fluids. Zenodo https://doi.org/10.5281/zenodo.10823199 (2024).
Mao, A. H. & Pappu, R. V. Crystal lattice properties fully determine short-range interaction parameters for alkali and halide ions. J. Chem. Phys. 137, 064104 (2012).

Acknowledgements

This work was supported by the St. Jude Children’s Research Hospital Research Collaborative on the Biology and Biophysics of RNP granules (to R.V.P. and R.W.K.), the US National Science Foundation (MCB-2227268 to R.V.P.), the US National Institutes of Health (NIGMS R01 GM115634 and R35 GM131891 to R.W.K., and NCI P30 CA021765 to St. Jude Children’s Research Hospital), ALSAC (supporting studies at St. Jude Children’s Research Hospital), and National Research Foundation (NRF) grants of Korea (2021R1C1C1010943 and 2022R1A4A1033471 to J.-M.C.). A portion of this research, conducted at the Oak Ridge National Laboratory (ORNL) Spallation Neutron Source, was sponsored by the Scientific User Facilities Division, Office of Basic Energy Sciences, U.S. Department of Energy. S.R. Cohen acknowledges financial support via T32 EB028092 from the US National Institutes of Health. We thank Jared M. Lalmansingh for technical assistance with CAMPARI. Fluorescence microscopy images were acquired at the St. Jude Cell & Tissue Imaging Center at St. Jude Children’s Research Hospital (supported by P30 CA021765); we thank V. Frohlich, J. Peters, A. Taylor, A. Pitre, and G. Campbell for technical assistance.

Author information

These authors contributed equally: Furqan Dar, Samuel R. Cohen, Jeong-Mo Choi.

Authors and Affiliations

Department of Biomedical Engineering and Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, 63130, USA
Furqan Dar, Samuel R. Cohen & Rohit V. Pappu
Center of Regenerative Medicine, Washington University in St. Louis, St. Louis, MO, 63130, USA
Samuel R. Cohen
Dewpoint Therapeutics Inc., 451 D Street, Boston, MA, 02210, USA
Diana M. Mitrea
Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, TN, 38105, USA
Aaron H. Phillips & Richard W. Kriwacki
Neutron Scattering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
Gergely Nagy & Wellington C. Leite
Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37830, USA
Christopher B. Stanley
Department of Chemistry and Chemistry Institute for Functional Materials, Pusan National University, Busan, 46241, Republic of Korea
Jeong-Mo Choi

Authors

Furqan Dar
Samuel R. Cohen
Diana M. Mitrea
Aaron H. Phillips
Gergely Nagy
Wellington C. Leite
Christopher B. Stanley
Jeong-Mo Choi
Richard W. Kriwacki
Rohit V. Pappu

Contributions

R.V.P., J.-M.C. and R.W.K. conceived the project. D.M.M. and A.H.P. prepared samples for measurements. D.M.M., A.H.P., W.C.L., G.N. and C.B.S. performed SANS measurements and analyzed SANS data. D.M.M. and A.H.P. designed and characterized the phase behaviors of mutants. J.-M.C. prototyped the CAMELOT-based CG model and the original simulations using LAMMPS. F.D. designed and performed all the LAMMPS simulations reported in this work. F.D., S.R.C. and R.V.P. designed and iterated on the structure of the analysis with input from J.-M.C. F.D. and S.R.C. carried out all of the analyses, including those of the SANS data, and integrated the findings with experimental work. F.D., S.R.C. and A.H.P. made the figures. F.D., S.R.C. and R.V.P. wrote the manuscript. All authors contributed to editing of the manuscript.

Corresponding authors

Correspondence to Jeong-Mo Choi, Richard W. Kriwacki or Rohit V. Pappu.

Ethics declarations

Competing interests

R.V.P. is a member of the scientific advisory board and shareholder of Dewpoint Therapeutics Inc. D.M.M. is an employee and shareholder of Dewpoint Therapeutics. The work reported here was not influenced by these affiliations. The remaining authors have no competing interests to declare.

Peer review

Peer review information

Nature Communications thanks Lars Schäfer and the other, anonymous, reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dar, F., Cohen, S.R., Mitrea, D.M. et al. Biomolecular condensates form spatially inhomogeneous network fluids. Nat Commun 15, 3413 (2024). https://doi.org/10.1038/s41467-024-47602-z

Received: 07 October 2023
Accepted: 05 April 2024
Published: 22 April 2024
DOI: https://doi.org/10.1038/s41467-024-47602-z
