(C) PLOS One
This story was originally published by PLOS One and is unaltered.
MMTV RNA packaging requires an extended long-range interaction for productive Gag binding to packaging signals [1]
Suresha G. Prabhu, Department of Microbiology and Immunology, College of Medicine and Health Sciences (CMHS), United Arab Emirates University (UAEU), Al Ain, United Arab Emirates
Date: 2024-10
The packaging of genomic RNA (gRNA) into retroviral particles relies on the specific recognition by the Gag precursor of packaging signals (Psi), which maintain a complex secondary structure through long-range interactions (LRIs). However, it remains unclear whether the binding of Gag to Psi alone is enough to promote RNA packaging and what role LRIs play in this process. Using mouse mammary tumor virus (MMTV), we investigated the effects of mutations in 4 proposed LRIs on gRNA structure and function. Our findings revealed the presence of an unsuspected extended LRI, and hSHAPE revealed that maintaining a wild-type–like Psi structure is crucial for efficient packaging. Surprisingly, filter-binding assays demonstrated that most mutants, regardless of their packaging capability, exhibited significant binding to Pr77 Gag , suggesting that Gag binding to Psi is insufficient for efficient packaging. Footprinting experiments indicated that efficient RNA packaging is promoted when Pr77 Gag binds to 2 specific sites within Psi, whereas binding elsewhere in Psi does not lead to efficient packaging. Taken together, our results suggest that the 3D structure of the Psi/Pr77 Gag complex regulates the assembly of viral particles around gRNA, enabling effective discrimination against other viral and cellular RNAs that may also bind Gag efficiently.
Funding: This work was primarily funded by grants from the College of Medicine and Health Sciences, United Arab Emirates University (NP-24-09 to T.A.R.); United Arab Emirates University (UPAR-12M103 to T.A.R.); Centre national de la recherche scientifique (RetroPack International Research Project to R.M.); Department of Education and Knowledge, Abu Dhabi (AARE20‐344 to T.A.R.); ASPIREMRIAD (ASPIRE Precision Medicine Research Institute, Abu Dhabi; VRI‐20‐10 to T.A.R.). S.G.P. was supported through a fellowship from the College of Graduate Studies, United Arab Emirates University, and through grants AARE20‐344 and VRI‐20‐10. V.N.P. was supported by AARE20-344. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Briefly, we introduced mutations that disrupted LRIs I-IV and compensatory mutations designed to restore them, then analyzed the impact of these mutations on gRNA packaging, gRNA structure, and Pr77 Gag binding to Psi. Our findings indicate the presence of only 2 LRIs, which together form an extended LRI that significantly influences gRNA packaging, prompting us to propose a revised secondary structure model for MMTV Psi. Maintaining a Psi structure akin to the wild type (WT) emerges as crucial for efficient packaging. Surprisingly, most mutants displayed notable binding to Pr77 Gag regardless of their packaging ability, suggesting that Gag binding to Psi alone is insufficient for efficient packaging. Intriguingly, SHAPE footprinting experiments revealed that efficient RNA packaging correlated with Pr77 Gag binding to 2 specific sites within Psi, whereas binding elsewhere in Psi did not result in efficient packaging. These findings underscore the necessity for Pr77 Gag to bind to specific nucleotides within the correct structural context of Psi and support a model wherein the 3D structure of the Psi/Pr77 Gag complex governs the assembly of viral particles around gRNA, thereby facilitating effective discrimination against other viral and cellular RNA species that may also bind Gag efficiently.
(A) Genome organization of MMTV and its RNA packaging signal (Ψ) beginning from the R region to 120 nucleotides of Gag, denoted by a blue wavy arrow. (B) hSHAPE-validated RNA secondary structure of MMTV packaging sequences located at the 5′ end, including LRIs I-IV depicted in blue insets. Nucleotides are color-coded according to hSHAPE reactivity, with features annotated, such as stem-loops (SL1-7), PBS, DIS, single-stranded purines (ssPurines), mSD, and Gag start site. DIS, dimerization initiation site; LRI, long-range interaction; MMTV, mouse mammary tumor virus; mSD, major splice donor; PBS, primer binding site.
In this study, we tackled these questions using mouse mammary tumor virus (MMTV) as a model. MMTV, a betaretrovirus, induces breast cancer and, in certain instances, T-cell lymphomas in mice [ 56 – 58 ]. Unlike the majority of retroviruses, including lentiviruses, MMTV assembles in the cytoplasm into immature particles that later translocate to the plasma membrane for budding [ 59 , 60 ]. The MMTV Psi resides within the 5′ UTR and the first 120 nts of the gag gene [ 61 ], and recent studies have elucidated the structure–function relationships of several structural motifs within Psi [ 36 , 41 , 53 ]. Based on SHAPE (selective 2′ hydroxyl acylation analyzed by primer extension) data, the current secondary structure model of MMTV Psi suggests the presence of 4 evolutionarily conserved LRIs [ 36 , 41 , 53 ]: LRIs I-III involve complementary base-pairing between U5 and Gag sequences that are approximately 291 nucleotides apart, while LRI IV is formed by complementary base-pairing of sequences within U5 that are approximately 190 nucleotides apart ( Fig 1 ).
These retroviral Psi sequences adopt complex higher-order structures containing various structural motifs that are essential for Gag binding and retroviral RNA packaging [ 4 , 9 , 10 , 12 , 14 – 18 , 21 , 26 , 35 – 46 ]. Notably, Psi sequences in all retroviruses feature long-range interactions (LRIs) that maintain their overall secondary structure and, in several instances, are critical for Gag binding and gRNA packaging [ 3 , 9 , 10 , 26 , 36 , 43 , 45 – 55 ]. It is important to note that Psi sequences are significantly longer than typical protein-binding sites, and without a high-resolution 3D structure of the Psi/Gag complex, the precise role of LRIs in these functions remains unclear. Additionally, although it is well established that Psi/Gag interactions are necessary for retroviral gRNA packaging [ 4 , 12 , 14 – 19 , 21 ], it is not yet clear whether these interactions alone are sufficient for this process to occur.
In retroviruses, the selective packaging of gRNA, which exists as a dimer of single-stranded RNA molecules noncovalently linked near their 5′-end [ 3 , 4 ], from the cellular environment where it represents only a small fraction (around 1%) of total mRNA, is facilitated by the recognition of cis-acting sequences known as packaging signal (Psi; ψ) typically located at the 5′ end of the retroviral genome, sometimes extending into the Gag open reading frame (ORF) [ 4 – 27 ]. While the nucleocapsid (NC) domain of Gag is crucial for selective packaging [ 12 – 17 , 21 , 22 , 28 , 29 ], in the case of human immunodeficiency virus (HIV-1), the matrix (MA) domain [ 30 , 31 ], capsid domain (CA; [ 32 , 33 ]), p6 domain [ 28 ], and spacer peptide sp1 [ 31 , 34 ] also participate in the specific recognition of Psi by Gag.
The vast majority of virus species selectively package their DNA or RNA genome into viral particles. While DNA viruses typically assemble a procapsid first, followed by packaging of the genome in an energy-dependent process, many RNA viruses, including significant human pathogens such as retroviruses, influenza viruses, and coronaviruses, undergo concerted processes of genomic RNA (gRNA) packaging and viral assembly [ 1 , 2 ].
Overall, footprinting experiments conducted on all mutants revealed that efficient RNA packaging correlates with strong Pr77 Gag binding to nucleotides in ssPurines and the PBS, known as primary Gag-binding sites during WT MMTV gRNA packaging [ 53 ]. In stark contrast, mutants that failed to package RNA efficiently revealed Pr77 Gag binding to nucleotides scattered in regions other than ssPurines and the PBS. Hence, while Gag binding during retroviral RNA packaging is essential, successful RNA packaging relies on binding to specific nucleotides that can only take place within the appropriate structural context.
Histograms showing SHAPE reactivities of nucleotides in the absence (red bars) and presence (blue bars) of Gag for the packaging signal RNAs of: (A) wild type (SA35), (B) mutant SP107i designed to destabilize LRI-IV, and (C) mutant SP108i designed to restore LRI-IV. The dashed boxes depict the previously identified primary Gag-binding sites, such as single-stranded purines (ssPurines) and the PBS. For full details of the attenuation of nucleotide SHAPE reactivities in the absence and presence of Pr77 Gag from a minimum of 3 independent experiments, see S3 Table and S9 and S10 Figs. All nucleotides that show a reactivity decrease >40% upon Gag addition also show a statistically significant difference according to the nonparametric Mann–Whitney U test (p < 0.05). The data underlying this figure can be found in S1 Data. LRI, long-range interaction; PBS, primer binding site; WT, wild type.
This result was confirmed by footprinting experiments on mutants SP107i and SP108i. In brief, the destabilizing mutant (SP107i), in which LRI-IV is disrupted, showed no footprint within ssPurines, and only 1 nucleotide was protected by Pr77 Gag in the PBS (134G; Figs 13B and S9 ). Conversely, mutant SP108i, which restores LRI-IV, exhibited protections for a majority of the nucleotides in the ssPurines (6/9) and in the PBS (4/7) (Figs 13C and S10 ). These results indicate that when LRI-IV is restored, Pr77 Gag can bind to the ssPurines and the PBS and promote efficient RNA packaging ( Fig 2 ).
By contrast, mutant SP109, designed to restore LRI-III’ based on the new RNA secondary structure model and which successfully restored both RNA packaging and structure ( Fig 8 ), showed Pr77 Gag footprints primarily within ssPurines and the PBS, similar to the wild type (Figs 12D and S8 ). These results further confirm the new hSHAPE-validated RNA secondary structure model for MMTV Psi and stress the importance of Pr77 Gag binding to the ssPurines and the PBS for efficient RNA packaging.
Histograms showing the SHAPE reactivities of nucleotides in the absence (red bars) and presence (blue bars) of Gag for the packaging signal RNAs of: (A) wild type (SA35), (B) mutant SP105i designed to destabilize LRI-III, (C) mutant SP106i designed to re-stabilize LRI-III, and (D) mutant SP109i designed to restore LRI-III’. The dashed boxes depict the previously identified primary Gag-binding sites, such as single-stranded purines (ssPurines) and the PBS. For full details of the attenuation of nucleotide SHAPE reactivities in the absence and presence of Pr77 Gag from a minimum of 3 independent experiments, see S3 Table and S6, S7 and S8 Figs. All nucleotides that show a reactivity decrease >40% upon Gag addition also show a statistically significant difference according to the nonparametric Mann–Whitney U test (p < 0.05). The data underlying this figure can be found in S1 Data. LRI, long-range interaction; PBS, primer binding site; WT, wild type.
In the case of LRI-III, neither SP105i (destabilizing mutant) nor SP106i (restoring mutant) showed any Pr77 Gag footprint within ssPurines ( S6 and S7 Figs and Fig 12B and 12C ). In the PBS, only 1 (137G) or 2 nucleotides (137G and 147C) were protected by Pr77 Gag in SP105i and SP106i, respectively ( S6 and S7 Figs and Fig 12B and 12C ). The absence of footprints in the ssPurines and PBS of mutants SP105i and SP106i is consistent with their inability to package gRNA and with the loss of LRIs I-IV in their secondary structure (Figs 2 and 5 ).
Histograms showing the SHAPE reactivities of nucleotides in the absence (red bars) and presence (blue bars) of Gag for the packaging signal RNAs of: (A) wild type (SA35), (B) mutant SP101i designed to destabilize LRI-I, and (C) mutant SP102i designed to re-stabilize LRI-I. The dashed boxes mark the previously identified primary Gag-binding sites, such as single-stranded purines (ssPurines) and the PBS. For full details of the attenuation of nucleotide SHAPE reactivities in the absence and presence of Pr77 Gag from a minimum of 3 independent experiments, see S3 Table and S4 and S5 Figs. All nucleotides that show a reactivity decrease >40% upon Gag addition also show a statistically significant difference according to the nonparametric Mann–Whitney U test (p < 0.05). The data underlying this figure can be found in S1 Data. LRI, long-range interaction; PBS, primer binding site; WT, wild type.
Footprinting experiments conducted on SP101i revealed protections of most nucleotides within ssPurines and the PBS that have been shown to be the primary Gag-binding sites in the WT RNA (Figs 11B and S4 ). This is consistent with the mutant’s ability to package RNA ( Fig 2 ). In contrast, minimal or no protections were observed in ssPurines and PBS regions in the RNA SP102i (only 1 nucleotide (135C) in the PBS was protected). Surprisingly, in this mutant, other nucleotides (26G, 30G, 32U, 35U, 44G, 52G) were protected by Pr77 Gag (Figs 11C and S5 ). The lack of protection in ssPurines and the PBS, which are primary Gag-binding sites [ 53 ], aligns with the non-packaging nature of SP102 ( Fig 2 ). However, given the packaging defect of this mutant, the binding of Pr77 Gag to other nucleotides was rather puzzling.
(A) Schematic illustration of the MMTV WT packageable vector (SA35) RNA from R to 712 nucleotides expressed from a T7 expression plasmid. (B) Illustration of the envelope (env) spliced RNA (AK29) from R to 712 nucleotides expressed from a T7 expression plasmid. (C) hSHAPE analysis was carried out both with and without Pr77 Gag. The mean triplicate SHAPE reactivity obtained without Pr77 Gag was used to predict the RNA secondary structure model of WT (SA35) RNA. Subsequently, the mean hSHAPE reactivities obtained with Pr77 Gag were overlaid onto the RNA secondary structure model predicted in the absence of Pr77 Gag. Notably, nucleotides within the previously identified primary Gag-binding sites, such as single-stranded purines (ssPurines) and the PBS, exhibited significant reductions in hSHAPE reactivities. Nucleotides in ssPurines, PBS, and all other nucleotides marked by arrows show a significant reduction in SHAPE reactivity. The hSHAPE reactivity key was developed based on the mean of hSHAPE reactivities for each nucleotide, as shown in S3 Table. The data shown are from a minimum of 3 independent experiments conducted both in the absence and presence of Pr77 Gag. All nucleotides that show a reactivity decrease >40% upon Gag addition also show a statistically significant difference according to the nonparametric Mann–Whitney U test (p < 0.05). MMTV, mouse mammary tumor virus; PBS, primer binding site; WT, wild type.
When hSHAPE was conducted on WT (SA35) RNA in the absence of Pr77 Gag , the reactivity pattern was consistent with the new structure model presented in Fig 7 . To identify nucleotides interacting with Gag, hSHAPE was conducted on wild type RNA (SA35; Fig 10A ) in the presence of Pr77 Gag and spliced env RNA (AK29; Fig 10B ) as a competitor. The differences in hSHAPE reactivity obtained in the absence and in the presence of Pr77 Gag were quantified and mapped onto this secondary structure model. Differences were deemed significant when hSHAPE reactivities showed a variance of ≥0.20 and a relative difference exceeding 40% to 50% [ 81 ]. hSHAPE reactivities in the presence of Pr77 Gag consistently showed reduced reactivity in nucleotides within single-stranded purines (ssPurines) and the primer binding site (PBS), confirming Pr77 Gag binding to these sites, as reported earlier (Figs 10C and 11A ; [ 53 ]).
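The significance criterion described above can be illustrated with a minimal sketch. It assumes that a nucleotide counts as protected when its reactivity drops by at least 0.20 in absolute terms and by at least 40% relative to the Gag-free value; the function name, the example reactivities, and the choice of 40% as the lower bound of the "40% to 50%" range are illustrative assumptions, not the authors' pipeline.

```python
def footprint_positions(reactivity_free, reactivity_bound,
                        min_abs_drop=0.20, min_rel_drop=0.40):
    """Return 1-based positions whose reactivity drops upon protein addition.

    Thresholds follow the criterion quoted in the text; how the authors
    combined them in practice is an assumption of this sketch.
    """
    protected = []
    pairs = zip(reactivity_free, reactivity_bound)
    for position, (free, bound) in enumerate(pairs, start=1):
        if free <= 0:
            # A relative drop is undefined for non-positive reactivities.
            continue
        drop = free - bound
        if drop >= min_abs_drop and drop / free >= min_rel_drop:
            protected.append(position)
    return protected


# Hypothetical reactivities for four nucleotides (not data from the study).
free = [0.90, 0.10, 0.50, 1.20]
bound = [0.30, 0.05, 0.45, 0.60]
print(footprint_positions(free, bound))
```

In this toy input, only the first and fourth positions satisfy both thresholds; the second and third show drops that are too small in absolute terms.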
Since the filter-binding assays revealed that Pr77 Gag binding affinity does not correlate with RNA packaging efficiency, we asked whether Pr77 Gag binds at the same sites within Psi of the packaging and non-packaging mutants. To this end, we identified the specific nucleotides bound by Pr77 Gag by modifying the RNA with BzCN in both the presence and absence of Pr77 Gag. Reduced hSHAPE reactivity in the presence of Pr77 Gag revealed the footprints of Pr77 Gag on the WT and mutant Psi. To prevent nonspecific binding, hSHAPE was conducted with spliced env RNA (AK29; Fig 3C ) in 4-fold molar excess. Pr77 Gag was used at a concentration 10-fold higher than the Kd value, ensuring complete saturation of the high-affinity binding sites. These conditions identified high-affinity binding sites in other retroviral Psi footprinting experiments [ 46 , 53 , 80 ].
(A) The membrane-bound radioactivity of the WT (SA35) unspliced and mutant RNAs was quantified at increasing concentrations of MMTV Pr77 Gag. Data points were fitted with the Hill equation, with error bars denoting standard deviation from the mean of 3 independent experiments. (B) Pr77 Gag binding parameters to the WT and LRI mutant MMTV Psi region as derived using the Hill equation. Cumulative data are derived from 3 independent experiments. Bmax: the maximum specific binding; h: the Hill slope; Kd: the Pr77 Gag concentration needed to reach half-maximal binding. Each value is followed by its standard deviation. The data underlying Fig 9A and 9B can be found in S1 Data. LRI, long-range interaction; MMTV, mouse mammary tumor virus; WT, wild type.
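For readers unfamiliar with the parameters named in this caption, the sketch below writes out the standard Hill equation, B = Bmax·[P]^h / (Kd^h + [P]^h). The function name and the numerical values are hypothetical; the actual fitted parameters are those reported in Fig 9B, and the fitting procedure used by the authors is not reproduced here.

```python
def hill_binding(protein_conc, bmax, kd, h):
    """Binding predicted by the Hill equation.

    protein_conc and kd share the same concentration unit; bmax is the
    binding plateau and h the Hill slope (cooperativity).
    """
    return bmax * protein_conc ** h / (kd ** h + protein_conc ** h)


# Hypothetical parameters chosen only to illustrate the curve's behavior.
bmax, kd, h = 1.0, 100.0, 2.0

# By definition, binding at a protein concentration equal to Kd is Bmax / 2.
half_max = hill_binding(kd, bmax, kd, h)

# At 10-fold the Kd, binding is close to the plateau.
near_plateau = hill_binding(10 * kd, bmax, kd, h)
print(half_max, near_plateau)
```

The second evaluation shows why a protein concentration of 10 × Kd is expected to come close to saturating a site that follows this model.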
Next, we performed filter-binding assays to determine whether the gRNA packaging results obtained with the LRI mutants could be correlated to their ability to bind to Pr77 Gag . The Pr77 Gag protein we used in these assays is able to form virus-like particles (VLPs) in vitro as well as in vivo [ 53 , 79 ]. Furthermore, the Gag VLPs formed by Pr77 Gag -His 6 fusion protein in human embryonic kidney (HEK293T) cells efficiently package RNA containing the MMTV Psi [ 53 , 79 ]. Based on the ratio of UV absorbance at 260 and 280 nm, the purified protein was observed to be devoid of nucleic acids. Finally, dynamic light scattering (DLS) revealed that the average hydrodynamic radius (Rh) determined based on volume (percent) and number (percent) distribution was around 6.00 nm, corresponding to Pr77 Gag trimers ([ 53 ]; S3 Fig ). Thus, this purified protein was used in filter-binding assays along with radiolabeled in vitro transcribed RNA from WT (SA35) and LRI mutants (SP101i, 102i, SP105i-SP109i RNAs; Fig 3B ). As is evident from Fig 9A , almost all mutant RNAs (whether packaging or non-packaging) were able to bind to Pr77 Gag efficiently; however, some of the non-packaging mutants exhibited a lower binding plateau, which could reflect a different stoichiometry. Interestingly, as shown in Fig 9B , all mutants except SP106i bind Pr77 Gag with a Kd and a cooperativity (h) similar to the WT. These results suggest that Pr77 Gag efficiently binds to the MMTV Psi region regardless of its ability to promote RNA packaging.
(A) Description of the substitution mutants in the newly proposed LRI-III’, with red nucleotides indicating introduced mutations aimed at destabilizing or re-stabilizing complementarity with heterologous sequences. Columns 4, 5, and 6 show the effect of the mutations on RNA packaging, propagation, and structure, respectively. The RNA packaging data shown here are from a minimum of 3 independent experiments performed in triplicate (±SD). The RNA propagation data shown here are from a minimum of 3 independent experiments performed in duplicate (±SD). (B) An hSHAPE-validated RNA secondary structure model depicts the restored LRI-III’ mutant SP109i, designed to re-stabilize LRI-III’ with heterologous complementary sequences. Structural elements that are also present in the new WT structure (SA35) such as stem-loops (SL1-7), PBS, DIS, single-stranded purines (ssPurines), and mSD are marked as in Fig 7. The X and Y-strands of LRIs III’ and IV are boxed and labeled in different colors for clarity. Nucleotides are color-coded according to SHAPE reactivity derived from a minimum of 3 independent experiments, with data provided in S1 Table. The data underlying Fig 8A can be found in S1 Data. DIS, dimerization initiation site; LRI, long-range interaction; mSD, major splice donor; PBS, primer binding site; SL, stem loop; WT, wild type.
To examine the role of the new LRI-III’ in MMTV gRNA packaging, we utilized mutant SP105, which disrupts it, and created mutant SP109, containing the same mutations as SP105 and additional ones designed to restore LRI-III’ ( Fig 8A ). As observed earlier, SP105 mutant exhibited severely compromised RNA packaging and propagation; however, when the new LRI-III’ structure was restored, RNA packaging was restored to almost WT levels ( Fig 8A ). Accordingly, hSHAPE indicated that the secondary structure of mutant SP109i is similar to wild type, as expected (compare Fig 7 with Fig 8B ). These results suggest that the secondary structure of the MMTV Psi RNA is held together by a long stretch of 11 Watson–Crick–Franklin base pairs (interrupted by a 1 nucleotide bulge), the sequence of which is not important for function.
The same software and input sequences (432 nucleotides) were used to predict the secondary structure of the MMTV 5′ region in this work and our previous publication [ 53 ]. Both studies also utilized the same RNA and experimental probing conditions. However, new primers for cDNA synthesis were introduced in this study (refer to the Material and methods section). The 3′ primer was redesigned because the one used in our previous study was unsuitable for analyzing the mutants used here. More importantly, a new 5′ primer was designed to improve the signal-to-noise ratio at the 5′ end of the RNA. The enhanced quality of the SHAPE data at the 5′ end of RNA (SA35) resulted in our new modeling, providing a slightly different secondary structure. Notably, our new modeling identified the previously published secondary structure model as the third most stable structure. For large RNAs, it is common for small differences in experimental data to result in different “most stable” structures, as multiple structures often have very similar minimal free energies [ 78 ].
The updated structure model closely resembles the previously proposed structure but exhibits notable differences in the LRIs and minor differences in SLs 5, 6, and 7. In the new structure model, LRIs I and II are absent, while LRI-IV remains unchanged. Additionally, the X-strand of the initially proposed LRI-III now base pairs with complementary sequences 195 nucleotides downstream within U5, instead of Gag. Structural elements consistent with the earlier proposed model, such as SLs1-7, PBS, DIS, single-stranded purines (ssPurines), and mSD, are present in their native positions and labeled as in Fig 1 . The X and Y-strands of LRIs III’ and IV are shown in boxes and labeled with different colors for clarity. Nucleotides are color-coded according to SHAPE reactivity derived from a minimum of 3 independent experiments, with data provided in S1 Table . DIS, dimerization initiation site; LRI, long-range interaction; MMTV, mouse mammary tumor virus; mSD, major splice donor; PBS, primer binding site; SL, stem loop; WT, wild type.
To confirm these results, we re-probed the same region of WT (SA35) MMTV gRNA (712 nucleotides from R to 400 nucleotides of Gag). The new hSHAPE-validated structure ( Fig 7 ) closely resembles the earlier proposed structure, but differs notably in the LRIs, with minor differences in SLs 5–7. Specifically, LRI-I is not observed, and LRI-II’s U5 sequences (5′ GCC 3′; nts 44–46) are base paired with a different Gag sequence (3′ CGG 5′; nts 337–335) ( Fig 7 ). However, the existence and biological significance of this alternative LRI remain uncertain because mutating the U5 sequences (which form an LRI in both the old and the new structure models) did not affect RNA packaging. In the new model, LRI-III is present in a modified form that we named LRI-III’: its 5′ sequence (5′ CCGU 3′; nts 50–53) is base paired with sequences within U5 (5′ ACGG 3′; nts 244–248) instead of Gag. Consistently, no changes are observed in LRI-IV in the new hSHAPE-validated structure, which corroborates the structure–function data of LRI-IV mutants presented above ( Fig 7 ).
Overall, the combined biological and structural data suggest that the previously proposed LRI-I and LRI-II [ 36 , 53 ] may not necessarily exist. LRI-III mutants SP105 and SP106 exhibited pronounced defects in RNA packaging, which suggests that while these sequences are important for function, they may not be involved in complementary base-pairing, as initially thought [ 36 , 53 ]. In contrast, biological and structural data obtained with mutants SP107 and SP108 strongly support the existence of LRI-IV, as initially proposed [ 53 ].
An intriguing observation from the hSHAPE structures is the consistent retention of 2 critical structural patterns, SL2 and the branched SL4, in their original locations across all mutants (Figs 4 – 6 ). These motifs encompass nucleotides recognized as primary Gag-binding sites crucial for MMTV RNA packaging [ 53 ]. Despite this, certain mutants exhibited defects in RNA packaging ( Fig 2 ), suggesting that Pr77 Gag was unable to bind to these sites in those specific mutants.
The first 432 nucleotides of the 712 nt long RNA are shown. (A) Mutant SP107i was designed to destabilize LRI-IV. (B) Mutant SP108i was designed to re-stabilize LRI-IV. Structural elements that are also present in the WT structure (SA35) such as stem-loops (SL1-7), PBS, DIS, single-stranded purines (ssPurines), and mSD are marked as in Fig 1 . The X and Y-strands of different LRIs in both destabilizing and re-stabilizing mutants are boxed and labeled in different colors for clarity. Nucleotides are color-coded according to SHAPE reactivity derived from a minimum of 3 independent experiments, with data provided in S1 Table . DIS, dimerization initiation site; LRI, long-range interaction; mSD, major splice donor; PBS, primer binding site; WT, wild type.
The SP107 mutant, designed to disrupt LRI-IV, lost its ability to package and propagate RNA, while mutant SP108, designed to restore LRI-IV, restored RNA packaging and propagation to WT levels ( Fig 2 ). Accordingly, the hSHAPE-validated structure of SP107i revealed not only the loss of LRI-IV but also the loss of the other 3 LRIs ( Fig 6A ), whereas in the case of SP108i, LRI-IV was restored, but not the other LRIs, while adopting a structure globally similar, though not identical, to the wild type ( Fig 6B ). Altogether, these results indicate that the structure of LRI-IV, but not its sequence, is essential for function.
The first 432 nucleotides of the 712 nt long RNA are shown. (A) Mutant SP105i was designed to destabilize LRI-III. (B) Mutant SP106i was designed to re-stabilize LRI-III. Structural elements that are also present in the WT structure (SA35) such as stem-loops (SL1-7), PBS, DIS, single-stranded purines (ssPurines), and mSD are marked as in Fig 1 . The X and Y-strands of different LRIs in both destabilizing and re-stabilizing mutants are boxed and labeled in different colors for clarity. Nucleotides are color-coded according to SHAPE reactivity derived from a minimum of 3 independent experiments, with data provided in S1 Table . DIS, dimerization initiation site; LRI, long-range interaction; mSD, major splice donor; PBS, primer binding site; WT, wild type.
Next, we tested the structure of LRI-III mutant RNAs SP105i and SP106i, as these 2 mutants exhibited severe defects in RNA packaging and propagation ( Fig 2 ). According to their hSHAPE structures ( Fig 5A and 5B ), all 4 LRIs were lost in these mutants, consistent with their loss of function ( Fig 2 ).
The first 432 nucleotides of the 712 nt long RNA are shown. (A) Mutant SP101i was designed to destabilize LRI-I. (B) Mutant SP102i was designed to re-stabilize LRI-I. Structural elements that are also present in the WT structure (SA35) such as stem-loops (SL1-7), PBS, DIS, single-stranded purines (ssPurines), and mSD are marked as in Fig 1 . The X and Y-strands of different LRIs in both destabilizing and re-stabilizing mutants are boxed and labeled in different colors for clarity. Nucleotides are color-coded according to SHAPE reactivity derived from a minimum of 3 independent experiments, with data provided in S1 Table . DIS, dimerization initiation site; LRI, long-range interaction; mSD, major splice donor; PBS, primer binding site; WT, wild type.
As SP101, designed to disrupt LRI-I, exhibited minimal or no packaging defect, while SP102, designed to restore LRI-I, was severely compromised ( Fig 2 ), we performed hSHAPE on these mutants. Stem loops (SLs) 1–4 and LRI-IV, which were present in the secondary structure of the WT MMTV gRNA proposed earlier [ 53 ], are maintained in the resulting RNA secondary structure model of SP101i, while LRIs-I-III are lost ( Fig 4A ). Unexpectedly, LRI-I was not restored in SP102i; indeed, this mutant also lost the other 3 LRIs, namely LRI-II, III, and IV ( Fig 4B ), suggesting that the packaging defect of SP102 ( Fig 2 ) might be due to the disruption of LRI-IV.
Next, we investigated the RNA secondary structure of the 5′ end of the WT and selected LRI mutant MMTV genomes using hSHAPE [ 72 – 75 ]. To that end, we treated in vitro transcribed RNAs corresponding to the 712 nucleotides at the 5′ end of the WT MMTV genome (SA35; Fig 3B ) and LRI mutants (SP101i, 102i, SP105i-SP109i RNAs; Fig 3B ) with BzCN. The resulting modifications of the flexible riboses were identified as stops during the extension of fluorescently labeled primers by reverse transcriptase and cDNA analysis by capillary electrophoresis. The SHAPE reactivities of each nucleotide, obtained from 3 experiments using QuShape [ 76 ] ( S1 Table ), were used as pseudoenergy constraints to derive RNA secondary structure models for MMTV Psi of the WT and LRI mutants using the RNAStructure version 6.1 program [ 77 ].
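To make the notion of a pseudoenergy constraint concrete, the sketch below uses the log-linear form commonly applied to SHAPE data, ΔG_SHAPE(i) = m·ln(reactivity_i + 1) + b, which penalizes pairing of highly reactive nucleotides and rewards pairing of unreactive ones. The slope and intercept shown (1.8 and −0.6 kcal/mol) are commonly cited defaults and are an assumption here; the text does not state which values were used in this study.

```python
import math


def shape_pseudo_energy(reactivity, slope=1.8, intercept=-0.6):
    """Per-nucleotide pseudo-free-energy term (kcal/mol) for a paired base.

    slope and intercept are assumed default values, not parameters
    reported in the article.
    """
    # Negative normalized reactivities are treated as zero (an assumption).
    clipped = max(reactivity, 0.0)
    return slope * math.log(clipped + 1.0) + intercept


# Hypothetical reactivities: unreactive, moderately reactive, highly reactive.
for value in (0.0, 0.4, 1.5):
    print(value, round(shape_pseudo_energy(value), 2))
```

An unreactive nucleotide receives a favorable (negative) term when paired, whereas a highly reactive one receives an unfavorable (positive) term, which is how the probing data steer the predicted structure.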
(A) Schematic representation of the MMTV genome indicating the location of MMTV gRNA packaging determinants. (B) Illustration of the packageable vector RNA from R to 712 nucleotides expressed from a T7 expression plasmid, with the table detailing the names of the clones and the nature of the mutations. (C) Representative gel images displaying in vitro dimerization of WT (SA35) and LRI mutant RNAs in TBM buffer. M and D labels below the lanes indicate monomer and dimer buffers used for dimerization experiments. Monomeric and dimeric RNA species are denoted by letters M and D, respectively, on the gel’s horizontal margin. Gels have been cropped as indicated by vertical white spaces to show the relevant areas only. (D) Histograms illustrating the dimerization efficiencies of mutant RNAs compared to WT (SA35) calculated through densitometric analysis of bands from 3 independent experiments. Dimerization efficiency was determined by dividing the intensity of the dimeric RNA band by the intensity of the band from the total RNA (i.e., sum of dimer and monomer bands). No statistically significant differences (p-values > 0.05) were observed in the ability of the mutant clones to dimerize when compared to the WT (SA35) according to the nonparametric Mann–Whitney U test, except for mutant SP106i (p < 0.05). The data underlying Fig 3D can be found in S1 Data. gRNA, genomic RNA; LRI, long-range interaction; MMTV, mouse mammary tumor virus; WT, wild type.
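The dimerization efficiency defined in this caption is a simple ratio, sketched below. The function name and the band intensities are hypothetical and serve only to show the arithmetic; they are not densitometry values from the study.

```python
def dimerization_efficiency(dimer_intensity, monomer_intensity):
    """Fraction of total RNA signal found in the dimer band."""
    total = dimer_intensity + monomer_intensity
    if total <= 0:
        raise ValueError("total band intensity must be positive")
    return dimer_intensity / total


# Hypothetical band intensities in arbitrary densitometry units.
wt_efficiency = dimerization_efficiency(750.0, 250.0)
mutant_efficiency = dimerization_efficiency(600.0, 400.0)

# Expressing the mutant relative to WT, as in a normalized histogram.
relative_to_wt = mutant_efficiency / wt_efficiency
print(wt_efficiency, mutant_efficiency, relative_to_wt)
```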
Packaging of retroviral gRNA is closely linked to its dimerization [ 3 , 4 , 12 , 14 , 17 , 24 , 36 , 38 , 70 , 71 ]. Therefore, to test whether the mutations introduced in the LRIs affect RNA dimerization, 712 nucleotides from the 5′ end of the MMTV genome, encompassing wild type as well as mutant packaging sequences, were cloned into a T7 promoter-containing plasmid ( Fig 3A and 3B ), and dimerization assays were performed using in vitro transcribed RNAs. Interestingly, both packaging and non-packaging LRI mutant RNAs were found to dimerize at WT levels and showed no statistically significant differences, except for mutant SP106i ( Fig 3C and 3D ). Note that the RNA monomer species migrates slightly faster in the dimer condition than in the monomer condition, as the high ionic strength of the dimer buffer compacts the RNA. These results reveal that while the LRI destabilizing and restabilizing mutations affected RNA packaging, they did not adversely affect RNA dimerization, further validating that the effects observed on RNA packaging were bona fide and not due to any effects on RNA dimerization.
To investigate the role of LRI-IV, the longest LRI in MMTV gRNA Psi ( Fig 1B ), on viral replication, we created mutants SP107 and SP108, designed to disrupt and restore 3 of the 7 base-pairs of LRI-IV, respectively ( Fig 2A ). SP107 revealed nearly complete abrogation of RNA packaging (RPE = 0.06 ± 0.03; p-value < 0.05; Fig 2C ) and propagation (CFU = 0.07 ± 0.06; p-value < 0.05; Fig 2D ). On the other hand, SP108 showed restoration of both RNA packaging (RPE = 1.41 ± 0.47; p-value > 0.05; Fig 2C ) and RNA propagation (CFU = 0.78 ± 0.05; p-value < 0.05; Fig 2D ) to WT levels. These results support the existence and biological significance of LRI-IV in MMTV replication. Furthermore, they highlight the importance of complementarity among nucleotides forming LRI-IV, rather than the sequence itself, for RNA packaging.
Mutants SP105 and SP106 were designed to disrupt and restore LRI-III, respectively ( Fig 2A ). RNA packaging and propagation of these mutants were almost completely abolished, regardless of whether the base pairing sequence complementarity of LRI-III was disrupted (RPE = 0.12 ± 0.13; p-value < 0.05; CFU = 0.12 ± 0.14; p-value < 0.05; Fig 2C and 2D ) or restored (RPE = 0.06 ± 0.03; p-value < 0.05; CFU = 0.03 ± 0.04; p-value < 0.05; Fig 2C and 2D ). These results suggest that either the identity of the sequences forming LRI-III is crucial, or the proposed LRI-III may not actually exist.
Analysis of mutants SP103 and SP104, which were designed to disrupt and restore LRI-II, respectively ( Fig 2A ), did not allow us to draw a conclusion about the existence of this LRI. Indeed, compared to the wild type, mutant SP103 did not show any defect in either RNA packaging (RPE = 0.98 ± 0.46; p-value > 0.05; Fig 2C ) or RNA propagation (CFU = 1.21 ± 0.23; p-value < 0.05; Fig 2D ). Similarly, mutant SP104 gave results nearly identical to those of the wild type for both RNA packaging and propagation (RPE = 1.03 ± 0.24; p-value > 0.05; CFU = 1.17 ± 0.41; p-value > 0.05; Fig 2C and 2D ). These results indicate that the sequences mutated in SP103 and SP104 play no significant role in RNA packaging, whether or not LRI-II exists.
Compared to the wild type, SP101 RNA ( Fig 2A ) exhibited only moderate reductions in gRNA packaging (RPE = 0.82 ± 0.07; p-value < 0.05; Fig 2C ) and propagation (CFU = 0.74 ± 0.14; p-value < 0.05; Fig 2D ). Conversely, mutant SP102 showed an 87% reduction in both RNA packaging (RPE = 0.13 ± 0.11; p-value < 0.05; Fig 2C ) and propagation (CFU = 0.13 ± 0.08; p-value < 0.05; Fig 2D ). These results indicate that mutations in the X and Y strands of LRI-I ( Fig 2A ) have cooperative (or additive) rather than compensatory effects, suggesting that LRI-I does not exist.
Using our previously established three-plasmid genetic complementation assay [ 67 ], we evaluated the relative packaging efficiency (RPE) of the WT and LRI-I mutant RNAs by quantifying the amount of RNA packaged into the virions. To ensure the stable expression and successful transport of each vector RNA to the cytoplasm, nuclear and cytoplasmic RNA fractions were isolated from transfected cells and quantified. The quality of cell fractionation was assessed by the presence of unspliced β-actin mRNA in the nuclear and not in the cytoplasmic fractions [ 69 ]. Multiplex RT-PCR did not detect amplification of unspliced β-actin mRNA in the cytoplasmic fraction ( Fig 2B ; panel I), in contrast to the presence of 18S rRNA ( Fig 2B ; panel I) confirming the absence of nuclear leakage. Complementary DNAs (cDNAs) prepared from cytoplasmic RNA fractions and pelleted viral particles were amplified using specific primers ( Fig 2B ; panels II and III, respectively). Amplification of the desired region across all samples validated the efficient and stable expression as well as appropriate transport of transfer vector RNAs from the nucleus to the cytoplasm ( Fig 2B ; panel II). Finally, WT or mutant transfer vector RNAs in the cytoplasm and in the pelleted virus particles were quantified using RT-qPCR [ 36 , 41 , 53 , 61 ]. Part of the transfected supernatant was also used to infect HeLa CD4+ cells to evaluate the ability of the produced virions to transduce the packaged RNA into the target cells. This was achieved by monitoring the emergence of hygromycin-resistant colonies following selection of the infected cultures with hygromycin B-containing medium.
(A) List of substitutions in the U5/Gag and U5/U5 LRIs, with mutations highlighted in red. (B) Representative gel images of the controls necessary for validating different aspects of the three-plasmid in vivo packaging and propagation assay: (I) multiplex amplification validating the nucleocytoplasmic fractionation technique, (II) PCR amplification of cDNAs prepared from the cytoplasmic RNA fraction validating stability and nuclear export of transfer vector RNA, and (III) PCR amplification of packaged transfer vector RNA. (C) Packaging efficiency of mutant transfer vector RNAs relative to the wild type (DA024). (D) Relative propagation of MMTV transfer vector RNAs measured as normalized hygromycin-resistant CFUs/ml for mutant transfer vectors compared to the wild-type (DA024) vector. Mock samples contained only the transfer vector and no packaging construct. Data are presented as mean ± standard deviation from a minimum of 3 independent experiments performed in triplicate for RNA packaging (panel C) and in duplicate for RNA propagation (panel D). Differences compared to the wild type were considered significant when p < 0.05 according to the nonparametric Mann–Whitney U test. The data underlying Fig 2C and 2D can be found in S1 Data. CFU, colony-forming unit; gRNA, genomic RNA; LRI, long-range interaction; MMTV, mouse mammary tumor virus.
Considering the role of LRI sequences in anchoring and stabilizing RNA structures [ 10 , 35 , 39 , 52 , 54 ], we hypothesized that disrupting the base-pairing between the U5 and Gag sequences within LRI-I to LRI-IV could negatively affect RNA packaging. Conversely, restabilizing these interactions should restore the structure and hence RNA packaging. To test this hypothesis, we first designed mutant SP101, in which the U•G wobble base pairs of LRI-I were disrupted ( Fig 2A ). Next, in mutant SP102, we restored 2 G•U wobble base pairs in LRI-I ( Fig 2A ).
A new tailor-made MMTV TaqMan real-time qPCR assay was developed based on principles described earlier [ 61 ]. This new MMTV TaqMan assay was used along with a commercially available endogenous β-actin TaqMan assay to quantify both mutant and WT MMTV gRNAs expressed in the cytoplasm and packaged into virions. The amplification efficiency of the MMTV and β-actin TaqMan assays was tested on serially diluted DA024 and β-actin-expressing plasmid DNAs ( S2A Fig ; panels I and II). If the 2 assays have similar amplification efficiencies, the slope of ΔCT plotted against the log input amount should ideally be ≤0.1. Under our experimental conditions, the slope was 0.1076 ( S2C Fig ).
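The efficiency comparison described above (fitting ΔCT against the log of the input amount and inspecting the slope) can be sketched in Python as follows. The CT values and dilution series are hypothetical and do not reproduce the data in S2 Fig; the function names are illustrative, and comparing the absolute value of the slope to the 0.1 threshold is an assumption about how the criterion is applied.

```python
import math


def linear_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least 2 paired observations")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def delta_ct_slope(input_amounts, ct_target, ct_reference):
    """Slope of dCT (target minus reference) versus log10(input amount)."""
    log_inputs = [math.log10(a) for a in input_amounts]
    delta_ct = [t - r for t, r in zip(ct_target, ct_reference)]
    return linear_slope(log_inputs, delta_ct)


# Hypothetical 10-fold dilution series (template copies per reaction).
amounts = [1e2, 1e3, 1e4, 1e5, 1e6]
ct_mmtv = [30.10, 26.78, 23.46, 20.14, 16.82]   # hypothetical target CTs
ct_actin = [29.90, 26.53, 23.16, 19.79, 16.42]  # hypothetical reference CTs

slope = delta_ct_slope(amounts, ct_mmtv, ct_actin)
# Assumption: the criterion is applied to the magnitude of the slope.
comparable = abs(slope) <= 0.1
print(f"slope = {slope:.4f}; efficiencies comparable: {comparable}")
```

A slope near zero means that ΔCT stays constant across the dilution series, that is, both assays lose or gain cycles at the same rate as template input changes.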
Conventional studies on retroviral RNA packaging using WT viruses are constrained by the involvement of the 5′ end of gag in Psi. This makes analyzing packaging using full-length WT viruses challenging, as mutations in Psi can affect the Gag ORF. While introducing point mutations is feasible, it often causes unexpected cis effects like mRNA nuclear export alterations [ 62 – 66 ]. To address these limitations and study the effects of mutations in MMTV Psi on gRNA packaging and propagation, we used a three-plasmid genetic complementation assay ( S1 Fig ) [ 36 , 41 , 53 , 61 , 67 ]. This involved co-transfecting a transfer vector (DA024) containing necessary cis-acting sequences and a marker gene [ 67 ], a packaging construct (JA10) expressing gag-pro-pol genes [ 67 ], and a VSV-G expression plasmid (MD.G) [ 68 ]. This split genome strategy generates pseudotyped virus particles from JA10 and MD.G, while the transfer vector produces packageable RNA. The replication of packaged RNA is restricted to a single round, allowing monitoring of vector RNA propagation via the hygromycin resistance gene. RNA packaging efficiency is assessed through real-time quantitative PCR (RT-qPCR), correlating hygromycin-resistant colonies with viral RNA content. This approach allows manipulation of RNA secondary structure sequences involved in MMTV gRNA packaging without affecting gag-pro-pol sequences.
Discussion
RNA viruses rely on conserved structural information found in noncoding and occasionally coding sequences to execute crucial events during their life cycle. The 5′ end of the genome in numerous RNA viruses, including crucial pathogens for humans, animals, and plants, harbors a wealth of cis-acting information conveyed through diverse higher-order structures, often maintained by LRIs [82,83].
Notably, LRIs have been identified as conserved structural motifs in various retroviruses [9,10,36,43,45,47–54,84]. Despite significant sequence and structural heterogeneities, the persistence of LRIs in isolates of HIV-1, HIV-2, MPMV, MMTV, and FIV provides evidence for their functional importance in the retroviral life cycle [9,10,36,43,47,48,52–54]. Indeed, mutations destabilizing the complementarity of these LRIs adversely affect crucial steps in the retroviral life cycle, including RNA packaging and dimerization [10,47,48,52,54].
The propensity of retroviral Psi to fold into complex secondary structures underscores the crucial role of these structures during retroviral gRNA packaging. In most retroviruses, including lentiviruses, the efficiency and selectivity of gRNA packaging are governed in part by the multimerization of the Gag precursor at the plasma membrane [32,85–88], complicating the analysis of this process. MMTV constitutes an attractive system to study membrane-independent retroviral RNA packaging, as it assembles in the cytoplasm of infected cells [59,60]. Within the 5′ end of the MMTV genome, various sequence and structural motifs have been identified as pivotal for gRNA packaging and dimerization ([36,41,53,61]; Fig 1). A distinctive feature of the previously published RNA secondary structure of the MMTV gRNA Psi is the presence of several LRIs predicted to anchor the overall secondary RNA structure ([36,41,53]; Fig 1).
The initial goal of this study was to test the role of the 4 proposed LRIs (LRI-I to LRI-IV) in MMTV gRNA packaging by combining structural and functional approaches. Among the LRIs that involve only complementary U5 sequences, we were unable to confirm the presence of LRI-II. In contrast, our findings indicate that LRI-I and LRI-III do not form as originally proposed (see Fig 2). On the other hand, our functional data support the existence of LRI-IV, formed by complementary base-pairing of U5/U5 sequences (Fig 2). Our structural data (Figs 4–7) supported these conclusions and demonstrated the existence of an alternative LRI, which we named LRI-III’, involving U5/U5 complementary sequences (approximately 198 nucleotides apart) instead of U5/Gag as initially proposed ([36,53], compare Fig 1 with Fig 7). Interestingly, LRI-III’ and LRI-IV are both important for gRNA packaging (Figs 2 and 8), and our results suggest that these LRIs function mechanistically in a similar manner, as their sequences can be substituted with heterologous sequences without any adverse effects on RNA packaging as long as base pairing is maintained (Figs 2, 6B and 8). Notably, the 7-nucleotide LRI-IV in MMTV exhibits functional similarities to the R/U5-Gag heptanucleotide LRI observed in FIV, a lentivirus, since base-pairing, but not sequence, of the FIV LRI is crucial for RNA packaging [10,54].
Of note, the MMTV LRI-III’ and LRI-IV are contiguous in the new secondary structure model of the MMTV Psi, being separated only by a one-nucleotide bulge (Fig 7). Accordingly, when 2 nucleotides (out of 4) in LRI-III’ or 3 nucleotides (out of 7) in LRI-IV were substituted independently to destabilize the respective LRIs, both LRIs were lost, as was function (Figs 2, 5A and 6A). Furthermore, in a compensatory approach, when we designed mutants aimed at restoring LRI-III’ and LRI-IV individually (mutants SP109 and SP108, respectively), both function and structure were restored (Figs 2, 6B and 8). The data presented here thus suggest that any perturbation designed to destroy complementarity in this 11-nucleotide-long extended LRI severely compromises MMTV gRNA packaging. In light of these observations, we reviewed the structure-function analysis of all LRI mutants and observed that the LRI-I destabilizing mutant (SP101), which did not affect RNA packaging, maintained both LRI-III’ and LRI-IV as a long continuous stretch, as discussed above (Figs 2 and 4A). Taken together, these observations strongly argue that LRIs III’ and IV must be regarded as 1 extended LRI rather than 2 separate LRIs. In this respect, MMTV resembles HIV and FIV, which possess a single identified long-range RNA-RNA interaction [9,10,43,47,48,52,54]. However, unlike other retroviruses such as HIV-1, HIV-2, MPMV, and FIV, where U5/Gag sequences are involved in forming LRIs [9,10,43,47,48,52,54], the extended MMTV LRI III’-IV only involves U5/U5 sequences (198 nucleotides apart). MPMV, another betaretrovirus, harbors 2 LRIs that are important for RNA packaging, and in contrast with MMTV, both the structure and the sequence of one of these LRIs are important for MPMV gRNA packaging [35,52].
Results from filter-binding experiments demonstrated that both WT and LRI mutants, irrespective of their RNA packaging phenotype, could efficiently bind to Pr77Gag (Fig 9). This observation is in strong contrast with a number of previous studies on several retroviruses, including HIV [44,46], FIV [54], and MMTV [53], in which a clear correlation was observed between binding of the Gag precursor to Psi and gRNA packaging efficiency. Our present results thus indicate that while Gag binding to the packaging signal is necessary, it is not sufficient to ensure efficient RNA packaging.
Several previous studies have proposed that high-affinity binding of HIV-1 Gag to Psi-containing RNAs cannot explain selective packaging of HIV-1 genomic RNA [89–92]. However, a Gag precursor lacking the p6 domain (GagΔp6) was used in all these studies. When a full-length Gag precursor was used, specific binding of Gag to Psi-containing RNA was observed [28,44,46], which is consistent with the enrichment of HIV-1 genomic RNA in viral particles [93]. In HIV-1 GagΔp6, nonspecific electrostatic interactions outweigh specific interactions [89,92]. It is likely that the p6 domain, which is negatively charged, contributes to the specific binding of full-length Gag to Psi by neutralizing the positive charges of the NC domain [28]. In addition, Mutational Interference Mapping Experiment (MIME), an unbiased exhaustive approach, revealed a good correlation between mutations that decreased the binding affinity of full-length HIV-1 Gag for Psi-containing RNA [45] and those that decreased packaging of the genomic RNA [26], indicating that, in HIV-1, specific binding of Gag to Psi does contribute to packaging. However, some mutations affected RNA packaging without affecting Gag binding [26]. While some of these mutations were found to affect RNA metabolism, thus reducing packaging indirectly, the effect of mutations in the PBS domain remained unexplained. Therefore, it is possible that specific Gag binding to Psi is not the only mechanism ensuring selective packaging of the genomic HIV-1 RNA, and kinetics could also affect the selectivity of the packaging process [91]. Hence, in the case of MMTV, selection of the gRNA may indeed primarily rely on the kinetic advantage provided by Psi.
Indeed, our footprinting experiments (Figs 10–13 and S4–S10) showed that efficient gRNA packaging correlates with Pr77Gag binding in the PBS and ssPurines regions of Psi. These results are in line with our previous study that showed that mutations in the PBS and ssPurine regions of MMTV Psi dramatically reduced Pr77Gag binding and gRNA packaging [53]. It is also noteworthy that in the case of HIV-1 and many other retroviruses, such as HIV-2, SIV, and MPMV, multiple unpaired purines (single-stranded purines) play a key role in the selective encapsidation of viral RNA [80,94–98]. This points towards a highly conserved role of unpaired purines in selective packaging of retroviral RNAs irrespective of their assembly mode in different parts of the cell. With our non-packaging mutants, we observed scattered footprints even though the local structure of the PBS and ssPurines regions was not altered (Figs 4–6). This suggests that alteration of the overall 3D structure of the Psi of the non-packaging mutants, caused by disruption of the extended LRI III’-IV, prevents Pr77Gag binding at these specific sites and promotes binding to nonspecific nucleotides. This indicates that for MMTV gRNA to be selectively packaged, Pr77Gag must bind to specific nucleotides in the correct structural context rather than binding to any nucleotide(s) when the proper structural context is lost, as has been proposed for HIV-1 [99].
Why does efficient Pr77Gag binding to nonspecific nucleotides not promote RNA packaging? RNA is known to be a crucial structural element of retroviral particles [100] and it induces Gag multimerization and Gag assembly [91,101]. Previous experiments suggested that HIV-1 Psi more efficiently promotes in vitro Gag/RNA assemblies than heterologous RNAs [91]. Our present study supports a model in which Gag must bind to Psi in the correct 3D context to promote efficient assembly of viral particles. Indeed, computer simulations showed that the RNA folding geometry of the packaging signal affects the assembly activation energy barrier, allowing kinetic selectivity of the genomic RNA [102].
---
[1] Url:
https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3002827
Published and (C) by PLOS One
Content appears here under this condition or license: Creative Commons - Attribution BY 4.0.