(C) PLOS One [1]. This unaltered content originally appeared in journals.plosone.org.
Licensed under Creative Commons Attribution (CC BY) license.
url:https://journals.plos.org/plosone/s/licenses-and-copyright

------------



Proteome-wide Mendelian randomization identifies causal links between blood proteins and severe COVID-19

['Alish B. Palmos', 'Social', 'Genetic', 'Developmental Psychiatry Centre', 'Institute Of Psychiatry', 'Psychology', 'Neuroscience', 'King S College London', 'London', 'United Kingdom']

Date: 2022-05

In November 2021, the COVID-19 pandemic death toll surpassed five million individuals. We applied Mendelian randomization including >3,000 blood proteins as exposures to identify potential biomarkers that may indicate risk for hospitalization or need for respiratory support or death due to COVID-19, respectively. After multiple testing correction, using genetic instruments and under the assumptions of Mendelian Randomization, our results were consistent with higher blood levels of five proteins GCNT4, CD207, RAB14, C1GALT1C1, and ABO being causally associated with an increased risk of hospitalization or respiratory support/death due to COVID-19 (ORs = 1.12–1.35). Higher levels of FAAH2 were solely associated with an increased risk of hospitalization (OR = 1.19). On the contrary, higher levels of SELL, SELE, and PECAM-1 decrease risk of hospitalization or need for respiratory support/death (ORs = 0.80–0.91). Higher levels of LCTL, SFTPD, KEL, and ATP2A3 were solely associated with a decreased risk of hospitalization (ORs = 0.86–0.93), whilst higher levels of ICAM-1 were solely associated with a decreased risk of respiratory support/death of COVID-19 (OR = 0.84). Our findings implicate blood group markers and binding proteins in both hospitalization and need for respiratory support/death. They, additionally, suggest that higher levels of endocannabinoid enzymes may increase the risk of hospitalization. Our research replicates findings of blood markers previously associated with COVID-19 and prioritises additional blood markers for risk prediction of severe forms of COVID-19. Furthermore, we pinpoint druggable targets potentially implicated in disease pathology.

As of November 2021, more than five million people have died due to COVID-19. Although vaccinations provide good protection, it is important to fully understand the biology behind the severe forms of COVID-19. Mendelian randomization facilitates the identification of blood proteins that may be involved in the pathophysiology of severe forms. Here, we investigated whether >3,000 blood proteins might play a role in hospitalization due to COVID-19 or the requirement of respiratory support or death due to COVID-19. Using genetic instruments and under the assumption of Mendelian randomization, our results are consistent with higher levels of five proteins being causally associated with an increased risk of both COVID-19 outcomes and higher levels of one protein associated with hospitalization. Our results are also consistent with higher levels of four proteins–mainly playing a role in cell adhesion–being causally associated with a decreased risk of hospitalization and respiratory support/death, and higher levels of four proteins being causally associated with a decreased risk of hospitalization. These proteins may represent new biomarkers useful in risk prediction of severity and may lead to new therapeutics by prioritizing druggable targets.

Funding: BM and GB are supported to conduct COVID-19 neuroscience research (The Covid-19 Clinical Neuroscience Study (COVID-CNS)) by the Medical Research Council (UKRI/MRC; MR/V03605X/1); for additional neurological inflammation research due to viral infection BM is also supported by grants from the MRC/UKRI (MR/V007181//1), MRC (MR/T028750/1) and Wellcome (ISSF201902/3). CH acknowledges funding from Lundbeckfonden (R276-2018-4581). MJG is supported for neuroscience research internationally by MRC Newton Fund (MR/S019960/1), MRC Developmental Pathway Funding Scheme (MR/R015406/1), and National Institute for Health Research (NIHR; 153195 17/60/67, 126156 17/63/11, and 200907). DKM is also funded by the NIHR (through the Cambridge NIHR Biomedical Research Centre) and by the Addenbrooke’s Charities Trust. This paper represents independent research partially funded by the National Institute for Health Research (NIHR) Biomedical Research Centre (BRC) at the South London and Maudsley NHS Foundation Trust and King’s College London. The authors acknowledge use of the research computing facility at King’s College London, Rosalind ( https://rosalind.kcl.ac.uk ), which is delivered in partnership with the National Institute for Health Research (NIHR) Biomedical Research Centres at South London & Maudsley and Guy’s & St. Thomas’ NHS Foundation Trusts, and part-funded by capital equipment grants from the Maudsley Charity (award 980) and Guy’s & St. Thomas’ Charity (TR130505). The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript.

These GWASs represent a powerful source of information to identify new biomarkers and therapeutic leads for drug development or repositioning. The method of Mendelian randomization can investigate the relationship between immunomodulatory blood proteins and a severe COVID-19 infection. Mendelian randomization exploits the fact that alleles are randomly inherited from parent to offspring in a manner analogous to a randomized-controlled trial, and allows estimation of putative causal effects of an exposure on a disease while avoiding confounding environmental effects, thus overcoming some of the limitations of observational studies. Recent advancements in Mendelian randomization methods allow use of GWAS summary statistics to identify genetic proxies (i.e., instrumental variables) of modifiable risk factors and test their association with disease outcomes [ 14 , 15 ]. We, therefore, conducted Mendelian randomization analyses between high levels of a large number of blood proteins and COVID-19, highlighting specific proteins associated with an increased risk of hospitalization due to COVID-19 and once hospitalized, an increased risk for need of respiratory support/death due to COVID-19. We identified putative causal associations that help us understand how innate differences in protein levels can affect the COVID-19 disease course and which proteins could be prioritized in clinical studies.

Numerous genome-wide association studies (GWASs) in healthy populations associated genetic variants with immunomodulatory blood proteins [ 11 – 13 ]. In addition, the COVID-19 Host Genetics Initiative carried out GWASs on COVID-19 outcomes to understand the role of host genetic factors in susceptibility and severity of COVID-19 [ 8 ]: the first GWAS associated genetic variants with hospitalization due to COVID-19; and the second with need for respiratory support and death subsequently to a COVID-19 hospitalization. The findings suggest that once hospitalized another set of genetic variants may be responsible for a severe respiratory form of COVID-19, which may lead to the need for respiratory support or death.

A dysregulated pro- and anti-inflammatory immunomodulatory response is thought to drive much of the pathophysiology of COVID-19 and comprises alveolar damage, lung inflammation and pathology of an acute respiratory distress syndrome [ 1 , 2 , 6 , 7 ]. Given that the innate immune response has an individual-level genetic basis, genetic variants carried by an individual could play an important role in the individual-level immune response and, therefore, may influence progression and severity of COVID-19. This individual difference may also be key in our understanding of why some individuals require hospitalization due to the severity of their symptoms, whilst others are able to recover from COVID-19 without hospitalization [ 8 ]. In addition, once hospitalized, this individual difference may drive some people towards fatal outcomes or intensive care with respiratory support, whilst others are discharged from hospitals without respiratory complications.

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was identified in late 2019 in Wuhan, China, and is commonly referred to as coronavirus disease 2019 (COVID-19) that rapidly evolved into a global pandemic [ 1 , 2 ]. As of November 2021, more than 240 million cases have been confirmed worldwide with total deaths exceeding 5 million [ 3 ]. COVID-19 pathology encompasses a wide spectrum of clinical manifestations from asymptomatic, mild, moderate, to 15% being severe infections [ 1 , 2 , 4 ]. Severe COVID-19 commonly requires hospitalization and intensive care with assisted respiratory support, and respiratory failure is the most common reason for COVID-19 associated mortality [ 5 ].

Finally, given that many inflammatory and immunomodulatory proteins share genetic loci and may therefore be driving associations via genetically correlated SNPs in high linkage disequilibrium, we computed pairwise linkage disequilibrium for all SNPs used as instrumental variables of blood proteins that were significantly associated in our analyses. To calculate linkage disequilibrium, we used LDlink [ 42 ] and the CEU population panel (Utah residents from North and West Europe) as the reference.

Furthermore, when possible, we performed GSMR using only variants in the cis region of the gene encoding the blood marker (defined as variants either within a gene, up to 1 Mb proximal to the start of the gene, or up to 1 Mb distal to the end of the gene). Gene information was obtained from ensembl [ 36 ] using the biomaRt library [ 37 ], SNP information was obtained from NCBI dbSNP [ 38 ] using the rsnps library [ 39 ] (Gustavsen et al., “Get ‘SNP’ (‘Single-Nucleotide’ ‘Polymorphism’) Data on the Web [R Package Rsnps Version 0.4.0]” 2020).

For all GWASs, SNPs used as instrumental variables were selected by applying a suggestive genome-wide p-value threshold (p < 5 x 10 −6 ), to identify enough SNPs (i.e., at least 5) in common between the exposure (e.g., blood marker) and outcome (e.g. COVID-19 hospitalization). Note that although reducing the p-value threshold may introduce potential false positive SNPs as instruments, SNPs with the strongest effect sizes are robust and reliable for conducting MR. The use of a lower p-value threshold in numerous MR studies is common [ 27 – 31 ], and we additionally calculated F-statistics and I-squared statistics to transparently present the strength of our instruments ( S2 Table ). We needed to take this analytical step as GWAS of blood proteins with more statistical power are not available at this time.

To examine the influence of blood proteins on the risk of developing severe COVID-19, we selected genetic variants, single nucleotide polymorphisms (SNPs), that were strongly associated with actual blood protein levels in 5,504 genome-wide analyses of single proteins using robust methodologies (see S1 Data , for more details on how the proteins were measured, and instruments for all significant proteins). Using these genetic loci as proxies for protein levels, we performed an analysis using Mendelian randomization, a method that enables tests of putative causal associations of these blood proteins with the development of severe COVID-19. We used the Generalized Summary data-based Mendelian randomization (GSMR) method as the base method [ 26 ]. GSMR tests for putative causal associations between a risk factor and a disease using multi-SNP effects from GWAS summary data. The HEIDI-outlier approach in GSMR removes SNP instruments with strong putative pleiotropic effects. In addition, GSMR accounts for linkage disequilibrium (LD) among SNPs not removed by clumping using a reference dataset for LD estimation. In this study, the European 1,000 Genomes dataset was used as the reference dataset [ 18 ].

In order to capture increased risk of very severe respiratory COVID-19, including respiratory support and death, we downloaded the COVID-19 Host Genome Initiative GWAS meta-analysis [ 8 ] of “very severe respiratory confirmed covid vs. population”, European ancestry (A2_ALL_eur, release 5, January 2021; www.covid19hg.org/results/ r5). Cases were defined as SARS-CoV-2 infected individuals who were admitted to hospital, had COVID-19 as the primary reason for admission, and had died or needed respiratory support (i.e., intubation, continuous positive airway pressure, or bilevel positive airway pressure). Controls were defined as non-cases, i.e. the population [ 8 ]. The European sample consisted of 5,101 cases and 1,383,241 controls. In this study, we refer to this GWAS as the respiratory support/death-COVID-19 GWAS.

In total, we included ten publications for which summary data was readily available and processed those using standard GWAS summary statistics quality control metrics including removal of incomplete genetic variants, variants with information metrics of lower than 0.6 and allele frequencies more extreme than 0.005 or 0.995. Allele frequencies were estimated from raw genotypes of the European 1,000 Genomes Project dataset, where needed [ 25 ]. See S1 Table for a full list of studies included in these analyses. Links to the summary statistics are also provided. Note that all protein measurements from all studies described above were included in our analyses, meaning that some proteins were analyzed more than once.

In total, we amassed 5,305 sets of GWAS summary statistics for blood biomarkers [ 11 – 13 , 16 – 23 ]. A systematic search was performed based on the ontology lookup service (OLS; www.ebi.ac.uk/ols/index ) using R and the packages ‘rols’ and ‘gwasrapidd’ between July 7th and July 27th, 2020. OLS is a repository for biomedical ontologies, such as gene ontology (GO) or the experimental factor ontology (EFO), including a systematic description of many experimental variables. First, all subnodes of the EFO ‘protein measurements’ (EFO:0004747) were determined using an iterative process based on the package ‘rols’. Overall, 628 unique EFO IDs were determined. Subsequently, all genetic associations reported in the GWAS Catalog [ 24 ] ( www.ebi.ac.uk/gwas/ ) were identified and linked to the corresponding study using the package ‘gwasrapidd’. One hundred and seventy-eight unique GWAS catalog accession IDs with available summary statistics were curated manually before inclusion. In order to expand the dataset, studies published at a later date were included manually and the first and the last author of studies without publicly available summary statistics were contacted.

High BMI has been robustly associated with both inflammatory cytokine levels in blood and an increased risk of severe COVID-19 [ 9 , 40 , 41 ]. After validating this relationship by performing Mendelian randomization analysis between BMI and the two COVID-19 outcomes ( S9 Table ), we performed bidirectional Mendelian randomization analyses between all significant blood markers and BMI, using the largest publicly available BMI GWAS [ 50 ]. Our results indicate that genetic susceptibility for higher BMI is associated with higher levels of SELE_Scal, C1GALT1C1_Sun, SELE_Folk, KEL_Sun, SELL_Sun, RAB14_Sun and SFTPD_Breth. In addition, the genetic susceptibility for higher levels of LCTL_Sun, SELE_Sliz, SFTPD_Breth, PECAM-1_Scal and RAB14_Sun is associated with higher BMI (see S10 Table for full results). Note that some protein GWASs had controlled for BMI, whereas others, the majority of which show a significant association with BMI, did not (see S1 Table ).

In order to establish whether the significant associations identified in our study were being driven by cis-regulatory variants, we identified cis-SNPs from all significant blood markers and performed Mendelian randomization analyses using only these SNPs with the respective COVID-19 GWASs. Out of the 28 exposure-outcome pairs, only seven (25%) where based on at least one cis-SNP and could therefore be analysed. The results show that ABO_Sun and SFTPD_Breth cis-SNPs are significantly associated with hospitalization, and ABO_Sun and sICAM-1_Sliz cis-SNPs are significantly associated with need for respiratory support/death. All other significant associations are deemed to be driven by trans-SNPs ( S8 Table ).

In our analysis of the need for respiratory support/death due to COVID-19, we used 305 unique SNPs as instrument variables for blood proteins; however, in 31 cases, one SNP was used twice as an instrument for two or more different proteins, 28 of which were located on chromosome 9. Furthermore, assessing pairwise LD between all SNPs, we found 50 SNP pairs (based on 65 unique SNPs) in high LD (r 2 > 0.6). Most pairs (n = 44) were located on chromosome 9 carrying the ABO gene ( S7 Table and S6 Fig ) [ 47 – 49 ].

In our analysis of hospitalization as a result of COVID-19, we used 296 unique SNPs as instrument variables for blood proteins, in 30 cases, one SNP was used as an instrument for two or more different proteins, 27 of which were located on chromosome 9. In the assessment of pairwise LD, 40 SNP pairs (based on 55 unique SNPs) showed high LD (r 2 > 0.6). Most pairs (n = 37) were located on chromosome 9 carrying the ABO gene ( S6 Table and S5 Fig ).

Given that we are using a reduced p-value threshold to identify SNPs as our genetic instruments, we calculated F-statistic and I-squared on all our significantly associated proteins. We found that with the exception of ATP2A3, all F-statistics were larger than 20 and all I-squared statistics were larger than 0.9, suggesting strong genetic instruments for use in analyses [ 44 – 46 ]. See S2 Table for full results of these analyses.

After multiple testing correction (p FDR = 0.05), using genetic instruments and under the assumptions of Mendelian randomization, our results were consistent with eight blood markers being causally associated with a statistically significantly decreased risk of need for respiratory support/death due to COVID-19 ( Figs 2 and S4 ); the reverse associations with these blood markers were nonsignificant ( S3 Table ). Per standard deviation (SD) increase in the respective blood marker the increases in odds for respiratory support/death ranged from 11 to 27%, with the platelet endothelial cell adhesion molecule (PECAM-1) showing the strongest effect size: OR = 0.73 (95% CI: 0.63, 0.83, q ≤ 0.01; Table 1 ).

Summary figure of the false discovery rate-corrected (p FDR = 0.05) Mendelian randomization results using the Generalised Summary data-based Mendelian randomization (GSMR) method. Using genetic instruments and under the assumptions of Mendelian randomization, this figure displays: (A) Summary figure when respiratory support/death-COVID-19 is the outcome of interest; (B) Summary figure when respiratory support/death-COVID-19 is the exposure of interest. Odds ratios (ORs) of blood markers causally associated with the need for respiratory support/death due to COVID-19 are displayed on the x-axis (with 95% confidence intervals). The blood markers are displayed on the y-axis. The dashed line at one represents an odds ratio of one (i.e., no effect). Using genetic instruments and under the assumptions of Mendelian randomization, five blood markers were causally associated with a significantly increased risk for need for respiratory support/death due to COVID-19 and eight blood markers were causally associated with a significantly decreased risk for respiratory support/death (q FDR ≤ 0.05). ABO = ABO system transferase; C1GALT1C1 = C1GALT1 specific chaperone 1; CD207 = langerin; GCNT4 = glucosaminyl (N-Acetyl) transferase 4; LCTL = lactase-like protein; PECAM1 = platelet endothelial cell adhesion molecule; RAB14 = ras-related protein rab-14; SELE = E-selectin; SELL = L-selectin; sICAM1 = soluble intercellular adhesion molecule-1.

After multiple testing correction (p FDR = 0.05), using genetic instruments and under the assumptions of Mendelian randomization, our results were consistent with five blood markers being causally associated with need for respiratory support/death due to COVID-19 ( Figs 2 and S3 ); the reverse associations with these blood markers as outcomes were nonsignificant ( S3 Table ). Per standard deviation (SD) increase in these respective blood marker the increase in odds for respiratory support/death ranged from 12 to 35%, with glucosaminyl (N-Acetyl) transferase 4 (GCNT4) showing the strongest effect: OR = 1.35 (95% CI: 1.26, 1.44, q ≤ 0.01; Table 1 ).

After multiple testing correction (p FDR = 0.05), using genetic instruments and under the assumptions of Mendelian randomization, our results were consistent with nine blood markers being significantly causally associated with a decreased risk of hospitalization as a result of COVID-19 ( Figs 1 and S2 ); the reverse associations with these nine blood markers were nonsignificant ( S3 Table ). Per SD increase in the respective blood marker the decreases in odds for hospitalization ranged from 7 to 20%, with the platelet endothelial cell adhesion molecule (PECAM-1) showing the strongest effect: OR = 0.80 (95% CI: 0.73, 0.87, q ≤ 0.01; Table 1 ).

Summary figure of the false discovery rate-corrected (p FDR = 0.05) Mendelian randomization results using the Generalized Summary data-based Mendelian randomization (GSMR) method. Using genetic instruments and under the assumptions of Mendelian randomization, this figure displays: (A) Summary figure when hospitalization-COVID-19 is the outcome of interest; (B) Summary figure when hospitalization-COVID-19 is the exposure of interest. Odds ratios (ORs) of the blood markers causally associated with hospitalized-Covid-19 are displayed on the x-axis (with 95% confidence intervals). The blood markers are displayed on the y-axis. The dashed line at one represents an odds ratio of one (i.e., no effect). Using genetic instruments and under the assumptions of Mendelian randomization, six blood markers were causally associated with a significantly increased risk for hospitalization COVID-19 and nine blood markers were causally associated with a significantly decreased risk for hospitalization (q FDR ≤ 0.05). ABO = ABO system transferase; ATP2A3 = ATPase sarcoplasmic/endoplasmic reticulum Ca2+ transporting 3; C1GALT1C1 = C1GALT1 specific chaperone 1; CD207 = langerin; FAAH2 = fatty acid amide hydrolase 2; GCNT4 = glucosaminyl (N-Acetyl) transferase 4; KEL = Kell metallo-endopeptidase (Kell Blood Group); LCTL = lactase-like protein; PECAM1 = platelet endothelial cell adhesion molecule; RAB14 = ras-related protein rab-14; SELE = E-selectin; SELL = L-selectin; SFTPD = surfactant protein D.

After multiple testing correction (p FDR = 0.05), using genetic instruments and under the assumptions of Mendelian randomization, our results were consistent with six blood markers being significantly causally associated with an elevated risk of hospitalization as a result of COVID-19 ( Figs 1 and S1 ); the reverse associations with risk of hospitalization as exposure and these six blood markers as outcome revealed no significant associations ( S3 Table ). Per standard deviation (SD) increase in the respective blood marker our results were consistent with an increase in odds for hospitalization ranging from 7 to 19%, with fatty acid amide hydrolase 2 (FAAH2) showing the strongest effect: odds ratio (OR) = 1.19 (95% CI: 1.12, 1.25, q ≤ 0.01; Table 1 ).

Details of significant, false discovery rate-(FDR)-corrected Mendelian randomization results, using the Generalized Summary Data-based Mendelian randomization (GSMR) method. Using genetic instruments and under the assumptions of Mendelian randomization, the top section shows results consistent with six blood markers being significantly causally associated with an increased risk of hospitalization as a result of COVID-19 and the nine blood markers causally associated with a decreased risk of hospitalization, as well as one protein showing a decrease in risk of hospitalization. The bottom section shows results consistent with five blood markers being significantly causally associated with an increased risk for the need of respiratory support/death due to COVID-19 and eight blood markers causally associated with a decreased risk for the need of respiratory support/death due to COVID-19, as well as one protein decreasing risk for the need of respiratory support/death due to COVID-19. The table presents the log odds statistics (i.e., beta) and corresponding standard error as well as odds ratios, 95% confidence intervals, and the FDR-adjusted Q values (p FDR = 0.05).

We tested 3,890 associations with hospitalization-COVID-19 as the exposure and blood proteins as outcome (yielding 1 statistically significant association); and in reverse, 5,314 associations of blood proteins as the exposure and hospitalization-COVID-19 as the outcome (yielding 15 statistically significant associations). Additionally, we tested 2,687 associations with need for respiratory support/death-COVID-19 as the exposure (yielding 1 significant association); and in reverse, 3,273 associations with respiratory support/death-COVID-19 as the outcome (yielding 13 significant associations, Table 1 ). Our results show for some proteins, robust associations with the same proteins twice, as they were measured twice in independent GWASs, serving as direct replication. In order to easily identify these proteins, we added a suffix of the study name to the protein. In addition, note that units of protein measurement differed in studies, with some studies using standardized units (see studies in S1 Table ). Thus, we will report our findings per standard deviation increase.

Discussion

Using genetic instruments and under the assumptions of Mendelian randomization, our proteome-wide analyses are consistent with higher levels of certain blood proteins being causally associated with risk of being hospitalized due to COVID-19, and subsequently experiencing the most severe form including respiratory support or ending lethal (i.e., respiratory support/death in the following). All these proteins have detectable blood plasma or serum levels. For our discussion, we grouped the proteins by function in Table 2 and provided more detail in S11 Table.

It is important to note that, in our analyses, we did not identify typical canonical immune proteins, such as interleukin 6 or C-reactive protein [51,52]. This suggests that with a larger database of proteins we can pinpoint non-canonical immunomodulatory proteins relevant to disease pathophysiology. We did, however, estimate associations for some proteins twice, as they were measured separately in independent GWASs. Results from these analyses displayed the same direction of effects, rendering them a direct replication and increasing the validity of our findings.

Blood group proteins In the blood group protein group, using genetic instruments and under the assumptions of Mendelian randomization, our findings were consistent with ABO being causally associated with both an increased risk of hospitalization as well as the requirement of respiratory support or death by COVID-19 (i.e., respiratory support/death). ABO is an enzyme with glycosyltransferase activity that determines the ABO blood group of an individual [53]. However, the precise blood group associated with the increased risk for hospitalization as a result of COVID-19 cannot be determined from our results, as the probe for the blood marker measures both the A and B isoform of the protein while not showing a signal for O. Given the underlying British population of the original GWAS, A should be the more prevalent blood group (24%) in the sample compared to B (8%) [54]. Nevertheless, it is more likely that A, B, or the combination of A and B is associated with higher risk for hospitalization. Our findings confirm previous reports of the ABO blood group system being an important risk factor for a severe COVID-19 infection. For example, the proportion of group A is higher in COVID-19 positive individuals than in controls [55–60], and group A has been associated with higher mortality [61]. All evidence taken together suggests that blood group A is the more likely candidate for follow-up studies. Additionally, KEL, which is part of the complex Kell blood group system that contains many highly immunogenic antigens [62], was associated with a decreased risk of hospitalization as a result of COVID-19. This supports the notion that Kell negative individuals may be more susceptible to COVID-19 [63].

Antigen recognition In our study, CD207, also known as langerin, was associated with hospitalization as well as the requirement of respiratory support or death by COVID-19. This protein is exclusively expressed in Langerhans cells (LC)–the first dendritic cells to encounter pathogens entering the body via the mucosa or skin [64]. Langerin binds COVID-19 glycoprotein glycans; however, it does not mediate transfection of COVID-19 pseudovirions in a T-lymphocyte cell line [65], rendering its role in COVID-19 infections inconclusive [66]. Our findings showed evidence consistent with high levels of SFTPD potentially protecting against COVID-19 hospitalization. SFTPD is strongly expressed in lung, brain, and adipose tissue, and contributes to the lung’s defense against microorganisms, antigens, and toxins [67]. SFTPD also interacts with COVID-19 spike proteins [68]. In COVID-19, expression findings are mixed: some studies show that SFTPD is highly expressed in the lungs of COVID-19 patients [68], whereas others evidence a decreased expression [69]. Moreover, another study which investigated gene expression patterns in COVID-19-affected lung tissue and SARS-CoV-2 infected cell-lines, report a downregulation of SFTPD along with several regulatory partners [70]. Given its role in immunomodulation and air exchange in the lung, this supports our finding that higher levels of SFTPD may be causally associated with COVID-19 immunity [71,72]. Although more research is needed; however, ours and others’ findings imply that SFTPD may protect against severe forms of COVID-19.

Adhesion molecules Using genetic instruments and under the assumptions of Mendelian randomization, our analysis was consistent with the adhesion molecules SELE, SELL, and PECAM-1 being causally associated with a decreased risk of both hospitalization and the requirement of respiratory support/death by COVID-19, while ICAM-1 was only protective against respiratory support/death. This is in keeping with results from out pathway analyses which suggest a significant enrichment in the KEGG pathway “cell adhesion molecules”. Studies have suggested that late stage COVID-19 is an endothelial disease [73]. The vascular endothelium is the crucial interface between blood and other tissues, regulating vascular structure, permeability, vasomotion, inflammation, and oxidative stress [73]. SELL and SELE are members of the selectin class of leukocyte adhesion molecules, which facilitate slow rolling of blood leukocytes along the endothelium [73]. Specifically, SELL promotes initial tethering and rolling of leukocytes to the endothelium [74,75] and SELE is responsible for the accumulation of blood leukocytes at sites of inflammation by mediating the adhesion of cells to the vascular lining [76,77]. The firm binding of leukocytes to the endothelial surface depends upon other molecules, such as PECAM-1, which is a cell adhesion molecule required for leukocyte transendothelial migration under most inflammatory conditions [78,79], and our results were consistent with it being protective against hospitalization. Once tightly bound, chemoattractant cytokines can signal to the bound leukocytes to traverse the endothelial monolayer and enter tissues where they can combat pathogenic invaders and initiate tissue repair [80]. This may be one of the biological explanations why we saw elevated levels of SELL, SELE, and PECAM-1 as being protective against hospitalization. ICAM-1–our results being consistent with it being protective against hospitalization–mediates cell-cell adhesion and is involved in inflammation [81]. Contrary to our findings, higher ICAM-1 levels have been associated with COVID-19 severity [82,83], requiring follow-up investigations. In summary, molecules that mediate the interaction between immune cells and blood vessels may be important in late stage COVID-19 and moderate severity.

Transporter molecules In the protein transporter/trafficking group, using genetic instruments and under the assumptions of Mendelian randomization, our results were consistent with RAB14 being causally associated with an increased risk of hospitalization and respiratory support/death, whereas ATP2A3 may protect against hospitalization. Rab proteins are central regulators of phagosome maturation; RAB14 particularly regulates the interaction of phagosomes with early endocytic compartments [84]. One study identified RAB14 GTPases as a critical COVID-19 host factor: coronaviruses hijack Rab GTPase in host cells to replicate [85]. Additionally, whole genome analysis of COVID-19 lung tissue identified RAB14 polymorphisms that alter its binding to some miRNAs [86]. Therefore, Rab GTPases could be therapeutic targets. ATP2A3 is a magnesium-dependent ATP hydrolase, transports calcium from the cytosol into the sarcoplasmic/endoplasmic reticulum involved in muscular excitation/contraction and contributes to calcium sequestration [87,88]. Note that ATP2A3 has previously been genetically associated with severe COVID-19, but its exact role in infection remains unclear [89]. Cardiac failure in severe COVID-19 has been reported [90,91], hence, ATP2A3 may be involved as it regulates cardiomyocyte contraction [92]. However, note that ATP2A3 did not survive our sensitivity analyses, so this finding would need further validation.

[END]

[1] Url: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1010042

(C) Plos One. "Accelerating the publication of peer-reviewed science."
Licensed under Creative Commons Attribution (CC BY 4.0)
URL: https://creativecommons.org/licenses/by/4.0/


via Magical.Fish Gopher News Feeds:
gopher://magical.fish/1/feeds/news/plosone/