10 November 2015: Molecular Biology
Potential Role of lncRNAs in Contributing to Pathogenesis of Intervertebral Disc Degeneration Based on Microarray Data
Yu Chen BC , Haijian Ni BD , Yingchuan Zhao BC , Kai Chen C , Ming Li AEF , Cheng Li E , Xiaodong Zhu F , Qiang Fu F
DOI: 10.12659/MSM.894638
Med Sci Monit 2015; 21:3449-3458
Abstract
BACKGROUND: Our study intended to identify potential long non-coding RNAs (lncRNAs) and genes, and to elucidate the underlying mechanisms of intervertebral disc degeneration (IDD).
MATERIAL AND METHODS: The microarray of GSE56081 was downloaded from the Gene Expression Omnibus database, including 5 human control nucleus pulposus tissues and 5 degenerative nucleus pulposus tissues, which was on the basis of GPL15314 platform. Identification of differentially expressed lncRNAs and mRNAs were performed between the 2 groups. Then, gene ontology (GO) and pathway enrichment analyses were performed to analyze the biological functions and pathways for the differentially expressed mRNAs. Simultaneously, lncRNA-mRNA weighted coexpression network was constructed using the WGCNA package, followed by GO and KEGG pathway enrichment analyses for the genes in the modules. Finally, the protein-protein interaction (PPI) network was visualized.
RESULTS: A total of 135 significantly up- and 170 down-regulated lncRNAs and 2133 significantly up- and 1098 down-regulated mRNAs were identified. Additionally, UBA52 (ubiquitin A-52 residue ribosomal protein fusion product 1), with the highest connectivity degree in PPI network, was remarkably enriched in the pathway of metabolism of proteins. Eight lncRNAs – LINC00917, CTD-2246P4.1, CTC-523E23.5, RP4-639J15.1, RP11-363G2.4, AC005082.12, MIR132, and RP11-38F22.1 – were observed in the modules of lncRNA-mRNA weighted coexpression network. Moreover, SPHK1 in the green-yellow module was significantly enriched in positive regulation of cell migration.
CONCLUSIONS: LncRNAs LINC00917, CTD-2246P4.1, CTC-523E23.5, RP4-639J15.1, RP11-363G2.4, AC005082.12, MIR132, and RP11-38F22.1 were differentially expressed and might play important roles in the development of IDD. Key genes, such as UBA52 and SPHK1, may be pivotal biomarkers for IDD.
Keywords: Biomarkers - metabolism, Databases, Genetic, Gene Expression Regulation, gene ontology, Intervertebral Disc Degeneration - genetics, Oligonucleotide Array Sequence Analysis, Protein Interaction Mapping, RNA, Long Noncoding - genetics, RNA, Messenger - metabolism
Background
Intervertebral disc degeneration (IDD) is an important cause of low back pain, which affects patient quality of life and causes heavy socioeconomic burdens [1,2]. Currently, low back pain is mainly treated by medication combined with physiotherapy. Despite progress in surgical therapy, it is not curative and is related with several complications [3]. Fortunately, gene therapy has attracted more attention in recent years [4].
Several studies have unveiled a large number of genes or proteins that are implicated in IDD. For example, a former study has demonstrated that
Recently, aberrant expression of long non-coding RNAs (lncRNAs) is shown to cause disordered gene expression. LncRNAs are mRNA-like transcripts ranging in length from 200 nt to 100 kb, and they lack significant open reading frames [13,14]. In spite of this, accumulating studies have indicated that lncRNAs play crucial roles in many biological processes and human diseases [15,16]. For example, Wan et al. [17] demonstrated that over-expressed lncRNA RP11-296A18.3 induced the up-regulation of
In our study, we intended to use the same microarray profile to further identify the differentially expressed lncRNA and mRNAs. Moreover, gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, construction of lncRNA-mRNA weighted coexpression network, and protein-protein interaction (PPI) network were used to elucidate the molecular mechanism in the process of IDD. We believe that different results obtained from the microarray profile of Wan et al. might provide a profound understanding of IDD.
Material and Methods
TISSUE SAMPLES AND DATA ACQUISITION:
The gene expression data of GSE56081 deposited by Wan et al. [17] was downloaded from the National Center of Biotechnology Information (NCBI) Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) database, which was on the basis of GPL15314 platform of Arraystar Human LncRNA microarray V2.0 (Agilent_033010 Probe Name version). A total of 10 samples (5 human control nucleus pulposus tissues and 5 degenerative nucleus pulposus tissues) were included for the development of this microarray.
DATA PREPROCESSING:
Expression data of probes were converted to corresponding genes symbols according to the annotation of GPL15314 platform. Aggregate() function of R was used to compute the average expression value, which was used for genes corresponding to multi-probes, and KNN method [18] of Impute package of R [19] was used to add the missing values of probes.
Then, quantile normalization was carried out using preprocessCore package of R [20]. Subsequently, the expression matrix was obtained.
IDENTIFICATION OF DIFFERENTIALLY EXPRESSED LNCRNAS AND MRNAS:
Limma (linear models for microarray data) package [21] in R was used to identify differentially expressed lncRNAs and mRNAs between the 2 groups through use of the t test. Multiple testing correction was implement by calculating the Benjamini-Hochberg [22] false discovery rate (FDR). |log2 fold-change| >1 and an FDR <0.05 were regarded as the criteria for differential expression.
GENE ONTOLOGY (GO) AND PATHWAY ENRICHMENT ANALYSIS:
GO analysis is frequently used in functional enrichment studies of large-scale genes [23]. Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis was performed to analyze the biological pathways, involving the differentially expressed mRNAs. In the current study, TargetMine (http://targetmine.nibio.go.jp) [24] was used to investigate the functional enrichment condition for the up- and down-regulated differentially expressed mRNAs. FDR less than 0.05 was selected as the threshold.
THE CONSTRUCTION OF LNCRNA-MRNA WEIGHTED COEXPRESSION NETWORK:
WGCNA package [25] in R was used to construct the lncRNA/mRNA coexpression network. The network construction steps included the following: (1) Network construction: the lncRNA-mRNA weighted coexpression network is fully specified by its adjacency matrix amn, and amn encodes the network connection strength between nodes m and n. In order to calculate the adjacency matrix, the default approach defines the co-expression similarity Smn as the absolute value of the correlation coefficient between nodes of m and n: Smn=|cor(am, an)|. The weighted adjacency amn between 2 genes is proportional to their similarity on a logarithmic scale with β ≥0.8, log(amn)=β×log(Smn). Adjacency functions were obtained by approximate scale-free topology criterion. Then, the adjacency matrix was converted into topology matrix; (2) Module detection: we applied Dynamic Tree Cut method and Static Tree Cut method to detect modules with a minimum of 30 lncRNA/genes; and (3) Correlation analysis between disease information and gene module using the correlation coefficient method and gene significance (GS) value.
Then, we performed the GO and KEGG pathway enrichment analysis of genes in the module.
PPI NETWORK CONSTRUCTION:
The interaction of the proteins were analyzed by means of the online STRING database (Search Tool for the Retrieval of Interacting Genes, http://string-db.org/) [26] and the combined score >0.9 was used as the cut-off criterion. Subsequently, Cytoscape (http://cytoscapeweb.cytoscape.org/) [27] was used to display the PPI network.
Results
DATA PREPROCESSING:
On the basis of the annotation information of GPL56081, a total of 18 208 probes were identified, including 1970 lncRNAs and 16238 mRNAs. The data before and after normalization are shown in Figure 1A and 1B.
IDENTIFICATION OF DIFFERENTIALLY EXPRESSED LNCRNAS AND MRNAS:
Based on the cut-off criteria, 135 significantly up- and 170 down-regulated lncRNAs were screened out. Moreover, 2133 significantly up- and 1098 down-regulated mRNAs were observed, as shown in Figure 2.
GO AND PATHWAY ENRICHMENT ANALYSIS:
TargetMine was used to identify GO enriched functions for significant differentially expressed mRNAs. Top 5 GO and KEGG terms are listed in Table 1. Up-regulated genes were significantly enriched in macromolecule metabolic process (FDR=1.41E-03) and cellular macromolecule metabolic process (FDR=1.28E-03) of BP, membrane-bounded organelle (FDR=1.13E-08) and organelle (FDR=1.74E-07) of CC, RNA binding (FDR=1.31E-05), and nucleic acid binding (FDR=9.35E-04).
Significantly enriched KEGG pathways of up-regulated differentially expressed mRNAs were mainly on translation (FDR=2.24E-15) and cellular responses to stress (FDR=6.40E-10). However, the significant functions and pathways of down-regulated genes were not identified.
THE CONSTRUCTION OF LNCRNA-MRNA WEIGHTED COEXPRESSION NETWORK:
In the current study, a total of 7 weighted coexpression sub-networks were identified. The results of correlation analysis demonstrated that in addition to the grey module, the other 6 modules were highly relevant in intervertebral disc degeneration, as shown in Table 2 and Figure 3. Then, lncRNA-mRNA weighted coexpression network was constructed on the basis of the genes with top 30 connectivity degrees in 6 modules and P value <0.01. Among these, 8 lncRNAs were involved in the 4 modules (green module: CTC-523E23.5, RP4-639J15.1 and RP11-363G2.4; green-yellow module: CTD-2246P4.1 and LINC00917; magenta module: AC005082.12; purple module: MIR132 and RP11-38F22.1, Table 3) and were up-regulated (Figure 4A–4D). Notably, in these modules, red diamond nodes represented up-regulated lncRNA, pink square nodes represented up-regulated genes, and pastel green square nodes represented down-regulated genes (Figure 4).
Then, we performed the GO and KEGG pathway enrichment analysis of genes in the module. As shown in Table 4, the genes in the green module were significantly enriched in regulation of cell adhesion and response to axon injury. The genes in the green-yellow module were significantly enriched in positive regulation of cell migration and positive regulation of cell motion. The genes in the magenta module were significantly enriched in nucleotide catabolic process as well as nucleobase, nucleoside, nucleotide, and nucleic acid catabolic process. The genes in the purple module were significantly enriched in contractile fiber part and striated muscle thin filament.
PPI NETWORK:
The PPI network of significantly up-regulated genes was constructed using the STRING database. As shown in Figure 5, several PPI nodes were high in connectivity degrees, as follows: UBA52 (ubiquitin A-52 residue ribosomal protein fusion product 1, degree=51), RPS3 (ribosomal protein S3, degree=41), RPL9 (ribosomal protein L9, degree=38), RPL19 (ribosomal protein L19, degree=37), RPL23 (ribosomal protein L23, degree=36), RPL30 (ribosomal protein L30, degree=36), and RPS14 (ribosomal protein S14, degree=36).
Discussion
In this study, we identified 135 significantly up- and 170 down-regulated lncRNAs and 2133 significantly up- and 1098 down-regulated mRNAs. Additionally, up-regulated
Significantly, up-regulated
In the green-yellow module, lncRNA LINC00917 and CTD-2246P4.1 were remarkably higher in IDD samples relative to normal tissues. Moreover, their interacted genes, such as up-regulated
Furthermore, our study also identified the other 6 differentially expressed lncRNAs as well as hundreds of differentially expressed genes in different modules. CTC-523E23.5, RP4-639J15.1, and RP11-363G2.4 were identified and could interact with Integrin, Alpha 11 (ITGA11) in the green module. ITGA11 is shown to be differentially expressed and is associated with cell proliferation in IVD [38]. AC005082.12 was screened out and could interact with Ephrin-A3 (EFNA3) in the magenta module. MIR132 and RP11-38F22.1 could interact with Cathepsin L (CTSL) in the purple module. CTSL is demonstrated to degrade ECM and basement membrane components [39]. The destruction of ECM has been regarded to be an important cause of IDD [29,30]. Although the underlying mechanisms of these key lncRNAs in the development of IDD have not been fully discussed in our study, our results indicate that they may contribute to IDD development to some extent.
However, there were still some limitations in the current study. First, sample size was small which might limit the validity of the results. Second, no animal experiments were performed in our study to further validate these results. Thus, large sample sizes and more experiments, such as RNA interference and fluorescence
Conclusions
lncRNAs LINC00917, CTD-2246P4.1, CTC-523E23.5, RP4-639J15.1, RP11-363G2.4, AC005082.12, MIR132, and RP11-38F22.1 were differentially expressed and might play important roles in the development of IDD. Key genes, such as
References
1. Luoma K, Riihimäki H, Luukkonen R, Low back pain in relation to lumbar disc degeneration: Spine, 2000; 25; 487-92, pmid: 10707396
2. Vos T, Flaxman AD, Naghavi M, Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010: Lancet, 2013; 380; 2163-96, pmid: 23245607
3. Kumar M, Baklanov A, Chopin D, Correlation between sagittal plane changes and adjacent segment degeneration following lumbar spine fusion: Eur Spine J, 2001; 10; 314-19, pmid: 11563617
4. Tow BP, Hsu WK, Wang JC, Disc regeneration: a glimpse of the future: Clin Neurosurg, 2007; 54; 122, pmid: 18504908
5. Smolders LA, Meij BP, Onis D, Gene expression profiling of early intervertebral disc degeneration reveals a down-regulation of canonical Wnt signaling and caveolin-1 expression: implications for development of regenerative strategies: Arthritis Res Ther, 2013; 15; R23, pmid: 23360510
6. Le Maitre CL, Hoyland JA, Freemont AJ, Catabolic cytokine expression in degenerate and herniated human intervertebral discs: IL-1beta and TNFalpha expression profile: Arthritis Res Ther, 2007; 9; R77, pmid: 17688691
7. Lee JM, Song JY, Baek M, Interleukin-1β induces angiogenesis and innervation in human intervertebral disc degeneration: Orthop Res, 2011; 29; 265-69
8. Yoon ST, Kim KS, Li J: Spine, 2003; 28; 1773-80, pmid: 12923462
9. Eskola PJ, Kjaer P, Daavittila IM, Genetic risk factors of disc degeneration among 12–14-year-old Danish children: a population study: Int J Mol Epidemiol Genet, 2010; 1; 158-65, pmid: 21537388
10. Wang S-Z, Rui Y-F, Tan Q, Wang C, Enhancing intervertebral disc repair and regeneration through biology: platelet-rich plasma as an alternative strategy: Arthritis Res Ther, 2013; 15; 220, pmid: 24165687
11. Gui K, Ren W, Yu Y, Inhibitory effects of platelet-rich plasma on intervertebral disc degeneration: A preclinical study in a rabbit model: Med Sci Monit, 2015; 21; 1368-75, pmid: 25965093
12. Zhang Y-H, Zhao C-Q, Jiang L-S, Dai L-Y, Lentiviral shRNA silencing of CHOP inhibits apoptosis induced by cyclic stretch in rat annular cells and attenuates disc degeneration in the rats: Apoptosis, 2011; 16; 594-605, pmid: 21479580
13. Gibb EA, Brown CJ, Lam WL, The functional role of long non-coding RNA in human carcinomas: Mol Cancer, 2011; 10; 38-55, pmid: 21489289
14. Qi P, Du X, The long non-coding RNAs, a new cancer diagnostic and therapeutic gold mine: Mod Pathol, 2012; 26; 155-65, pmid: 22996375
15. Wilusz JE, Sunwoo H, Spector DL, Long noncoding RNAs: functional surprises from the RNA world: Genes Dev, 2009; 23; 1494-504, pmid: 19571179
16. Li J, Xuan Z, Liu C, Long non-coding RNAs and complex human diseases: Int J Mol Sci, 2013; 14; 18790-808, pmid: 24036441
17. Wan Z-Y, Song F, Sun Z, Aberrantly expressed long noncoding RNAs in human intervertebral disc degeneration: a microarray related study: Arthritis Res Ther, 2014; 16; 465, pmid: 25280944
18. Altman NS, An introduction to kernel and nearest-neighbor nonparametric regression: Am Stat, 1992; 46; 175-85
19. Hastie T, Tibshirani R, Narasimhan B, Gilbert C, Imputation for microarray data: Bioinformatics, 2001; 17; 520-25, pmid: 11395428
20. Bolstad BM, A collection of pre-processing functions: R package version, 2013; 1
21. Limma SG, Linear models for microarray data: Bioinformatics and computational biology solutions using R and Bioconductor, 2005; 397-420, New York, Springer
22. Benjamini Y, Hochberg Y, Controlling the false discovery rate: a practical and powerful approach to multiple testing: J R Stat Soc Series B, 1995; 289-300
23. Ashburner M, Ball CA, Blake JA, Gene ontology: tool for the unification of biology. The Gene Ontology Consortium: Nat Genet, 2000; 25; 25-29, pmid: 10802651
24. Chen Y-A, Tripathi LP, Mizuguchi K, TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery: PLoS One, 2011; 6; e17844, pmid: 21408081
25. Langfelder P, Horvath S, WGCNA: an R package for weighted correlation network analysis: BMC Bioinformatics, 2008; 9; 559, pmid: 19114008
26. Franceschini A, Szklarczyk D, Frankild S, STRING v9. 1: protein-protein interaction networks, with increased coverage and integration: Nucleic Acids Res, 2013; 41; D808-15, pmid: 23203871
27. Saito R, Smoot ME, Ono K, A travel guide to Cytoscape plugins: Nat Methods, 2012; 9; 1069-76, pmid: 23132118
28. Le Maitre C, Pockert A, Buttle D, Matrix synthesis and degradation in human intervertebral disc degeneration: Biochem Soc Trans, 2007; 35; 652-55, pmid: 17635113
29. Castro MM, Tanus-Santos JE, Inhibition of matrix metalloproteinases (MMPs) as a potential strategy to ameliorate hypertension-induced cardiovascular alterations: Curr Drug Deliv, 2013; 14; 335-43
30. Urban J, Roberts S, Degeneration of the intervertebral disc: Arthritis Res Ther, 2003; 5; 120-38, pmid: 12723977
31. Vo NV, Hartman RA, Yurube T, Expression and regulation of metalloproteinases and their inhibitors in intervertebral disc aging and degeneration: Spine J, 2013; 13; 331-41, pmid: 23369495
32. Lee JH, Lee MJ, Emerging roles of the ubiquitin-proteasome system in the steroid receptor signaling: Arch Pharm Res, 2012; 35; 397-407, pmid: 22477185
33. Goldberg AL, Protein degradation and protection against misfolded or damaged proteins: Nature, 2003; 426; 895-99, pmid: 14685250
34. Steagall RJ, Daniels CR, Dalal S, Extracellular ubiquitin increases expression of angiogenic molecules and stimulates angiogenesis in cardiac microvascular endothelial cells: Microcirculation, 2014; 21; 324-32, pmid: 24308702
35. Spiegel S, Milstien S, Sphingosine-1-phosphate: an enigmatic signalling lipid: Nat Rev Mol Cell Biol, 2003; 4; 397-407, pmid: 12728273
36. Limaye V, Li X, Hahn C, Sphingosine kinase-1 enhances endothelial cell survival through a PECAM-1–dependent activation of PI-3K/Akt and regulation of Bcl-2 family members: Blood, 2005; 105; 3169-77, pmid: 15632208
37. Ørom UA, Derrien T, Beringer M, Long noncoding RNAs with enhancer-like function in human cells: Cell, 2010; 143; 46-58, pmid: 20887892
38. Markova DZ, Kepler CK, Addya S, An organ culture system to model early degenerative changes of the intervertebral disc II: profiling global gene expression changes: Arthritis Res Ther, 2013; 15; R121, pmid: 24171898
39. Zhang L, Wei L, Shen G, Cathepsin L is involved in proliferation and invasion of ovarian cancer cells: Mol Med Rep, 2015; 11; 468-74, pmid: 25333746
In Press
Clinical Research
Institutional and Regional Variations in Access to Clinical Trials and Next-Generation Sequencing in Turkis...Med Sci Monit In Press; DOI: 10.12659/MSM.951027
Clinical Research
Low-Intensity Blood Flow-Restricted Multi-Joint Exercise Improves Muscle Function in Patients With Patellof...Med Sci Monit In Press; DOI: 10.12659/MSM.950516
Review article
Musculoskeletal Ultrasound and MRI in the Evaluation of Chemotherapy-Induced Peripheral Neuropathy: A ReviewMed Sci Monit In Press; DOI: 10.12659/MSM.951283
Clinical Research
Sensory Processing, Dissociation, and Affective Symptoms in Misophonia: A Cross-Sectional Study of 35 AdultsMed Sci Monit In Press; DOI: 10.12659/MSM.950938
Most Viewed Current Articles
17 Jan 2024 : Review article 10,187,196
Vaccination Guidelines for Pregnant Women: Addressing COVID-19 and the Omicron VariantDOI :10.12659/MSM.942799
Med Sci Monit 2024; 30:e942799
13 Nov 2021 : Clinical Research 3,708,487
Acceptance of COVID-19 Vaccination and Its Associated Factors Among Cancer Patients Attending the Oncology ...DOI :10.12659/MSM.932788
Med Sci Monit 2021; 27:e932788
14 Dec 2022 : Clinical Research 2,341,643
Prevalence and Variability of Allergen-Specific Immunoglobulin E in Patients with Elevated Tryptase LevelsDOI :10.12659/MSM.937990
Med Sci Monit 2022; 28:e937990
16 May 2023 : Clinical Research 706,524
Electrophysiological Testing for an Auditory Processing Disorder and Reading Performance in 54 School Stude...DOI :10.12659/MSM.940387
Med Sci Monit 2023; 29:e940387






