<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">AJMB</journal-id><journal-title-group><journal-title>American Journal of Molecular Biology</journal-title></journal-title-group><issn pub-type="epub">2161-6620</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ajmb.2012.24040</article-id><article-id pub-id-type="publisher-id">AJMB-24041</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Biomedical&amp;Life Sciences</subject></subj-group></article-categories><title-group><article-title>
 
 
  Genetic variation may play a crucial role in non-coding RNA biogenesis
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>eyalakshmi</surname><given-names>Kandhavelu</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Meenakshisundaram</surname><given-names>Kandhavelu</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>School of Veterinary Medicine, University of Camerino, Camerino, Italy</addr-line></aff><aff id="aff2"><addr-line>Department of Signal Processing, Tampere University of Technology, Tampere, Finland</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>meenakshisundaram.kandhavelu@tut.fi(MK)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>31</day><month>10</month><year>2012</year></pub-date><volume>02</volume><issue>04</issue><fpage>386</fpage><lpage>389</lpage><history><date date-type="received"><day>29</day>	<month>June</month>	<year>2012</year></date><date date-type="rev-recd"><day>31</day>	<month>July</month>	<year>2012</year>	</date><date date-type="accepted"><day>24</day>	<month>August</month>	<year>2012</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  Transcription, post-transcriptional modification, translation, post-translational modification, DNA replication, and signaling interaction of intra- and extra- cellular components are the relevant mechanisms in gene regulation. Transcription is one of the most important mechanisms in the control of gene expression. Further, post-transcriptional modifications play a crucial role after transcription which determine whether the transcribed gene is coding or non-coding RNA (ncRNAs). Genome-wide analysis of RNAs provides information about the coding RNAs, whereas the status of ncRNAs are still at large and must be discussed in detail as variations in the ncRNAs can lead to different phenotypes. In this short article, we discuss the role of genetic variation in ncRNA genes and how this variation may play a crucial role in ncRNA biogenesis that eventually leads to phenotypic variation and thus speciation.
 
</p></abstract><kwd-group><kwd>Genome Wide Analysis; Bioinformatics; Genetic Variation; Non-Coding RNA Biogenesis; RNA Structure; Speciation</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. DARK MATTERS IN GENE EXPRESSION</title><p>Our understanding of the principle mechanisms that orchestrate the central dogma of life is limited due to the lack of appropriate physical approaches [1,2] , i.e. accurate and advanced experimental facilities to understand the genetic variations, the limitations in the bioinformatics tools which predict DNA/RNA secondary structure, interactions, control elements, among others. Due to these limitations, a detailed understanding of the exact mechanisms underlying the processes of transcription and translation are still obscure. Transcription is a complex and tightly regulated mechanism in gene expression where the genes can be turned on or off based on internal or external signals [<xref ref-type="bibr" rid="scirp.24041-ref3">3</xref>] . The post transcriptional modifications of the transcripts designates them as coding (protein-coding) or non-coding RNAs (ncRNAs) [4-6] . Extensive research on RNA proves their significant role in gene expression [7,8] . Recently, a pilot study reported the properties of loss-of-function (LoF) variants of human protein-coding genes [<xref ref-type="bibr" rid="scirp.24041-ref9">9</xref>] , which show that each human individual carries ~100 LoF variants, with ~20 genes completely lost. These loci can be described as “dark matter” because intronic sequences hide crucial information in their sequences which affect gene expression. There is evidence that variations in the exonic or the intronic regions of DNA can affect the structure and functions of their target mRNA or ncRNAs and vice versa, and thus the cellular signaling pathways of the cells that control, for example, the birth and death of cells. It is thus important to discuss the status of genetic variations not only in coding RNA but also in the ncRNAs. While ncRNAs are important in the control of gene expression, it is unclear how these ncRNAs originate. Below, we discuss the role of genetic variation in creating variation in ncRNAs during biogenesis and how it affects the structure of ncRNAs.</p></sec><sec id="s2"><title>2. ROLE OF GENETIC VARIATION IN NCRNA BIOGENESIS</title><p>Each individual’s genome differs in its expression patterns from other members of the species, as well as between species [<xref ref-type="bibr" rid="scirp.24041-ref10">10</xref>]. Using a Markov model, one recent report suggests that humans and chimpanzees speciated 4.1 million years ago. Total human genome analysis reported that only &lt;2% of the total genome sequence has protein-coding capacity [<xref ref-type="bibr" rid="scirp.24041-ref11">11</xref>] . This implies that ~98% of the genome might have both coding and ncRNA producing genes, which might show expression variation in any given environment. These variations would likely impart changes in two major components: 1) it could affect the secondary structure of the encoded protein (as a result, LoF variant proteins could be generated as shown by MacArthur et al.), and 2) it could mis-target the ncRNAs by relaxing the base pairing consistency [<xref ref-type="bibr" rid="scirp.24041-ref12">12</xref>]. Such misparing might be more likely to affect the function of particular RNAs, and could also disturb cellular signaling pathways controlling gene expression, since ncRNAs regulate the expression of many genes [<xref ref-type="bibr" rid="scirp.24041-ref13">13</xref>]. It would be more interesting if genome sequence data also discussed the LoF variants of intronic sequences. This would help us understand whether the predicted LoF variants were generated by intronic variation. Recent evidence also suggests that certain sequences have the potential to generate ncRNAs with different functions (possibly through non-canonical ncRNA biogenesis pathways [<xref ref-type="bibr" rid="scirp.24041-ref14">14</xref>] ), which are important for cellular functions. This includes SnomicroRNA [<xref ref-type="bibr" rid="scirp.24041-ref15">15</xref>] and pi-sno-microRNA (x-ncRNA) [16,17], which are evolutionarily conserved. This leads to the intriguing question: how does a particular gene sequence produce structurally and functionally different ncRNAs (see <xref ref-type="fig" rid="fig1">Figure 1</xref>). The answer for this phenomenal question is currently unknown and the approaches to address this question are being recognized as an active area of investtigation. Whether or not the single-nucleotide polymerphism (SNP) variants of ncRNA could play crucial roles also remains to be clarified.</p><p>The primary ncRNA sequences are transcribed from DNA sequences [<xref ref-type="bibr" rid="scirp.24041-ref2">2</xref>] . It is not known how deletion, addition, repeats and other LoF variation in the DNA sequence of ncRNAs affect the structure of the primary transcript [<xref ref-type="bibr" rid="scirp.24041-ref18">18</xref>]. Genetic variation may play a critical rolenot only in the LoF of mRNAs but also in ncRNAs. Although experimental evidence suggests that several proteins (e.g. Dicer, RISC) are involved in the production of a particular ncRNA from precursor-ncRNA [19,20] , this evidence could not elucidate the secondary or tertiary structure of the primary transcripts. This is because of the stochasticity in the formation of the tertiary RNA structure which includes energetic formations (Gibbs theory of free energy function) of tertiary structure, chelation of divalent cations/multicharged ligand interaction in RNA sequences which are poorly understood [21,22] . Data analysis of LoF variants of coding genes confirm the presence of changes in the sequence of RNA and hence changes in the structure and function of the RNA. This phenomenon hints that ncRNA might be more susceptible to structural and functional changes due to changes in sequence [<xref ref-type="bibr" rid="scirp.24041-ref23">23</xref>] if the aforementioned RNA folding theories are true. It is thus also important to address the status of an ncRNA variant in the same environmental condition in any human genome data, since it might give a path to predict the biogenesis pathway of ncRNA families. For example, in the case of Cystic fibrosis, LoF variants of non-coding genes have deleterious effects [<xref ref-type="bibr" rid="scirp.24041-ref24">24</xref>] . Since most of the human genome accumulates and transmits genetic mutations over time, understanding the detailed status of the noncoding sequence mutations along with the genetic variation of coding sequences might help us overcome deleterious effects [<xref ref-type="bibr" rid="scirp.24041-ref25">25</xref>] and might allow us to see ongoing genetic evolution in this era.</p></sec><sec id="s3"><title>3. THEORIES OF GENETIC VARIATION, LOF AND SPECIATION</title><p>Theories on genetic variations have been discussed extensively for several decades. Recently, a report reviewed several proposed the theories of genetic variations [<xref ref-type="bibr" rid="scirp.24041-ref26">26</xref>] as: 1) “Less is more” hypothesis-advantageous effects of LoF variants; 2) “Less is less” hypothesis—deleterious effects of LoF variants; and 3) “less is nothing”—tolerated LoF variants situation. These theories describe the LoF of particular characteristics based on the comparison of populations or by tracking of an individual trait. The adaptation of a sub-population leads to the evolution of new species. This means that adopted characteristics are transferred to offspring which undergo natural selection, where the favorable traits are preserved allowing the sub-population to survive, which may then be subject to speciation. Further, Charles Darwin proposed that “each slight variation, if useful, is preserved” which determines how a sub-population adapts and evolves to form a new species in the process of natural selection [<xref ref-type="bibr" rid="scirp.24041-ref27">27</xref>].</p><p>A systematic survey of LoF variant genes using clinical sequencing data provides the status of individuals affected by a particular disease and infers that the selected individuals experience the deleterious effect by LoF variants in that particular environment [<xref ref-type="bibr" rid="scirp.24041-ref28">28</xref>]. While discussing the deleterious effects of genetic variation, it is important to extract more information. For example, it is not known whether these effects may become advantageous to the offspring, if transmitted. Further, the position of the mutation in the intronic gene sequence might play a significant role certain diseases. Most genomewide data lacks single nucleotide polymorphisms (SNPs) of introns. Several reports provide evidence that SNPs can affect the structure and function of ncRNAs, which can cause diseases such as cancer [<xref ref-type="bibr" rid="scirp.24041-ref29">29</xref>]. Providing this data will add much more value to understand how the “less is less hypothesis” plays a role in the selection of cells in a population by coordinating gene expression patterns via coding and ncRNAs.</p><p>The LoF variants in ncRNAs may play a significant role in the biogenesis of a particular type of ncRNA and hence lead to the production of multiple types of ncRNA from the same sequence. LoF variants in coding and ncRNA may thus control gene expression of cells in a given environment in a combinatorial manner for its survival. In human genome data, understanding the LoF variant function in ncRNA might therefore help us explore the real status of the gene expression system in a particular environment. This will allow us to study how the less is less (the role of deleterious effects in speciation) phenomenon occurs in the natural system.</p></sec><sec id="s4"><title>REFERENCES</title></sec><sec id="s5"><title>NOTES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.24041-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Crick F. Central dogma of molecular biology, Nature 1970;227:561-563. doi:10.1038/227561a0</mixed-citation></ref><ref id="scirp.24041-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple"> 
Mattick JS. Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms, Bioessays 2003;25:930-939. doi:10.1002/bies.10332</mixed-citation></ref><ref id="scirp.24041-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple"> 
Dignam JD, Lebovitz RM, Roeder RG. Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei, Nucleic Acids Res 1983;11:1475-1489. doi:10.1093/nar/11.5.1475</mixed-citation></ref><ref id="scirp.24041-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple"> 
Eddy SR. Non-coding RNA genes and the modern RNA world, Nat Rev Genet 2001;2:919-929. doi:10.1038/35103511</mixed-citation></ref><ref id="scirp.24041-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple"> 
Wilusz JE, Sunwoo H, Spector DL. Long noncoding RNAs: functional surprises from the RNA world, Genes Dev 2009;23:1494-1504. doi:10.1101/gad.1800909</mixed-citation></ref><ref id="scirp.24041-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple"> 
Mercer TR, Dinger ME, Mattick JS. Long noncoding RNAs: insights into functions, Nat Rev Genet 2009;10:155-159. doi:10.1038/nrg2521</mixed-citation></ref><ref id="scirp.24041-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple"> 
He L, Hannon GJ. MicroRNAs: small RNAs with a big role in gene regulation, Nat Rev Genet 2004;5:522-531. doi:10.1038/nrg1379</mixed-citation></ref><ref id="scirp.24041-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple"> 
McKnight SL, Kingsbury R. Transcriptional control signals of a eukaryotic protein-coding gene, Science 1982;217:316-324. doi:10.1126/science.6283634</mixed-citation></ref><ref id="scirp.24041-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple"> 
MacArthur DG, Balasubramanian S, Frankish A et al. A systematic survey of loss-of-function variants in human protein-coding genes, Science 2012;335:823-828. doi:10.1126/science.1215040</mixed-citation></ref><ref id="scirp.24041-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple"> 
Mayr E. Animal species and evolution. Cambridge,: Belknap Press of Harvard University Press, 1963.</mixed-citation></ref><ref id="scirp.24041-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple"> 
Hobolth A, Christensen OF, Mailund T et al. Genomic relationships and speciation times of human, chimpanzee, and gorilla inferred from a coalescent hidden Markov model, PLoS Genet 2007;3:e7. doi:10.1371/journal.pgen.0030007</mixed-citation></ref><ref id="scirp.24041-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple"> 
Ritz J, Martin JS, Laederach A. Evaluating our ability to predict the structural disruption of RNA by SNPs, BMC Genomics 2012;13 Suppl 4:S6. doi:10.1186/1471-2164-13-S4-S6</mixed-citation></ref><ref id="scirp.24041-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple"> 
Mattick JS, Makunin IV. Non-coding RNA, Hum Mol Genet 2006;15 Spec No 1:R17-29. doi:10.1093/hmg/ddl046</mixed-citation></ref><ref id="scirp.24041-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple"> 
Kim VN, Han J, Siomi MC. Biogenesis of small RNAs in animals, Nat Rev Mol Cell Biol 2009;10:126-139. doi:10.1038/nrm2632</mixed-citation></ref><ref id="scirp.24041-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple"> 
Scott MS, Avolio F, Ono M et al. Human miRNA Precursors with Box H/ACA snoRNA Features, Plos Computational Biology 2009;5.</mixed-citation></ref><ref id="scirp.24041-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple"> 
Kandhavelu M, Lammi C, Buccioni M et al. Existence of snoRNA, microRNA, piRNA characteristics in a novel non-coding RNA: x-ncRNA and its biological implication in Homo sapiens, Journal of Bioinformatics and Sequence Analysis 2009;1(2):31-40.</mixed-citation></ref><ref id="scirp.24041-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Kandhavelu M, Kandhavelu J. prepiRNA biogenesis mimics the pathway of miRNA. Biochemical Systematics and Ecology, 2012. doi:10.1016/j.bse.2012.03.012</mixed-citation></ref><ref id="scirp.24041-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple"> 
Eddy SR. Computational genomics of noncoding RNA genes, Cell 2002;109:137-140. doi:10.1016/S0092-8674(02)00727-4</mixed-citation></ref><ref id="scirp.24041-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple"> 
Hutvagner G, McLachlan J, Pasquinelli AE et al. A cellular function for the RNA-interference enzyme Dicer in the maturation of the let-7 small temporal RNA, Science 2001;293:834-838. doi:10.1126/science.1062961</mixed-citation></ref><ref id="scirp.24041-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple"> 
Gregory RI, Chendrimada TP, Cooch N et al. Human RISC couples microRNA biogenesis and posttranscriptional gene silencing, Cell 2005;123:631-640. doi:10.1016/j.cell.2005.10.022</mixed-citation></ref><ref id="scirp.24041-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple"> 
Draper DE. A guide to ions and RNA structure, Rna-a Publication of the Rna Society 2004;10:335-343.</mixed-citation></ref><ref id="scirp.24041-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple"> 
Draper DE. RNA folding: thermodynamic and molecular descriptions of the roles of ions, Biophys J 2008; 95:5489-5495. doi:10.1529/biophysj.108.131813</mixed-citation></ref><ref id="scirp.24041-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple"> 
Mathews DH, Sabina J, Zuker M et al. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure, J Mol Biol 1999;288:911-940. doi:10.1006/jmbi.1999.2700</mixed-citation></ref><ref id="scirp.24041-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple"> 
Chillon M, Dork T, Casals T et al. A novel donor splice site in intron 11 of the CFTR gene, created by mutation 1811+1.6kbA--&gt;G, produces a new exon: high frequency in Spanish cystic fibrosis chromosomes and association with severe phenotype, Am J Hum Genet 1995;56:623-629.</mixed-citation></ref><ref id="scirp.24041-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple"> 
Montgomery SB, Dermitzakis ET. From expression QTLs to personalized transcriptomics, Nat Rev Genet 2011;12:277-282. doi:10.1038/nrg2969</mixed-citation></ref><ref id="scirp.24041-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple"> 
Quintana-Murci L. Genetics. Gene losses in the human genome, Science 2012;335:806-807. doi:10.1126/science.1219299</mixed-citation></ref><ref id="scirp.24041-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple"> 
Darwin C. On the origin of species by means of natural selection. London,: J. Murray, 1859.</mixed-citation></ref><ref id="scirp.24041-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple"> 
Cairns J. Mutation selection and the natural history of cancer, Nature 1975;255:197-200. doi:10.1038/255197a0</mixed-citation></ref><ref id="scirp.24041-ref29"><label>29</label><mixed-citation publication-type="other" xlink:type="simple"> 
Garzon R, Calin GA, Croce CM. MicroRNAs in Cancer, Annu Rev Med 2009;60:167-179. doi:10.1146/annurev.med.59.053006.104707</mixed-citation></ref></ref-list></back></article>