An obelisk is a microscopic genetic element that consists of a type of infectious agent composed of RNA. Described as a "viroid-like element", they consist of RNA in a circular rod shape without any protein shell coating.
Obelisks were identified in 2024 through computational analysis of vast genetic datasets. Their RNA sequences are entirely novel, and their placement within the tree of life remains uncertain as they do not appear to have a shared ancestry with any other life form, virus, or viroid. Obelisks are currently classified as an enigmatic taxon, forming a distinct phylogenetic group.
Obelisks were first described in a January 2024 preprint, by Stanford University scientists which sifted through genetic data. [1] Currently, only a few methods are available for the identification of these elements from NGS data. [2] The authors of the paper say that "Obelisks form their own distinct phylogenetic group", [3] [4] [1] as their RNA sequences, discovered by computer-aided metatranscriptomics, are not homologous with the genomic sequence of any other life form. [3] With their relationship to other organisms being unknown, they are an example of the incertae sedis , or "enigmatic taxa".
The authors of the January preprint named these sequences "obelisks" due to a predicted rod-like secondary structure: "At 1164 nt [ nucleotides ] in length, the rod-like secondary structure was striking [...]" [3]
Viroids were known to exist in plants and cause pathology, and there had been no evidence that they were in animals or bacteria. [5] This marks the first time a viroid or viroid-like object has been found in bacteria or animals. [1]
Obelisks have been found in human stool samples, and inside specimens of Streptococcus sanguinis , a species of bacteria, taken from human mouths. [6] Some human subjects hosted obelisks for more than 300 days. The initial study showed the presence of obelisks in about 7 percent of the stool samples, and about 50 percent of saliva samples, surveying individuals globally. [3] [7]
The effect of obelisks on human health, if any, is yet to be determined, [4] as are issues such as their life cycles, and what factors their replication depend on. [3]
Features of obelisks include circular RNA genome assemblies with around 1000 base pairs, and rod-like secondary structures that encompass the entire genome. In contrast to viroids, their RNA is translated into proteins, tentatively called "oblins" in the preprint. The two proteins listed there have been named Oblin-1 and Oblin-2. [3]
First structural predictions say that Oblin-1 can bind metal ions and thus could be involved in cellular signalling. Oblin-2 features a binding site which is typical of protein complexes, and might therefore bind to enzymes of its host cell. [4]
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA. The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as regulatory sequences, and often a substantial fraction of junk DNA with no evident function. Almost all eukaryotes have mitochondria and a small mitochondrial genome. Algae and plants also contain chloroplasts with a chloroplast genome.
Viroids are small single-stranded, circular RNAs that are infectious pathogens. Unlike viruses, they have no protein coating. All known viroids are inhabitants of angiosperms, and most cause diseases, whose respective economic importance to humans varies widely. A recent metatranscriptomics study suggests that the host diversity of viroids and other viroid-like elements is broader than previously thought and that it would not be limited to plants, encompassing even the prokaryotes.
Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, and ultimately affect a phenotype. These products are often proteins, but in non-protein-coding genes such as transfer RNA (tRNA) and small nuclear RNA (snRNA), the product is a functional non-coding RNA. The process of gene expression is used by all known life—eukaryotes, prokaryotes, and utilized by viruses—to generate the macromolecular machinery for life.
The human microbiome is the aggregate of all microbiota that reside on or within human tissues and biofluids along with the corresponding anatomical sites in which they reside, including the gastrointestinal tract, skin, mammary glands, seminal fluid, uterus, ovarian follicles, lung, saliva, oral mucosa, conjunctiva, and the biliary tract. Types of human microbiota include bacteria, archaea, fungi, protists, and viruses. Though micro-animals can also live on the human body, they are typically excluded from this definition. In the context of genomics, the term human microbiome is sometimes used to refer to the collective genomes of resident microorganisms; however, the term human metagenome has the same meaning.
Viral evolution is a subfield of evolutionary biology and virology that is specifically concerned with the evolution of viruses. Viruses have short generation times, and many—in particular RNA viruses—have relatively high mutation rates. Although most viral mutations confer no benefit and often even prove deleterious to viruses, the rapid rate of viral mutation combined with natural selection allows viruses to quickly adapt to changes in their host environment. In addition, because viruses typically produce many copies in an infected host, mutated genes can be passed on to many offspring quickly. Although the chance of mutations and evolution can change depending on the type of virus, viruses overall have high chances for mutations.
Lactobacillus acidophilus is a rod-shaped, Gram-positive, homofermentative, anaerobic microbe first isolated from infant feces in the year 1900. The species is commonly found in humans, specifically the gastrointestinal tract and oral cavity as well as some speciality fermented foods such as fermented milk or yogurt, though it is not the most common species for this. The species most readily grows at low pH levels, and has an optimum growth temperature of 37 °C. Certain strains of L. acidophilus show strong probiotic effects, and are commercially used in dairy production. The genome of L. acidophilus has been sequenced.
The Pospiviroidae are a incertae sedis family of ssRNA viroids with 5 genera and 39 species, including the first viroid to be discovered, PSTVd, which is part of genus Pospiviroid. Their secondary structure is key to their biological activity. The classification of this family is based on differences in the conserved central region sequence. Pospiviroidae replication occurs in an asymmetric fashion via host cell RNA polymerase, RNase, and RNA ligase. Its hosts are plants, specifically dicotyledons and some monocotyledons. The severity of the infection can vary from no effect to devastating and widespread damage to a population. This can also depend on the virus-host combination.
Computational genomics refers to the use of computational and statistical analysis to decipher biology from genome sequences and related data, including both DNA and RNA sequence as well as other "post-genomic" data. These, in combination with computational and statistical approaches to understanding the function of the genes and statistical association analysis, this field is also often referred to as Computational and Statistical Genetics/genomics. As such, computational genomics may be regarded as a subset of bioinformatics and computational biology, but with a focus on using whole genomes to understand the principles of how the DNA of a species controls its biology at the molecular level and beyond. With the current abundance of massive biological datasets, computational studies have become one of the most important means to biological discovery.
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protein-coding genes and non-coding genes. During gene expression, DNA is first copied into RNA. RNA can be directly functional or be the intermediate template for the synthesis of a protein.
Non-cellular life, also known as acellular life, is life that exists without a cellular structure for at least part of its life cycle. Historically, most definitions of life postulated that an organism must be composed of one or more cells, but, for some, this is no longer considered necessary, and modern criteria allow for forms of life based on other structural arrangements.
Mobile genetic elements (MGEs), sometimes called selfish genetic elements, are a type of genetic material that can move around within a genome, or that can be transferred from one species or replicon to another. MGEs are found in all organisms. In humans, approximately 50% of the genome are thought to be MGEs. MGEs play a distinct role in evolution. Gene duplication events can also happen through the mechanism of MGEs. MGEs can also cause mutations in protein coding regions, which alters the protein functions. These mechanisms can also rearrange genes in the host genome generating variation. These mechanism can increase fitness by gaining new or additional functions. An example of MGEs in evolutionary context are that virulence factors and antibiotic resistance genes of MGEs can be transported to share genetic code with neighboring bacteria. However, MGEs can also decrease fitness by introducing disease-causing alleles or mutations. The set of MGEs in an organism is called a mobilome, which is composed of a large number of plasmids, transposons and viruses.
16S ribosomal RNA is the RNA component of the 30S subunit of a prokaryotic ribosome. It binds to the Shine-Dalgarno sequence and provides most of the SSU structure.
In biology, a pathogen, in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ.
Biological dark matter is an informal term for unclassified or poorly understood genetic material. This genetic material may refer to genetic material produced by unclassified microorganisms. By extension, biological dark matter may also refer to the un-isolated microorganisms whose existence can only be inferred from the genetic material that they produce. Some of the genetic material may not fall under the three existing domains of life: Bacteria, Archaea and Eukaryota; thus, it has been suggested that a possible fourth domain of life may yet be discovered, although other explanations are also probable. Alternatively, the genetic material may refer to non-coding DNA and non-coding RNA produced by known organisms.
The human virome is the total collection of viruses in and on the human body. Viruses in the human body may infect both human cells and other microbes such as bacteria. Some viruses cause disease, while others may be asymptomatic. Certain viruses are also integrated into the human genome as proviruses or endogenous viral elements.
In virology, realm is the highest taxonomic rank established for viruses by the International Committee on Taxonomy of Viruses (ICTV), which oversees virus taxonomy. Six virus realms are recognized and united by specific highly conserved traits:
Serine/threonine-protein kinase RIO1 is an enzyme that in humans is encoded by the RIOK1 gene.
The Genome Taxonomy Database (GTDB) is an online database that maintains information on a proposed nomenclature of prokaryotes, following a phylogenomic approach based on a set of conserved single-copy proteins. In addition to resolving paraphyletic groups, this method also reassigns taxonomic ranks algorithmically, updating names in both cases. Information for archaea was added in 2020, along with a species classification based on average nucleotide identity. Each update incorporates new genomes as well as automated and manual curation of the taxonomy.
There are several models of the branching order of bacterial phyla, one of these is the Genome Taxonomy Database (GTDB).
Retrozymes are a family of retrotransposons first discovered in the genomes of plants but now also known in genomes of animals. Retrozymes contain a hammerhead ribozyme (HHR) in their sequences, although they do not possess any coding regions. Retrozymes are nonautonomous retroelements, and so borrow proteins from other elements to move into new regions of a genome. Retrozymes are actively transcribed into covalently closed circular RNAs and are detected in both polarities, which may indicate the use of rolling circle replication in their lifecycle.