Genetic barcode reveals evolutionary route of new coronavirus

  In a recent study, researchers from Drexel University reported a method that can quickly identify and flag mutations of the COVID-19 virus. Their analysis using information in the global genetic information database collected from coronavirus tests showed that there are at least 8 to 14 different virus mutant strains in the United States, some of which are the same as before, while others are from Europe.

  The genetic analysis tool was originally developed as a method to analyze genetic samples to obtain a snapshot of bacterial mixtures. It can find patterns from a large amount of genetic information and identify whether the virus has been genetically altered. They can then use this model to classify viruses with small genetic differences through tags called informative subtype markers (ISM).

  The study was recently published in "PLoS Computational Biology", using the same method to process virus genetic data can quickly detect and classify the subtle genetic variation in SARS-CoV-2. The tool was designed by Zhengzhen Zhao, a researcher at Drexel University, and is publicly available on GitHub.

  Dr. Gail Rosen, a professor at the Drexel School of Engineering, said: “The types of SARS-CoV-2 viruses from Asia and Europe that we saw in our tests are different from those found in the United States. “Identifying mutations allows us to see the virus in different populations. The way it spreads between. It can also show us areas where COVID-19 has been successfully isolated through social distancing. "

  This tool called ISM is particularly useful because it does not need to analyze the complete gene sequence of the virus to identify its mutations. For SARS-CoV-2, this means reducing the virus’s 30,000 base-long genetic code to a 20-base-long tag.

  The ISM tool also identified certain positions in the genetic sequence of the virus, which changed along with the spread of the virus. Researchers found that from early April to late summer, three positions in the SARS-CoV-2 sequence were mutated at the same time. These positions are in different parts of the genetic sequence. Part of it is thought to be related to cell signaling and replication. The other part is related to the formation of spike protein.

  Researchers said that although more research is needed on how these simultaneous mutations affect virus transmission and severity, the subtype tag can be integrated into 11 bases using the changed sites, which can make downstream analysis more effective.

  In addition to helping scientists understand how the virus changes and spreads, this method can also reveal parts of its genetic code that still seem to be resistant to mutations. This discovery helps develop therapies to fight the virus.