Scientists in Houston on Wednesday released a study of more than 5,000 genetic sequences of the coronavirus, which reveals the virus’s continual accumulation of mutations, one of which may have made it more contagious.
That mutation is associated with a higher viral load among patients upon initial diagnosis, the researchers found.
The study, which has not been peer-reviewed, was posted Wednesday on the preprint server medRxiv. It appears to be the largest single aggregation of genetic sequences of the virus in the United States. Scientists in the United Kingdom published a larger batch of sequences this month and, like the Houston researchers, concluded that a mutation that changes the structure of the “spike protein” on the surface of the virus may be driving the outsize spread of that strain.
The new report did not find that these mutations have made the virus deadlier. All viruses accumulate genetic mutations, and most are insignificant, scientists say. Coronaviruses such as SARS-CoV-2, which causes the illness COVID-19, are relatively stable as viruses go, because they have a proofreading mechanism as they replicate.
But every mutation is a roll of the dice, and with transmission so widespread in the United States — which continues to see tens of thousands of new, confirmed infections daily — the virus has had abundant opportunities to change, potentially with troublesome consequences, said study author James Musser of Houston Methodist Hospital.
“We have given this virus a lot of chances,” Musser told The Washington Post. “There is a huge population size out there right now.”
Scientists from Weill Cornell Medicine, the University of Chicago, Argonne National Laboratory and the University of Texas at Austin also contributed to the study.
David Morens, a virologist at the National Institute of Allergy and Infectious Diseases (NIAID), reviewed the new study and said the findings point to the likelihood that the virus, as it has moved through the population, has become more transmissible, and that this “may have implications for our ability to control it.”
Morens noted that this is a single paper, and that “you don’t want to over-interpret what this means.” But the virus, he said Wednesday, could potentially be responding – through random mutations – to such interventions as mask-wearing and social distancing.
“Wearing masks, washing our hands, all those things are barriers to transmissibility, or contagion, but as the virus becomes more contagious it statistically is better at getting around those barriers,” said Morens, senior adviser to Anthony Fauci, the director of the NIAID.
This has implications for the formulation of vaccines, he said. As people gain immunity, either through infections or a vaccine, the virus could be under selective pressure to evade the human immune response.
“Although we don’t know yet, it is well within the realm of possibility that this coronavirus, when our population-level immunity gets high enough, this coronavirus will find a way to get around our immunity,” Morens said. “If that happened, we’d be in the same situation as with flu. We’ll have to chase the virus and, as it mutates, we’ll have to tinker with our vaccine.”
At Houston Methodist, whose main hospital is part of the Texas Medical Center in central Houston and which includes other hospitals in the area, scientists have been sequencing the 30,000-character genome of the coronavirus since early March, when the virus appears to have first arrived in the metropolitan area of 7 million people. The paper documents 5,085 sequences.
The research shows that the virus moved through Houston neighborhoods in two waves, first striking wealthier and older individuals but then spreading, in the second wave, to younger people and lower-income neighborhoods – affecting many Latino residents.
At the same time, as the virus spread Zip code by Zip code, it compiled mutations, many affecting the spike protein. That structure on the surface of the virus, which resembles a tree decked with curled ribbons, enables the virus to enter cells.
The genetic data shows that the virus arrived in Houston many times, presumably at first by air travel. Notably, 71% of the viruses that arrived initially carried a now scientifically famous mutation, which appears to have originated in China, that scientists increasingly suspect may give the virus a biological advantage in how it spreads. It is called D614G, referring to the replacement of an amino acid called aspartic acid (D) with one called glycine (G) at position 614 of the spike protein, the result of a change in the region of the genome that encodes that protein.
The study found that by the second wave of the outbreak in Houston, this variant had leaped to 99.9% prevalence, completing its domination of the outbreak. The researchers found that people infected with the strain had higher loads of virus in their upper respiratory tracts, a potential factor in making the strain spread more effectively.
Kristian Andersen, an immunologist at the Scripps Research Institute in California, who was not involved in the new research, downplayed the significance of the new study. He said it “just confirms what has already been described – G increased in frequency over time.” As for the numerous other mutations the study finds, “they just catalogue them, but we don’t know if any of them have any functional relevance.”
Musser said D614G has been increasingly dominant in Houston and other areas because it is better adapted to spreading among humans. He acknowledged that the scientific case is not closed on this matter.
“This isn’t a murder trial,” Musser said. “We’re not looking for beyond a reasonable doubt. This is a civil trial, and clearly, it’s the preponderance of the evidence that I think forces all of us into the same conclusion, which is there’s something biologically different about that strain, that family of strains.”
Recently, the even larger study of the spread of the coronavirus in the United Kingdom, based on about 25,000 genomes, also found evidence that this variant of the virus outdistances its competitors “in a manner consistent with a selective advantage.”
In general, scientists would expect natural selection to favor mutations that help the virus spread more effectively – since that allows it to make more copies of itself – but not necessarily ones that make it more virulent. Killing or incapacitating the host would generally not help the virus spread to more people.
The study found 285 separate mutation sites that actually change a physical building block of the spike protein, which is the most important part of the coronavirus in the sense that it is what allows it to infect and harm humans. Forty-nine of the changes at these sites had not been seen before in other genomes sequenced around the world.
The study characterizes some of the spike protein mutations as “disconcerting.” While the paper does not present strong proof that any additional evolution of the spike protein is occurring, it suggests that these repeated substitutions provide a hint that, as the virus interacts with our bodies and our immune systems, it may be learning new tricks that help it respond to its host.
“I think there’s pretty good evidence that’s consistent with immunologic selection acting on certain regions of the spike protein,” Musser said.
The actual mutations in the virus occur randomly as it makes mistakes trying to copy its genome within our cells. But every new case gives a chance for more mutations to occur, which increases the chance that one of these mutations will be useful to the virus, just as D614G apparently already has been.
The Washington Post’s Sarah Kaplan and Aaron Steckelberg contributed to this report.