Preprint Article Version 2 Preserved in Portico This version is not peer-reviewed

Witnessing Evolution of SARS-CoV-2 through Comparative Phylogenomics: The Proximate Origin is Guangdong, not Wuhan

Version 1 : Received: 19 May 2020 / Approved: 21 May 2020 / Online: 21 May 2020 (03:27:52 CEST)
Version 2 : Received: 17 June 2020 / Approved: 21 June 2020 / Online: 21 June 2020 (16:10:26 CEST)

How to cite: Doğan, Ö.; Korkmaz, E.M.; Budak, M.; Çıplak, B.; Başıbüyük, H.H. Witnessing Evolution of SARS-CoV-2 through Comparative Phylogenomics: The Proximate Origin is Guangdong, not Wuhan. Preprints 2020, 2020050332. https://doi.org/10.20944/preprints202005.0332.v2 Doğan, Ö.; Korkmaz, E.M.; Budak, M.; Çıplak, B.; Başıbüyük, H.H. Witnessing Evolution of SARS-CoV-2 through Comparative Phylogenomics: The Proximate Origin is Guangdong, not Wuhan. Preprints 2020, 2020050332. https://doi.org/10.20944/preprints202005.0332.v2

Abstract

A new form of coronavirus called severe acute respiratory disease coronavirus type 2 (SARS-CoV-2) is currently causing a pandemic. A six-month evolutionary history of SARS-CoV-2 is witnessed by characterising the total genome of 821 samples using comparative phylogenomic approaches. Our analyses produced striking inclusive results that may guide scientists/professionals for the past/future of pandemic. Phylogenetic and time estimation analyses suggest the proximate origin of pandemic strain as Guangdong and the origin time as first half of September 2019, not Wuhan and December 2019, respectively. The viral genome experienced a substitution rate similar to other RNA viruses, but it is particularly high in some of the peptides encoding sequences such as leader protein, E gene, orf8, orf10, nsp10, N gene, S gene and M gene and nsp4, while low in nsp11, orf7a, 3C-like proteinase, nsp9, nsp8 and endoRNase. Most strikingly, the divergence rate of amino acid sequences is high proportional to nucleotide divergence. Additionally, specific non-synonymous mutations in nsp3 and nsp6 evolved under positive selection. The exponential growth rate (r), doubling time (Td) and R0 were estimated to be 47.43 per year, 5.39 days and 2.72, respectively. Comparison of synapomorphies distinguishing the SARS-CoV-2 and the candidate ancestor bat coronavirus indicates that mutation pattern in nsp3 and S gene enabled the new strain to invade human and become a pandemic strain. We arrive at the following main conclusions: (i) six months evolution of viral genome is nearly neutral, (ii) origin of pandemic is not Wuhan and predates formal reports, (iii) although viral population is ongoing an exponential growth, the doubling time is evolving towards shortening, and (iv) divergence rate of total genome is similar to other RNA viruses, but it is prominently high in some genes while low in some others and evolution in these genes should be closely monitored as their protein products intervening to pathogenicity, virulence and immune response.

Keywords

coronavirus; substitution rate; positive selection; demographic dynamics

Subject

Biology and Life Sciences, Biochemistry and Molecular Biology

Comments (1)

Comment 1
Received: 21 June 2020
Commenter: Ertan Mahir Korkmaz
Commenter's Conflict of Interests: Author
Comment: This version of the manuscript is revised as follows:
1. The overall manuscript is revised without changing the findings.
2. The title is changed as "Witnessing evolution of SARS-CoV-2 through comparative phylogenomics: The proximate origin is Guangdong, not Wuhan".
3. Aim paragraph is revised in the Introduction section.
4. The section of Result and Discussion is separated to two sections.
5. Some of figures and tables are changed to improve presentation.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.