2.7. Frequency analysis of mutations in SARS-CoV-2 spike glycoprotein corresponding to the mutated mAb epitope residues
Data from the GISAID hCoV-19 spike glycoprotein mutation surveillance dashboard was obtained for all spike protein variations in SARS-CoV-2. A total of 81,79,987 SARS-CoV-2 spike sequences (updated on February 19, 2022, by Raphael Tze Chuen Lee, GISAID) were compared to the reference sequence EPI_ISL_402124 for the annotation of individual mutations (mutation data obtained from GISAID). Among the spike glycoprotein mutations, the residues that are mAb binding sites as well as mutated in VOCs were analysed for naturally occurring mutations. The occurrence reported for each such mutation was divided by the total number of sequences (n=8179987) and converted into percentage. The mutations occurring at these particular residues with a frequency greater than 0.01% were selected for plotting. A stacked plot was generated with series in ascending order of frequency percentage values. Y-axis represents the frequency in logarithmic scale while X-axis represents the mutated mAb binding sites in the spike glycoprotein.