FIGURE 1. Network representation for 869 protein sequences of the “PETase core domain” linked by 318,773 edges. The protein sequences depicted here were selected by clustering at a threshold of 90% sequence identity. Edges (links) were selected at a threshold of 55% sequence similarity. Nodes are coloured according to their annotated source organisms, with Actinobacteria in red ⬤, Proteobacteria in blue ⬤, Fungi in cyan ⬤, Bacteroidetes in orange ⬤, other bacteria from the FCB group in yellow ⬤, Planctomycetes in green ⬤, and unknown bacteria coloured in white ○. See Methods section for more details on the network layout. Supplementary figures are available inFigures S2 and S3 .