Figure 7. (A) Target T1169, a mosquito protein relevant to pathogen transmission (PDB: 8FJP), with four evaluation units defined: D1: 1-345; D2: 1302-2735; D3: 378-699,1223-1301; D4: 700-1222. (B) Parsing of SGS1 into domains as suggested by the authors of the structure 28. (C) Top HHsearch hits showing similarity of the query sequence to known folds in two areas: 395-670 (intermediate domain between the two beta-propellers; see panel B) and 1718-2735 (region after the lectin-CRD domain and up to the TM domain).
3.1.4 | Targets that were split into more EUs than suggested by Grishin plots
Two targets that the domain parsers suggested were single-domain (T1137s2 and T1137s3) were split into two domains for consistency with the other subunits of the same heteromeric complex. Target H1137 (PDB: 8FEF) is a hetero 9-mer in which six subunits form an intertwined obligatory complex. The split was made in agreement with the results of template searches and with the splits of the other related subunits.
Another target, T1125, was split into six domains instead of the five suggested by the domain parsers. In this target, the C-terminal region penetrates the N-terminal part, forming one structural domain, but predictors were unable to model the circular fold of the protein. Thus, for the evaluation, the N-terminal domain (#1) and the C-terminal domain (#6) were considered separately.
3.1.5 | Domain swaps
Four targets in CASP15 included domains involved in domain swaps: T1109, T1113, T1120 and T1176. Target T1120 was discussed above (Section 3.1.3). The remaining three targets were un-swapped, and models were evaluated against both the swapped and un-swapped versions of the targets. For T1109 and T1113, models scored higher against the original (swapped) version, and thus the original targets were used for the final evaluation; for T1176, the evaluation scores were higher for the un-swapped version, and that version was used as the target (T1176-D9: A1-138 + B139-170).
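The selection between swapped and un-swapped references can be illustrated with a minimal Python sketch. It is not the assessors' actual pipeline: the function names (`parse_segments`, `select_residues`, `pick_reference`) are hypothetical, the segment notation is taken from the T1176-D9 definition above, and comparing the mean of per-model scores is an assumption, since the text states only that models "scored higher" against one version.

```python
import re
from statistics import mean


def parse_segments(spec):
    """Parse a definition such as 'A1-138 + B139-170' into (chain, start, end) tuples."""
    segments = []
    for part in spec.split("+"):
        match = re.fullmatch(r"\s*([A-Za-z])(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"cannot parse segment: {part!r}")
        chain, start, end = match.group(1), int(match.group(2)), int(match.group(3))
        if start > end:
            raise ValueError(f"segment start exceeds end: {part!r}")
        segments.append((chain, start, end))
    return segments


def select_residues(segments):
    """Expand segments into an ordered list of (chain, residue_number) pairs."""
    return [
        (chain, number)
        for chain, start, end in segments
        for number in range(start, end + 1)
    ]


def pick_reference(scores_swapped, scores_unswapped):
    """Return which target version the models fit better.

    Assumption: 'scored higher' is approximated by the mean per-model score.
    Ties keep the original (swapped) target.
    """
    if mean(scores_unswapped) > mean(scores_swapped):
        return "unswapped"
    return "swapped"


# Un-swapped evaluation unit built from two chains, as in T1176-D9.
unit = select_residues(parse_segments("A1-138 + B139-170"))
print(len(unit), unit[0], unit[-1])

# Illustrative scores only; these are not CASP15 results.
print(pick_reference([50.0, 60.0], [55.0, 70.0]))
```

Under these assumptions, the un-swapped unit takes residues 1-138 from chain A and 139-170 from chain B, so that the evaluated domain is compact rather than exchanged between chains.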