Reference genome
The reference genome was generated using 68x PacBio sequencing reads, Bionano Genomics optical maps, 10X Genomics linked-reads and Arima Hi-C reads. This allowed us to scaffold the assembly to chromosome-level (Rhie et al. 2021) and we successfully assigned 99.3% of the assembled sequence to 25 identified autosomes, two sex chromosomes and the mitochondrial genome, leaving only 95 scaffolds unlocalised. The total length of the primary haplotype assembly was 1.23 Gbp with a contig N50 of 22.0 Mb and a scaffold N50 of 85.5 Mb. It included 96% complete assembled single copy genes according to BUSCO analysis, with only 1.3% fragmented, 0.4% falsely duplicated, and 2.3% missing (n = 8,338 genes). This represents a high-quality assembly, surpassing the aspired VGP contiguity metrics ~20-fold (Rhieet al 2021).