MSPKmerCounter has been extensively evaluated on several real-world DNA sequence datasets. Here we provide links to the datasets used in our paper.
Budgerigar (bird)
This dataset contains the sequence data of the Budgerigar (bird) from the Assemblathon website (http://assemblathon.org). You can click here to browse the dataset. Thanks to BGI for providing this dataset.
Lake Malawi cichlid (fish)
This dataset contains the sequence data of the Lake Malawi cichlid (fish), also from the Assemblathon website (http://assemblathon.org). You can click here to browse the dataset. Thanks to the Broad Institute for providing this dataset.
Red-tailed boa constrictor (snake)
This dataset contains the sequence data of the Red-tailed boa constrictor (snake), also from the Assemblathon website (http://assemblathon.org). You can click here to browse the dataset. Thanks to Illumina for providing this dataset.
Soybean
This dataset contains the sequence data of Soybean, provided by BGI. You can click here to browse the dataset. Thanks to BGI for providing this dataset.