> OpenGenome2 training data
> ... included representative prokaryotic genomes available through GTDB release v214.1, and curated phage and plasmid sequences retrieved through IMG/VR and IMG/PR
> Eukaryotic reference genomes ... were downloaded from NCBI
> Metagenomes [Durrant et al., 2024]
> Eukaryotic organelle genomes ... "NCBI Organelle" web resource
Where is the data from?
23andme?
Looks like publicly-available data.
> OpenGenome2 training data > ... included representative prokaryotic genomes available through GTDB release v214.1, and curated phage and plasmid sequences retrieved through IMG/VR and IMG/PR > Eukaryotic reference genomes ... were downloaded from NCBI > Metagenomes [Durrant et al., 2024] > Eukaryotic organelle genomes ... "NCBI Organelle" web resource