KEGG Taxonomy Files and Browser
The KEGG database uses the NCBI taxonomy for classification of cellular organisms and viruses. For cellulcar organisms, the three- or four-letter KEGG organism codes are classified somewhat differently in the following Brite hierarchy files.
08601 is manually created to define the order of organism codes with hsa (Homo sapiens) at the top. 08610 is computationally generated using the abbreviated lineage of the NCBI taxonomy keeping the order of organism codes defined in 08601. In addition, 08610 contains taxonomy IDs for GENES Addendum (ag) entries. 08611 is another computationally generated file for the KEGG organisms with fixed levels of taxonomic ranks: phylum, class, order, family, genus and species.
For viruses, the taxonomy IDs of KEGG Viruses (GENOME vtax category and GENES vg category) are classified according to the NCBI taxonomy, which is based on the ICTV taxonomy, with the Baltimore classification added by KEGG. Both of these Brite hierarchy files are computationally generated and the lowest-level taxonomy IDs are linked to GENOME vtax entries. In the 08620 file the taxonomy IDs are shown in the full lineage of NCBI virus taxonomy, while the 08621 file is organized in the fixed levels of taxonomic ranks: realm, kingdom, phylum, class, order, family, genus and species.
KEGG Taxonomy Browser is implemented as the Brite hierarchy viewer for the taxonomy files shown above. The files of 08611 for cellular organisms and 08621 for viruses are used as default. The browser has a zooming capability to adjust the bottom level of the taxonomic tree, for example, family or class in eukaryotes and species or genus in prokaryotes.
For viruses, the taxonomy IDs of KEGG Viruses (GENOME vtax category and GENES vg category) are classified according to the NCBI taxonomy, which is based on the ICTV taxonomy, with the Baltimore classification added by KEGG. Both of these Brite hierarchy files are computationally generated and the lowest-level taxonomy IDs are linked to GENOME vtax entries. In the 08620 file the taxonomy IDs are shown in the full lineage of NCBI virus taxonomy, while the 08621 file is organized in the fixed levels of taxonomic ranks: realm, kingdom, phylum, class, order, family, genus and species.
KEGG Taxonomy Browser is implemented as the Brite hierarchy viewer for the taxonomy files shown above. The files of 08611 for cellular organisms and 08621 for viruses are used as default. The browser has a zooming capability to adjust the bottom level of the taxonomic tree, for example, family or class in eukaryotes and species or genus in prokaryotes.
Taxonomy Mapping
Taxonomy mapping is a method to integrate various biological data, especially for integrating genomic features and organism-level features. The following tool displays the taxonomic distributions of KOs (K numbers) and modules (M numbers) as genomic features, optionally combined with user-defined data such as for phenotypic features using the Join operation of KEGG Mapper.
Virus Taxonomy Mapping
Last updated: April 1, 2024