Gene3D - A database of CATH structural domain data projected onto the major sequence repositories. The files made available here are the copyright of Gene3D. Whilst we encourage you to download these files and to use them for research we do not permit the resale of this data and if you wish to display it within your own resource please acknowledge the latest NAR database issue publication. For any help please contact us at gene3d.contact@gmail.com Files ----- Release information: current_release_version.txt gives the current release number and the corresponding CATH release code. HMM files: * gene3d_hmmsearch.tar.gz - contains the HMMs and scripts to run the Gene3d assignmnet pipeline * hmms_no_dc.tar.gz - contains additional HMMs filtered to remove the discontinuous HMMs described in the Gene3D v16 paper. * model_to_family_map.csv.gz -contains HMM model ID to Domain Family mapping DomainAssignments: * all_domains.csv.gz -simple 3 column csv file with all Gene3D domain assignemnts columns are sequence_md5, domain_family, regions * representative_uniprot_genome_assignments.csv.gz - contains domain assignmnets for representative Uniprot Proteomes columns are domain_id, proteome_id,taxon_id, sequence_identifier, domain_family, sequence_regions, independent_evalue, domain_sequence PDB files: * models generated from the funmod pipeline for 10 organisms: v16_models.tar.gz