
CATH-Structural database in bioinformatics

Define the term CATH as used in structural databases in bioinformatics.


CATH stands for Class, Architecture, Topology and Homologous superfamily, the four main levels of its hierarchy:

C-CLASS – Determined by the secondary structure composition of the domain and how those secondary structures pack together. Three major classes are recognised: alpha, beta and alpha-beta. The last class includes both alternating alpha-beta and beta-alpha structures as well as alpha + beta structures.

A fourth class comprises proteins with low secondary structure content. CATH differs from SCOP in that it incorporates some automation in classifying protein structures, although it is not fully automated.

A-ARCHITECTURE – Classifies domains by their overall shape, as determined by the orientations of the secondary structures, while ignoring the connectivity between them. This level is currently assigned manually. For well-known architectures, reference is made to the literature.

T-TOPOLOGY – Similar to architecture, except that the connectivity of the secondary structures, and hence overall structural similarity, is also taken into account. Structures are compared using SSAP, developed by Taylor and Orengo in 1989, and CATHEDRAL, developed by Harrison et al.

Structures with an SSAP score of at least 70, where at least 60% of the larger protein matches the smaller protein, are assigned to the same fold.
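The fold-level rule above can be written as a small check. This is only a minimal sketch in Python: the function name `same_fold` and its parameters are hypothetical, and the SSAP score and overlap are assumed to have been computed elsewhere. It is not CATH's own code.

```python
def same_fold(ssap_score: float, overlap_percent: float) -> bool:
    """Return True when a pair of domains meets the fold-level thresholds.

    ssap_score      -- SSAP score for the pair (assumed to be on a 0-100 scale)
    overlap_percent -- percentage of the larger structure that is
                       equivalent to the smaller one
    """
    # Both conditions must hold: SSAP >= 70 and overlap >= 60%.
    return ssap_score >= 70.0 and overlap_percent >= 60.0
```

For example, `same_fold(82.5, 75.0)` passes both thresholds, while `same_fold(82.5, 40.0)` fails on overlap.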

Some fold groups are highly populated, particularly within the beta 2-layer sandwich and alpha-beta 3-layer sandwich architectures.

Other structure-based algorithms used by this database are DETECTIVE, PUU and DOMAK, which help to assign domain boundaries.

NOTE: Due to how secondary structures are interconnected, varying topologies can still result in the same overall architecture.

H-HOMOLOGOUS SUPERFAMILY – Groups protein domains that are thought to share a common ancestor. Similarities between them are identified by sequence comparison, including profile-based searches, and by structural comparison using SSAP. Domains are placed in the same superfamily if they satisfy one of the following criteria:

• Sequence identity >= 35%, overlap >= 60% of larger structure equivalent to smaller.
• SSAP score >= 80.0, sequence identity >= 20%, 60% of larger structure equivalent to smaller.
• SSAP score >= 70.0, 60% of larger structure equivalent to smaller, and domains which have related functions, which is informed by the literature and Pfam protein family database.
• Significant () similarity from HMM-sequence searches and HMM-HMM comparisons using SAM, HMMER and PRC
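The four criteria listed above can be sketched as a single check that reports which of them a pair of domains satisfies. This is an illustrative Python sketch, not CATH's implementation: the `DomainPair` class, its field names and `matched_criteria` are hypothetical. Because no HMM significance threshold is given above, the HMM result is passed in as a ready-made boolean, as is the judgement that two domains have related functions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DomainPair:
    """Precomputed comparison values for two domains (hypothetical container)."""
    seq_identity: float             # sequence identity, in percent
    overlap: float                  # % of larger structure equivalent to smaller
    ssap_score: float               # SSAP score for the pair
    related_function: bool = False  # judged from the literature and Pfam
    hmm_significant: bool = False   # significant HMM-sequence / HMM-HMM hit


def matched_criteria(pair: DomainPair) -> List[int]:
    """Return the numbers (1-4) of the criteria that the pair satisfies."""
    matched = []
    enough_overlap = pair.overlap >= 60.0
    if pair.seq_identity >= 35.0 and enough_overlap:
        matched.append(1)
    if pair.ssap_score >= 80.0 and pair.seq_identity >= 20.0 and enough_overlap:
        matched.append(2)
    if pair.ssap_score >= 70.0 and enough_overlap and pair.related_function:
        matched.append(3)
    if pair.hmm_significant:
        matched.append(4)
    return matched


def same_superfamily(pair: DomainPair) -> bool:
    """A pair qualifies if at least one criterion is met."""
    return bool(matched_criteria(pair))
```

Returning the list of matched criteria, rather than a bare yes or no, makes it easier to see why a given pair was grouped.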
