What are the baselines for protein fold recognition?

L J McGuffin, K Bryson, D T Jones

Bioinformatics 2001 January

MOTIVATION: What constitutes a baseline level of success for protein fold recognition methods? As fold recognition benchmarks are often presented without any thought to the results that might be expected from a purely random set of predictions, an analysis of fold recognition baselines is long overdue. Given varying amounts of basic information about a protein, ranging from the length of the sequence to a knowledge of its secondary structure, to what extent can the fold be determined by intelligent guesswork? Can simple methods that make use of secondary structure information assign folds more accurately than purely random methods, and could these methods be used to construct viable hierarchical classifications?

EXPERIMENTS PERFORMED: A number of rapid automatic methods which score similarities between protein domains were devised and tested. These methods ranged from those that incorporated no secondary structure information, such as measuring absolute differences in sequence lengths, to more complex alignments of secondary structure elements. Each method was assessed for accuracy by comparison with the Class Architecture Topology Homology (CATH) classification. Methods were rated against both a random baseline fold assignment method as a lower control and FSSP as an upper control. Similarity trees were constructed in order to evaluate the accuracy of optimum methods at producing a classification of structure.
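As a rough illustration of the simplest kind of method mentioned above, the sketch below scores two domains by the absolute difference in their sequence lengths and ranks a small library against a query. The function names, the dictionary layout and the example lengths are hypothetical and are not taken from the paper; the abstract does not say how scores were normalised or how ties were handled.

```python
def length_difference_score(len_a: int, len_b: int) -> int:
    """Similarity from sequence length alone (assumed form).

    The absolute difference is negated so that a higher score
    means "more similar", matching the ranking convention below.
    """
    return -abs(len_a - len_b)


def rank_by_length(query_len: int, library: dict) -> list:
    """Rank library domains by length similarity to the query.

    `library` maps a hypothetical domain identifier to its length.
    Returns identifiers, most similar first; ties keep insertion order.
    """
    return sorted(
        library,
        key=lambda domain: length_difference_score(query_len, library[domain]),
        reverse=True,
    )


if __name__ == "__main__":
    # Made-up lengths, for illustration only.
    library = {"domA": 100, "domB": 150, "domC": 98}
    print(rank_by_length(101, library))
```

A method like this uses no structural information at all, which is why it is useful only as a control: any fold recognition method worth reporting should rank true fold partners higher than length matching does.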

RESULTS: Using a rigorous comparison of methods with CATH, the random fold assignment method set a lower baseline of 11% true positives allowing for 3% false positives, and FSSP set an upper benchmark of 47% true positives at 3% false positives. The optimum secondary structure alignment method used here achieved 27% true positives at 3% false positives. Using a less rigorous Critical Assessment of Structure Prediction (CASP)-like sensitivity measurement, the random assignment achieved 6%, FSSP 59% and the optimum secondary structure alignment method 32%. Similarity trees produced by the optimum method illustrate that these methods cannot be used alone to produce a viable protein structural classification system.
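Figures of the form "X% true positives at 3% false positives" can be read as a point on a ranked-score curve. The sketch below shows one plausible way to compute such a figure from scored domain pairs; it is an assumption about the general procedure, not the authors' evaluation code. The function name, the pair labelling (True when two domains share a fold) and the example data are hypothetical.

```python
def true_positive_rate_at_fpr(scores, labels, max_fpr=0.03):
    """Fraction of same-fold pairs recovered before the false
    positive rate exceeds `max_fpr`.

    scores: similarity score per domain pair (higher = more similar).
    labels: True if the pair shares a fold in the reference
            classification, False otherwise.
    Pairs are processed one at a time in descending score order;
    tied scores are not grouped (a simplification).
    """
    n_pos = sum(1 for is_same in labels if is_same)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative pair")

    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    tp = fp = 0
    best_tpr = 0.0
    for _score, is_same in ranked:
        if is_same:
            tp += 1
        else:
            fp += 1
        if fp / n_neg > max_fpr:
            break  # allowed false positive budget exhausted
        best_tpr = tp / n_pos
    return best_tpr


if __name__ == "__main__":
    # Made-up scores and labels, for illustration only.
    scores = [0.9, 0.8, 0.7, 0.6, 0.5]
    labels = [True, False, True, True, False]
    print(true_positive_rate_at_fpr(scores, labels, max_fpr=0.5))
```

Running the same function on scores from a random assignment, a simple method and a structure comparison method gives directly comparable lower, middle and upper figures, which is the kind of comparison the results above report.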

CONCLUSIONS: Simple methods that use perfect secondary structure information to assign folds cannot produce an accurate protein taxonomy; however, they do provide useful baselines for fold recognition. In terms of a typical CASP assessment, our results suggest that approximately 6% of targets with folds in the databases could be assigned correctly by randomly guessing, and as many as 32% could be recognised by trivial secondary structure comparison methods, given knowledge of their correct secondary structures.
