BACKGROUND: Studies of the yeast protein interaction network have revealed distinct correlations between the connectivity of individual proteins within the network and the average connectivity of their neighbours. Although a number of biological mechanisms have been proposed to account for these findings, the significance and influence of the specific datasets included in these studies has not been appreciated adequately. RESULTS: We show how the use of different interaction data sets, such as those resulting from high-throughput or small-scale studies, and different modelling methodologies for the derivation pair-wise protein interactions, can dramatically change the topology of these networks. Furthermore, we show that some of the previously reported features identified in these networks may simply be the result of experimental or methodological errors and biases. CONCLUSION: When performing network-based studies, it is essential to define what is meant by the term "interaction" and this must be taken into account when interpreting the topologies of the networks generated. Consideration must be given to the type of data included and appropriate controls that take into account the idiosyncrasies of the data must be selecte

Hakes, Luke

Oliver, Stephen G

Robertson, David L

English

PubMed

Abstract Background Studies of the yeast protein interaction network have revealed distinct correlations between the connectivity of individual proteins within the network and the average connectivity of their neighbours. Although a number of biological mechanisms have been proposed to account for these findings, the significance and influence of the specific datasets included in these studies has not been appreciated adequately. Results We show how the use of different interaction data sets, such as those resulting from high-throughput or small-scale studies, and different modelling methodologies for the derivation pair-wise protein interactions, can dramatically change the topology of these networks. Furthermore, we show that some of the previously reported features identified in these networks may simply be the result of experimental or methodological errors and biases. Conclusion When performing network-based studies, it is essential to define what is meant by the term "interaction" and this must be taken into account when interpreting the topologies of the networks generated. Consideration must be given to the type of data included and appropriate controls that take into account the idiosyncrasies of the data must be selected</p

Robertson David L

Hakes Luke

Oliver Stephen G

Directory of Open Access Journals

BMC Genomics

Effect of dataset selection on the topological interpretation of protein interaction networks

Background: Studies of the yeast protein interaction network have revealed distinct correlations

between the connectivity of individual proteins within the network and the average connectivity of

their neighbours. Although a number of biological mechanisms have been proposed to account for

these findings, the significance and influence of the specific datasets included in these studies has

not been appreciated adequately.

Results: We show how the use of different interaction data sets, such as those resulting from highthroughput

or small-scale studies, and different modelling methodologies for the derivation pairwise

protein interactions, can dramatically change the topology of these networks. Furthermore,

we show that some of the previously reported features identified in these networks may simply be

the result of experimental or methodological errors and biases.

Conclusion: When performing network-based studies, it is essential to define what is meant by

the term "interaction" and this must be taken into account when interpreting the topologies of the

networks generated. Consideration must be given to the type of data included and appropriate

controls that take into account the idiosyncrasies of the data must be selected

Robertson, David L.

Oliver, Stephen G.

Enlighten

Enlighten: Publications

Luke Hakes

David L Robertson

Stephen G Oliver

Springer - Publisher Connector

Background: Studies of the yeast protein interaction network have revealed distinct correlations between the connectivity of individual proteins within the network and the average connectivity of their neighbours. Although a number of biological mechanisms have been proposed to account for these findings, the significance and influence of the specific datasets included in these studies has not been appreciated adequately. Results: We show how the use of different interaction data sets, such as those resulting from high-throughput or small-scale studies, and different modelling methodologies for the derivation pair-wise protein interactions, can dramatically change the topology of these networks. Furthermore, we show that some of the previously reported features identified in these networks may simply be the result of experimental or methodological errors and biases. Conclusions: When performing network-based studies, it is essential to define what is meant by the term "interaction" and this must be taken into account when interpreting the topologies of the networks generated. Consideration must be given to the type of data included and appropriate controls that take into account the idiosyncrasies of the data must be selected © 2005 Hakes et al., licensee BioMed Central Ltd

The University of Manchester - Institutional Repository

file:///data/remote/core/dit/data/Springer-OA/pdf/1ee/aHR0cDovL2xpbmsuc3ByaW5nZXIuY29tLzEwLjExODYvMTQ3MS0yMTY0LTYtMTMxLnBkZg==.pdf

Effect of dataset selection on the topological interpretation of protein interaction networks

Abstract

Similar works

Full text

Available Versions

Directory of Open Access Journals

Enlighten

Enlighten: Publications

Springer - Publisher Connector

The University of Manchester - Institutional Repository

Springer - Publisher Connector