Beginning in mid-February 2008, the 1997-2007 online version of the Science Watch® newsletter, ESI-Topics.com, and in-cites.com, will all be featured together on the redesigned ScienceWatch.com. All previous content from the three sites will be permanently archived, and remain accessible from any existing bookmarks to the archived pages. No new content will be added to this site. Updates and new content (updated biweekly) are available at ScienceWatch.com now.

New Hot Paper Comments

By Jennifer Gardy

ESI Special Topics, July 2006
Citing URL - http://www.esi-topics.com/nhp/2006/july-06-JenniferGardy.html

Jennifer Gardy answers a few questions about this month's new hot paper in the field of Computer Science.


From •>>July 2006

Field: Computer Science
Article Title: PSORTb v.2.0: Expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis
Authors: Gardy, JL;Laird, MR;Chen, F;Rey, S;Walsh, CJ;Ester, M;Brinkman, FSL
Journal: BIOINFORMATICS
Volume: 21
Issue: 5
Page: 617-623
Year: MAR 1 2005
* Simon Fraser Univ, Dept Mol Biol & Biochem, Burnaby, BC V5A 1S6, Canada.
* Simon Fraser Univ, Dept Mol Biol & Biochem, Burnaby, BC V5A 1S6, Canada.
* Simon Fraser Univ, Dept Comp Sci, Burnaby, BC V5A 1S6, Canada.

ST:  Why do you think your paper is highly cited?

This paper reports a more accurate—or sensitive—version of the program PSORTb, the most precise computational method for the prediction of bacterial protein subcellular localization. Such predictions are a key component of genome-based studies of bacteria, and an important step in the prioritization of possible new drug and vaccine targets against bacterial pathogens. Our paper also describes the first version of PSORTb that can make predictions for all bacteria.


“Perhaps most interesting to the biomedical research community is that fact that by using our tool, a researcher can identify surface-exposed proteins encoded in a genome very rapidly – roughly 100 times faster than using high-throughput laboratory methods.”

As bioinformaticians, we are writing for two audiences—the computer science community and the biology community. In the case of this particular paper, the method we report represents not only an important computational advance in the use of machine-learning methods for classification problems, but it also describes a tool that the microbiologist can implement to great effect in their everyday research.

Scientists from both of these fields cite our paper in their work—computer scientists refer to it when discussing approaches to biological classification problems, while microbiologists cite it in papers describing the characterization of particular proteins or even whole genomes. I think this speaks to the success of our interdisciplinary approach to research. By building a team combining the expertise of both computer scientists and biologists, we not only created a final product whose performance exceeded that of any tool we would have developed, but we also created something that has been well-received by a very diverse audience.

ST:  Does it describe a new discovery, methodology, or synthesis of knowledge?

Our paper describes PSORTb v.2.0—an updated method for the prediction of protein subcellular localization in bacteria. In 2003, we released the first version of PSORTb, which implements a multi-component approach to prediction. It uses several different computational techniques to identify amino acid sequence features that are known to influence protein localization—signal peptides, transmembrane secondary structures, or sequence motifs, for example. In PSORTb v.2.0, however, we expanded the program’s predictive capability by introducing a new type of analysis, which combines data mining and support vector machine (SVM).

Previously, methods implementing SVM for biological classification had looked at proteins only in terms of their overall amino acid composition. While this does work, it doesn’t attain very high precision—amino acid composition can be thought of as a "low-resolution" characteristic of a protein, and one ends up with a fairly significant number of false positives. What our group did, however, was to look at proteins in terms of their structural and functional motifs.

By using data mining techniques to extract "frequent subsequences"—amino acid motifs that occur frequently in proteins found in a particular cellular compartment—we were able to use these frequent subsequences as the basis for classification. These motifs can be thought of as a "high-resolution" characteristic, and they result in much higher precision.

ST:  Could you summarize the significance of your paper in layman’s terms?

The software we’re describing in our paper takes a protein sequence—a string of text—as input, and returns a prediction of where in a bacterial cell that protein might be located—on the surface, inside, or in a membrane, for example. Our method is the most precise tool available for doing this, with a false positive rate of only 4%.

Microbiologists can now use this tool to generate highly confident predictions of protein localization—information that has a range of potential applications. Perhaps most interesting to the biomedical research community is the fact that by using our tool, a researcher can identify surface-exposed proteins encoded in a genome quite rapidly—roughly 100 times faster than using high-throughput laboratory methods.

These surface-exposed proteins are key players in the development of new drugs and vaccines. Thus our software is potentially able to speed up the discovery of new therapeutic targets significantly, a process which normally takes years.

ST:  How did you become involved in this research, and were there obstacles along the way?

The Brinkman Lab (Simon Fraser University, BC, Canada), which led development of PSORTb, has always been interested in combining bioinformatics with pathogenomics—the study of pathogenic organisms at the genome level. A secondary avenue of research that stems from Principal Investigator Fiona Brinkman’s earlier work is the analysis of bacterial outer membrane proteins. These proteins not only have a beautiful barrel shape, but they are also of significant medical importance—they are members of the group of surface-exposed proteins that represent drug and vaccine targets mentioned earlier.

The PSORTb project came about in 2001, when Fiona realized that the sole existing method for bacterial protein localization prediction, PSORT I, performed poorly at identifying this group of proteins. After a bit of investigating, she realized that the method had a number of other significant limitations, and so, with the permission of PSORT I developer Kenta Nakai of the University of Tokyo, she set out to develop a new localization prediction method.

I undertook the project as my PhD research in the Brinkman Lab, and over the next few years, we built up a team of collaborators from different fields, different institutions, and different countries. Although I’ve now moved on to a position as a postdoc at a new lab, Fiona and her lab are continuing development of PSORTb, and are also working on a version that makes predictions for yet more organisms.

By recognizing that we could not take on a project of PSORTb’s magnitude on our own and seeking out the best possible collaborators, we managed to avoid obstacles and roadblocks, and the development of PSORTb v.1.0 and 2.0 proceeded quite quickly and smoothly. Perhaps one of the biggest challenges in an interdisciplinary collaboration is effective communication between researchers from very different fields, but this is something we appreciated and we made an early and successful effort to overcome this.

ST:  Are there any social or political implications for your research?

I think the aspect of our work that matters most to the general public is its utility for drug and vaccine target discovery. According to WHO data, infectious diseases are the leading cause of productivity loss and are responsible for roughly a third of annual deaths worldwide.

Antimicrobial resistance is increasing at an alarming rate and researchers haven’t been able to generate new antibiotics as quickly as the bacteria have gained resistance to old ones. Newly emerging diseases are also causing considerable concern—including worries about the potentially significant economic and social impacts that new national or global disease outbreaks could have.

The discovery of new vaccine and drug targets has in the past been an expensive and slow process, but PSORTb has the potential to speed this process up significantly and really make an impact on the drug/vaccine development pipeline.End

Jennifer L. Gardy, PhD
Postdoctoral Fellow, R.E.W. Hancock Lab
Centre for Microbial Diseases and Immunity Research
University of British Columbia
Vancouver, BC, Canada

ESI Special Topics, July 2006
Citing URL - http://www.esi-topics.com/nhp/2006/july-06-JenniferGardy.html

•> Search Special Topics
New Hot Papers Menu || All Topics Menu
New Hot Papers Comments Menu
Help || About || Contact

ScienceWatch.com - Tracking Trends and Perfomance in Basic Research
Go to the new ScienceWatch.com

Write to the Webmaster with questions/comments. Terms of Usage.
The Research Services Group of Thomson Scientific |
(c) 2008 The Thomson Corporation.