Protein Structure Initiative: Phase 3 or Phase
Out
The production-line approach to finding
protein structures is rapidly filling up databases. But is it the data
researchers want, and is it worth the cost?
|
CREDIT:
|
In the early
1990s when structural biologist Andrzej Joachimiak was working in the labs of Paul Sigler and
Arthur Horwich at
Today,
as head of the Midwest Center for Structural Genomics, a consortium of
investigators at eight institutions in the United States and Canada, Joachimiak and his colleagues churn out some 180 such
structures a year, an average of one every 2 days. Not all are as difficult as
the GroEL structure, but Joachimiak
estimates that recent technological advances would al low them to solve
something as complex as GroEL within about 2 months.
That, Joachimiak says, "is a true
revolution."
But some
in the field say the revolution has gone far enough. Joachimiak's
center is one of four high-throughput structural biology centers participating
in the Protein Structure Initiative (PSI), a big-science project funded by the
U.S. National Institute of General Medical Sciences (NIGMS), part of the
National Institutes of Health (NIH) in
But with
PSI now halfway through its second 5-year phase, critics say the cost of the
program is too high. This year, NIGMS will spend approximately $80 million on
PSI. By the end of phase 2 in July 2010, the total tab will be more than
three-quarters of a billion dollars. At a time when NIH funding is flat, many
critics argue that the money is better spent on traditional small-scale
structural biology projects, ones geared toward solving particular questions
about the detailed working of proteins highly relevant to biology and medicine.
In December, that message was underscored by an external review committee of
prominent biologists charged with assessing PSI. Among the report's
conclusions: "The large PSI structure-determination centers are not
cost-effective in terms of benefit to biomedical research." Structural
biologist Gregory Petsko of
PSI
proponents have plenty of counterarguments, and the debate shows no signs of
waning. "It's a real hot point in the community," says Janet Smith, a
structural biologist at the
A
family affair
Whereas genomics can reveal the sequence of amino acids in a protein,
structural biology tells us how that sequence folds up into a particular shape,
which is key to a protein's function. These structures have long been seen as a
treasure trove of information about life's molecular machines. By revealing
structures through x-ray crystallography and nuclear magnetic resonance
spectroscopy, structural biologists glean insights into how they operate. In
some cases, those insights can discover the likely function of an unknown
protein, lead to a deep understanding of how misshaped proteins cause disease,
and potentially reveal a path to new drug treatments. For example, resolving
the structure of the HIV-1 protease led to the creation of the first protease
inhibitors used to fight AIDS.
Structural
biologists have traditionally taken a hypothesis-driven approach to their
science, asking questions about proteins known to be of interest. PSI, by
contrast, chose a novel and somewhat controversial strategy: a
"discovery-based" approach primarily targeted at proteins from
different structural classes, or "families," throughout the protein
landscape. Members of each family fold up into similar shapes, often adopting
similar functions, such as proteases, kinases, and phosphatases. One major goal of PSI has been to obtain
structures of representatives of as many of these families as possible, in
particular the large families that have the most members. Proponents argue that
each structure could be the key to many more: information on how the sequences
fold into proteins should enable computational biologists to create
"homology models," detailed simulations of closely related family
members for which no physical structure exists, and thereby glean insights into
their function.
Success
was far from certain. When the project started in July 2000, perhaps the
biggest question was whether PSI centers would be able to automate all of the many
steps involved in mapping proteins. Unlike genomics, which relies on speeding
up one technology--reading the sequence of DNA's nucleotide bases--PSI leaders
had to speed up numerous technologies including cloning genes into microbes,
expressing and purifying proteins, coaxing them to form crystals, testing their
quality, collecting x-ray data, and solving the structure. "Early on, we
didn't know whether we were going to be able to build these pipelines,"
says Ian Wilson, a structural biologist at the Scripps Research Institute in
But the recent
review panel concluded that PSI's technology
development had been "highly successful," with advances dramatically
speeding all phases of structure determination. In many cases, the panel's
report concludes, PSI has fostered technology that can be adopted by more
traditional structural biology efforts. The centers not only developed new
technology, they've applied it effectively, too: The large centers now crank
out an average of 135 protein structures each per year.
PSI
proponents argue that this production- line approach has dropped the cost of
solving structures from about $250,000 apiece in 2000 to about $66,000 today.
But PSI's success is not just about the bottom line,
they argue. It's also revealing a diversity in protein
structures never seen before. A 2006 analysis in Science by Steven Brenner and John-Marc Chandonia
of Lawrence Berkeley National Laboratory in
But
researchers are still divided over just how useful all this new information is.
"I had reservations from the outset," says Petsko,
who says he objected because protein structures are only useful when they can
answer specific biochemical questions about the detailed workings of a protein.
The recent PSI assessment report echoes this criticism, calling PSI's strategy of focusing primarily on novelty
"seriously flawed." One problem, Smith and her co-authors argue, is
that the number of new protein families identified by gene-sequencing efforts
worldwide continues to grow more rapidly than the number of protein structures
being produced. A team of researchers reported last March in PLoS Biology, for example, that a random
sequencing of DNA from the world's oceans showed that more than half of all the
protein families they found had never been seen before, suggesting that
researchers are nowhere near completing their survey of the diversity of
protein families. That makes the challenge of obtaining representative
structures from each family "an open-ended problem," say the authors
of the assessment report.
What is
more, the assessment panel concluded that although having a protein structure
can help computer modelers make models of other members of that protein family,
those models almost always have a low resolution and lack detail of the precise
location of all the protein's different amino acid residues. Such detail is key to nailing down the exact biochemical workings of a
protein and often its specific function. "The ability to model structures,
particularly complex ones, is very far from being able to connect most PSI
structures to function," the report states. Even if an accurate model can
be made, using that to discern a protein's function is not a straightforward
task. A structure, Smith says, "is a little bit
of data" that can be used to discern a protein's function. "But it's
not as much as folks had hoped it would be."
On top of
these problems, critics say PSI's data are not
getting picked up by the broader community of biologists. In part, they argue
that's because only a relatively small fraction of this broader community knows
how to use this type of structural information. The bottom line, Smith says, is
that "the number of structures provided [by PSI] is not providing a boon
to biology." By contrast, she adds, when the Human Genome Project began to
release its data, it was instantly seized upon: "There was no need to ask,
'Was this worthwhile?' "
Function
follows form
PSI leaders counter that although it's true that the number of protein families
is growing rapidly, most of the newly discovered families have only a few
members. The majority of proteins are found in a small number of large families
that are the focus of PSI's targeting. Gaetano Montelione, a structural
biologist at
Many
researchers also dispute the claim that many PSI structures lack biological
relevance. A fraction of PSI targets are chosen for biological interest. And
any of its structures' relevance, as with the value of any basic research,
takes time to grow, they argue. "The benefits we will see 2 to 3 years
from now will be very great,"
David
Baker, a computational biologist at the
A question of value
Smith and others say they readily agree that PSI is producing good science, but
they question whether it's worth the cost. "It's
how do you get the most bang for your buck," says Philip Cole, a
pharmacologist who specializes in signal transduction at the Johns Hopkins
School of Medicine in
Still,
Cole and others worry that even if PSI isn't funded for a third phase, there's
no guarantee that money saved will flow to traditional structural biology
groups. That's not how science funding works. "If PSI were to be
discontinued, the money would go back to the general pool within NIGMS,"
Berg says. Structural biology funding, he adds, accounts for about 10% of the
NIGMS budget, with traditional single-investigator grants taking up about 6.3%.
So doing away with PSI would likely increase the share of funding for
individual structural biology grants from about 6.3% to perhaps 6.7%, Berg
says. What is more, structural biologists currently working on PSI would then
be competing for those funds. So the net result could wind up being "a
pretty big negative" for the community, Berg says.
|
(SOURCE):
PSI |
So what's
next? Berg says NIGMS is currently evaluating all of its large-scale projects
to decide which ones to continue. The PSI assessment panel argued against
continuing the project in its present form. "Future effort might be
focused on smaller projects with much higher experimental coupling to
biological function and improving computational methods of analyzing and
predicting protein structure," the report concluded. In his response to
the report, Montelione agreed that connecting more
directly with the priorities of biologists "needs to be a priority"
in designing PSI 3.
Others agree
that perhaps the best solution is to focus more tightly on protein targets with
known biological relevance, such as multiprotein
complexes, proteins that are embedded in cell membranes, and proteins from
disease-causing microbes. "This can evolve," says Joel Sussman, a structural biologist at the Weizmann
Institute of Science in