By Angie Hinrichs
ESI Special Topics,
June 2007
Citing URL - http://www.esi-topics.com/fbp/2007/june07-AngieHinrichs.html
|
Angie Hinrichs
answers a
few questions about this month's fast
breaking paper in the
field of Biology & Biochemistry. The
author has also
sent along images of their work.
From
•>>June 2007
Field:
Biology & Biochemistry
Article Title: The UCSC Genome Browser Database: update 2006
Authors:
Hinrichs, AS;Karolchik, D;Baertsch, R;Barber,
GP;Bejerano, G;Clawson, H;Diekhans, M;Furey, TS;Harte, RA;Hsu,
F;Hillman-Jackson, J;Kuhn, RM;Pedersen, JS;Pohl, A;Raney,
BJ;Rosenbloom, KR;Siepel, A;Smith, KE;Sugnet, CW;Sultan-Qurraie,
A;Thomas, DJ;Trumbower, H;Weber, RJ;Weirauch, M;Zweig,
AS;Haussler, D;Kent, WJ
Journal: NUCL ACID RES
Volume: 34
Issue:
Page: :D590-D598
Year: Sp. Iss. SI JAN 1 2006
* Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Sch Engn, Santa
Cruz, CA 95064 USA.
* Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Sch Engn, Santa
Cruz, CA 95064 USA.
* Univ Calif Santa Cruz, Howard Hughes Med Inst, Santa Cruz, CA
95064 USA.
* Duke Univ, Inst Gen Sci & Policy, Durham, NC 27708 USA.
|
 |
| |
July
1,
2007:
This paper has also been named the New Hot Paper in
Biology & Biochemistry for
July 2007. |
|
Why
do you think your paper is highly cited?
|
 |
|
|
“While not all progress is necessarily good,
isn't progress still preferable to stagnation?” |
|
|
It describes a useful resource: a database of genomic
sequences and annotations that now includes 18 vertebrate
and 19 invertebrate genomes, with a layer of web
applications for visualization, querying, and sequence
similarity search. In 2006, our website served up an average
of 12 million pages to 73,600 different IP addresses per
month. We answer many usage questions daily from researchers
around the world.
Does
it describe a new discovery, methodology, or synthesis of
knowledge?
It is a synthesis: a collection of data and analyses from
other sources, combined with some local analyses, presented
in a dynamic and interactive way.
It supports new bioinformatics methodologies by making it
easy to download large amounts of public sequence and
annotations, to narrow the volume of data by complex
querying, and to visualize genomic data.
Could
you summarize the significance of your paper in layman's terms?
We provide a website with data and tools for molecular
biologists and bioinformaticians. There are graphical tools,
search tools, data-mining tools, bulk downloads, and
terabytes of underlying data from humans and many other
species.
Some example types of data are genome sequences, genes,
and similarities between different species’ genomes,
variation between different members of the same species, and
information about when and where particular genes are turned
on or off.
We don't generate the biological sequence data; we
collect it, organize it, run programs to analyze it, and
provide tools that help to understand it. Scientists can
upload their own data to be viewed or queried together with
our database, collect data for research, and draw figures
for presentations or publications.
How did you become involved in this research, and were there
any particular problems encountered along the way?
The Genome Browser and other tools on our website are all
the brainchildren of Jim Kent and David Haussler. My
background is in computing (hardware and software); in 2002,
I started reading up on the exciting field of
bioinformatics, learned about Jim’s work, saw that he was
hiring to build up this resource and managed to convince him
that I could help.
The group has since grown to include many programmers and
quality assurance engineers as well as postdocs and grad
students who contribute annotations and code while using the
Genome Browser to visualize results of their research. It's
a terrific team effort.
The amount of public genomic data has been growing
exponentially, and that has become a challenge—the disk
space and computation requirements are considerable but
manageable, but meanwhile just tracking all of the data sets
in our processing and testing pipeline has been taking more
and more time. We are working on automating as much of our
pipeline as possible.
Are there any social or political implications for your
research?
We take public data, and make it more accessible. Some
might be concerned that certain genomic data, e.g., the
sequence of the
SARS
virus, are inherently dangerous and should not be out in the
open for anyone to see. However, I feel that, on balance, it
is better for knowledge to be available to everyone than
locked up. I believe that openness of information enables
progress in general. While not all progress is necessarily
good, isn’t progress still preferable to stagnation?
Angie Hinrichs
Genome Browser Software Developer
Genome Bioinformatics Group
Center for Biomolecular Science and Engineering
University of California at Santa Cruz
Santa Cruz, CA, USA
|
|
A Closer Look...
|
 |
Below
are images sent in by Angie Hinrichs which correspond with the featured
paper, or current research. |
|
|
Figure 1:
 |
|
Figure 1: This screenshot
shows the display portion of the
Genome Browser tool. At the top is a small
rendering of the currently viewed chromosome and
its bands, with a bright red box indicating the
region drawn in the main display below. The main
display can be thought of as a graph with
genomic position along the horizontal axis, and
parallel "tracks" of data stacked vertically.
The top line shows the genomic position as base
pair offsets from the start of the chromosome
sequence. Below that, successive tracks appear,
each with a label followed by graphical
renderings. Tracks shown include gene
transcripts, expression values, likelihood of
conservation, similarity with other species,
polymorphisms observed in humans, and areas of
known repetitive sequence. This screenshot shows
only 6 of the 100 annotation tracks available
for the March 2006 (NCBI 36.1) assembly of the
human genome. The image itself is highly
interactive: clicking on any part of it leads to
more detail. Controls outside the image (not
shown) support moving around in the genome,
adding or removing tracks from the display,
zooming in (as far as the base level) or out (as
far as the entire chromosome). |
|
|
|
Team
photo from 2005:
 |
|
From left: Angie Hinrichs, Andy
Pohl, Ann Zweig, Jennifer Jackson, Mark Diekhans,
Fan Hsu, Robert Kuhn, Jim Kent, Brian Raney,
David Haussler, Hiram Clawson (back), Heather
Trumbower (front), Jorge Garcia, Galt Barber,
Donna Karolchik, Rachel Harte, Kate Rosenbloom,
Patrick Gavin, Ali Sultan-Qurraie. Not pictured:
Robert Baertsch, Gill Bejerano, Terry Furey,
Jakob Pedersen, Brooke Rhead, Adam Siepel, Kayla
Smith, Chuck Sugnet, Archana Thakkapallayil,
Daryl Thomas, Ryan Weber, Matt Weirauch. |
|
| |
|
ESI Special Topics,
June 2007
Citing URL - http://www.esi-topics.com/fbp/2007/june07-AngieHinrichs.html
|
|
|