Beginning in mid-February 2008, the 1997-2007 online version of the Science Watch® newsletter, ESI-Topics.com, and in-cites.com, will all be featured together on the redesigned ScienceWatch.com. All previous content from the three sites will be permanently archived, and remain accessible from any existing bookmarks to the archived pages. No new content will be added to this site. Updates and new content (updated biweekly) are available at ScienceWatch.com now.

Fast Breaking Comments

By Angie Hinrichs

ESI Special Topics, June 2007
Citing URL - http://www.esi-topics.com/fbp/2007/june07-AngieHinrichs.html

Angie HinrichsAngie Hinrichs answers a few questions about this month's fast breaking paper in the field of Biology & Biochemistry. The author has also sent along images of their work.


From •>>June 2007

Field: Biology & Biochemistry
Article Title: The UCSC Genome Browser Database: update 2006
Authors: Hinrichs, AS;Karolchik, D;Baertsch, R;Barber, GP;Bejerano, G;Clawson, H;Diekhans, M;Furey, TS;Harte, RA;Hsu, F;Hillman-Jackson, J;Kuhn, RM;Pedersen, JS;Pohl, A;Raney, BJ;Rosenbloom, KR;Siepel, A;Smith, KE;Sugnet, CW;Sultan-Qurraie, A;Thomas, DJ;Trumbower, H;Weber, RJ;Weirauch, M;Zweig, AS;Haussler, D;Kent, WJ
Journal: NUCL ACID RES
Volume: 34
Issue:
Page: :D590-D598
Year: Sp. Iss. SI JAN 1 2006
* Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Sch Engn, Santa Cruz, CA 95064 USA.
* Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Sch Engn, Santa Cruz, CA 95064 USA.
* Univ Calif Santa Cruz, Howard Hughes Med Inst, Santa Cruz, CA 95064 USA.
* Duke Univ, Inst Gen Sci & Policy, Durham, NC 27708 USA.

  July 1, 2007: This paper has also been named the New Hot Paper in Biology & Biochemistry for July 2007.

ST:  Why do you think your paper is highly cited?


“While not all progress is necessarily good, isn't progress still preferable to stagnation?”


It describes a useful resource: a database of genomic sequences and annotations that now includes 18 vertebrate and 19 invertebrate genomes, with a layer of web applications for visualization, querying, and sequence similarity search. In 2006, our website served up an average of 12 million pages to 73,600 different IP addresses per month. We answer many usage questions daily from researchers around the world.

ST:  Does it describe a new discovery, methodology, or synthesis of knowledge?

It is a synthesis: a collection of data and analyses from other sources, combined with some local analyses, presented in a dynamic and interactive way.

It supports new bioinformatics methodologies by making it easy to download large amounts of public sequence and annotations, to narrow the volume of data by complex querying, and to visualize genomic data.

ST:  Could you summarize the significance of your paper in layman's terms?

We provide a website with data and tools for molecular biologists and bioinformaticians. There are graphical tools, search tools, data-mining tools, bulk downloads, and terabytes of underlying data from humans and many other species.

Some example types of data are genome sequences, genes, and similarities between different species’ genomes, variation between different members of the same species, and information about when and where particular genes are turned on or off.

We don't generate the biological sequence data; we collect it, organize it, run programs to analyze it, and provide tools that help to understand it. Scientists can upload their own data to be viewed or queried together with our database, collect data for research, and draw figures for presentations or publications.

How did you become involved in this research, and were there any particular problems encountered along the way?

The Genome Browser and other tools on our website are all the brainchildren of Jim Kent and David Haussler. My background is in computing (hardware and software); in 2002, I started reading up on the exciting field of bioinformatics, learned about Jim’s work, saw that he was hiring to build up this resource and managed to convince him that I could help.

The group has since grown to include many programmers and quality assurance engineers as well as postdocs and grad students who contribute annotations and code while using the Genome Browser to visualize results of their research. It's a terrific team effort.

The amount of public genomic data has been growing exponentially, and that has become a challenge—the disk space and computation requirements are considerable but manageable, but meanwhile just tracking all of the data sets in our processing and testing pipeline has been taking more and more time. We are working on automating as much of our pipeline as possible.

Are there any social or political implications for your research?

We take public data, and make it more accessible. Some might be concerned that certain genomic data, e.g., the sequence of the SARS virus, are inherently dangerous and should not be out in the open for anyone to see. However, I feel that, on balance, it is better for knowledge to be available to everyone than locked up. I believe that openness of information enables progress in general. While not all progress is necessarily good, isn’t progress still preferable to stagnation?End

Angie Hinrichs
Genome Browser Software Developer
Genome Bioinformatics Group
Center for Biomolecular Science and Engineering
University of California at Santa Cruz
Santa Cruz, CA, USA


A Closer Look...

A closer look... Below are images sent in by Angie Hinrichs which correspond with the featured paper, or current research.

Figure 1:

Figure 1: This screenshot shows the display portion of the Genome Browser tool. At the top is a small rendering of the currently viewed chromosome and its bands, with a bright red box indicating the region drawn in the main display below. The main display can be thought of as a graph with genomic position along the horizontal axis, and parallel "tracks" of data stacked vertically. The top line shows the genomic position as base pair offsets from the start of the chromosome sequence. Below that, successive tracks appear, each with a label followed by graphical renderings. Tracks shown include gene transcripts, expression values, likelihood of conservation, similarity with other species, polymorphisms observed in humans, and areas of known repetitive sequence. This screenshot shows only 6 of the 100 annotation tracks available for the March 2006 (NCBI 36.1) assembly of the human genome. The image itself is highly interactive: clicking on any part of it leads to more detail. Controls outside the image (not shown) support moving around in the genome, adding or removing tracks from the display, zooming in (as far as the base level) or out (as far as the entire chromosome).

Team photo from 2005:

From left: Angie Hinrichs, Andy Pohl, Ann Zweig, Jennifer Jackson, Mark Diekhans, Fan Hsu, Robert Kuhn, Jim Kent, Brian Raney, David Haussler, Hiram Clawson (back), Heather Trumbower (front), Jorge Garcia, Galt Barber, Donna Karolchik, Rachel Harte, Kate Rosenbloom, Patrick Gavin, Ali Sultan-Qurraie. Not pictured: Robert Baertsch, Gill Bejerano, Terry Furey, Jakob Pedersen, Brooke Rhead, Adam Siepel, Kayla Smith, Chuck Sugnet, Archana Thakkapallayil, Daryl Thomas, Ryan Weber, Matt Weirauch.  
   

ESI Special Topics, June 2007
Citing URL - http://www.esi-topics.com/fbp/2007/june07-AngieHinrichs.html

•> Search Special Topics
Fast Breaking Papers Menu || All Topics Menu
Fast Breaking Papers Comments Menu
Help || About || Contact

ScienceWatch.com - Tracking Trends and Perfomance in Basic Research
Go to the new ScienceWatch.com

Write to the Webmaster with questions/comments. Terms of Usage.
The Research Services Group of Thomson Scientific |
(c) 2008 The Thomson Corporation.