Beginning in mid-February 2008, the 1997-2007 online version of the Science Watch® newsletter, ESI-Topics.com, and in-cites.com, will all be featured together on the redesigned ScienceWatch.com. All previous content from the three sites will be permanently archived, and remain accessible from any existing bookmarks to the archived pages. No new content will be added to this site. Updates and new content (updated biweekly) are available at ScienceWatch.com now.

Fast Breaking Comments

By Nils Lid Hjort & Gerda Claeskens

ESI Special Topics, August 2005
Citing URL - http://www.esi-topics.com/fbp/2005/august05-Hjort_Claeskens.html

Nils Lid Hjort & Gerda Claeskens answers a few questions about this month's fast breaking paper in the field of Mathematics.


From •>>August 2005  

Field: Mathematics
Article Title: Frequentist model average estimators
Authors: Hjort, NL;Claeskens, G
Journal: J AMER STATIST ASSN
Volume: 98
Page: 879-899
Year: DEC 2003
* Univ Oslo, Dept Math, N-0316 Oslo, Norway.
* Univ Oslo, Dept Math, N-0316 Oslo, Norway.
* Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA.

ST:  Why do you think your paper is highly cited?

Nils Lid HjortGerda Claeskens
“We are the first to give accurate descriptions of distributions and risk functions of estimators- after- model- selection and of estimators averaged across models.”

We are quite pleased to learn that our paper on "frequentist model averaging" is highly cited. Part of the reason is that it hopefully is seen as having broad relevance, dealing with issues that concern not only specialists but actually mainstream statistics, both theoretical and applied. The paper deals with an issue that has been if not ignored, then "left aside" for much of the history of statistical model fitting: How does the selection of a model influence further inference? We are glad that other people are taking up this issue as well, and are now beginning to use more appropriate methods, as opposed to the still-common practice that ignores the model selection step in data analyses, leading to underreporting variability and over-optimism about confidence intervals and testing results, which we quantify in our paper.

ST:  Does it describe a new discovery or a new methodology that's useful to others?

Our paper provides a framework for studying the effects of statistical model selection and more generally of model averaging strategies. We are the first to give accurate descriptions of distributions and risk functions of estimators after model selection and of estimators averaged across models. Our framework effectively handles the aspect of modelling bias—that most models do not quite match the complex reality. The results we reach are rather general in nature, and facilitate comparison of different selection methods. Our research has also led to completely new model selection methods, e.g., the "Focused Information Criterion" (see Claeskens and Hjort, Journal of the American Statistical Association, 98: 900-916, 2003).

ST:  Could you summarize the significance of your paper in layman's terms?

"All models are wrong, but some are useful." - George E.P. Box

Thus there is a need for constructing statistical model selection methods, and some have been around for at least 30 years. Typical data analysis, from simple regressions to highly complex phenomena, therefore take the form of: (1) select a model, (2) give conclusions in terms of confidence intervals, p-values, etc. The trouble has been that stage (2) really ought to take stage (1) into account, but statistical practice has been to proceed with (2) as if the selected model had been chosen in advance. That this is not good enough should be made clear from the fact that when an experiment is run a second time, a data set is generated that might be only slightly different from the first, but easily leads to selecting a different final model. Our approach deals with this uncertainty and gives a precise description of, for example, the distribution of estimators obtained by using a statistically selected model. When using these results, correct conclusions of the data analysis can be stated, without being overly optimistic regarding, e.g., confidence intervals and p-values (as has been the common practice).

ST:  How did you become involved in this research?

We have collaborated on various projects since we first met at the Australian National University in 2000, where we had been invited by Professor Peter Hall, independently of each other. One of our papers was concerned with testing the fit of a given parametric model, where we used model selection criteria in the construction of the test. This led to considerations of "how does a test perform inside a randomly selected model?" Obviously, its distribution is influenced by not knowing in advance which model gets selected. To our surprise no such equivalent theory existed for estimators in general models. In the literature some authors already pointed out the problem several years ago, but no satisfactory solution had been provided. We then decided to take the challenge and tackle this long standing problem.End

Nils Lid Hjort
Professor
Department of Mathematics
University of Oslo
Oslo, Norway

Gerda Claeskens
Associate Professor
OR & Business Statistics
Katholieke Universiteit Leuven
Leuven, Belgium

ESI Special Topics, August 2005
Citing URL - http://www.esi-topics.com/fbp/2005/august05-Hjort_Claeskens.html

•> Search Special Topics
Fast Breaking Papers Menu || All Topics Menu
Fast Breaking Papers Comments Menu
Help || About || Contact

ScienceWatch.com - Tracking Trends and Perfomance in Basic Research
Go to the new ScienceWatch.com

Write to the Webmaster with questions/comments. Terms of Usage.
The Research Services Group of Thomson Scientific |
(c) 2008 The Thomson Corporation.