How Reliable Are University Rankings?

I think most of you probably know the answer to this question already, but now there’s a detailed study on this topic. Here is the abstract of a paper on the arXiv on the subject

University or college rankings have almost become an industry of their own, published by US News \& World Report (USNWR) and similar organizations. Most of the rankings use a similar scheme: Rank universities in decreasing score order, where each score is computed using a set of attributes and their weights; the attributes can be objective or subjective while the weights are always subjective. This scheme is general enough to be applied to ranking objects other than universities. As shown in the related work, these rankings have important implications and also many issues. In this paper, we take a fresh look at this ranking scheme using the public College dataset; we both formally and experimentally show in multiple ways that this ranking scheme is not reliable and cannot be trusted as authoritative because it is too sensitive to weight changes and can easily be gamed. For example, we show how to derive reasonable weights programmatically to move multiple universities in our dataset to the top rank; moreover, this task takes a few seconds for over 600 universities on a personal laptop. Our mathematical formulation, methods, and results are applicable to ranking objects other than universities too. We conclude by making the case that all the data and methods used for rankings should be made open for validation and repeatability.

The italics are mine.

I have written many times about the worthlessness of University league tables (e.g. here).

Among the serious objections I have raised is that the way they are presented is fundamentally unscientific because they do not separate changes in data (assuming these are measurements of something interesting) from changes in methodology (e.g. weightings). There is an obvious and easy way to test for the size of the weighting effect, which is to construct a parallel set of league tables each year, with the current year’s input data but the previous year’s methodology, which would make it easy to isolate changes in methodology from changes in the performance indicators. No scientifically literate person would accept the result of this kind of study unless the systematic effects can be shown to be under control.

Yet purveyors of league table twaddle all refuse to perform this simple exercise. I myself asked the Times Higher to do this a few years ago and they categorically refused, thus proving that they are not at all interested in the reliability of the product they’re peddling.

Snake oil, anyone?

6 Responses to “How Reliable Are University Rankings?”

  1. An obvious demonstration of the shortcomings of these rankings is the way a given college can change positions dramatically from one year to another. Universities are like cities; they just don’t change that fast.

    • telescoper Says:

      Yes, this “churn” is entirely manufactured by adjusting weightings each year.

      An exception is of course when Maynooth shot up in the rankings when I arrived.

      • Phillip Helbig Says:

        There are probably some universities where the rank goes up when someone arrives, and simultaneously goes up at the place he left. 🙂 Not in your case, of course.

      • telescoper Says:

        When I left Nottingham to go Cardiff my former PhD supervisor emailed me to say precisely that. 😐

  2. Phillip Helbig Says:

    Another common error is assuming that one university is better than another because it has a higher rank. Even assuming that the ranking measures something meaningful, it of course has some uncertainty associated with it. The top 3, or to 30, universities might all have the same rank within the associated uncertainties.

    My favourite example of gaming the system: someone was told that he could not publish a paper in ApJ until the beginning of the next year, because someone else had published a paper in Nature. Thus, additional publications would decrease the fraction of publications in Nature, and since this fraction was weighted higher than the total number of publications and the number of publications in Nature, the decision was to reduce scientific output in reputable journals in order to increase the rank.

  3. Phillip Helbig Says:


    The Lyrid meteor shower peaks tonight. The Moon is almost new, and where I am and in many other places the sky is completely cloudless.

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: