Informer
Newsletter of the BCS Information Retrieval Specialist Group
  • Informer Home
  • Issues
    • Autumn 2018
    • Winter 2019
    • Spring 2019
    • Summer 2019
    • Autumn 2019
    • Winter 2020
    • Spring 2020
    • Summer 2020
    • Autumn 2020
    • Winter 2021
    • Spring 2021
    • Summer 2021
    • Autumn 2021
    • Winter 2022
    • Spring 2022
    • Summer 2022
  • Articles by Topic
  • Authors
  • About Informer
Browse: Home / 2013 / October / Celebrating Stephen Robertson’s Retirement

Celebrating Stephen Robertson’s Retirement

By John Tait on 31st October 2013

Stephen Robertson retired from the Microsoft Research Lab. in Cambridge during the summer of 2013 after a long career as one of the most influential, well liked and eminent researchers in Information Retrieval throughout the world.

Stephen Robertson at SIGIR 2013 in Dublin

Stephen began his research career in the late 1960’s when he took an M.Sc. in Information Science at City University, before beginning a Ph.D. at University College London, under the renowned Information Scientist B. C. Brookes. Stephen was always more mathematically sophisticated than most Information Scientists having taken a First in mathematics at Cambridge, and this engagement with mathematics characterized his career throughout.

During this period, whilst also working as a Research Assistant at Aslib, the Association of Special Libraries and Information Bureaux, he produced his first paper, the two part: “The Parametric Description of Retrieval Tests” [1], which can still be read with profit by today’s researchers. In particular it contaisn a careful and timeless analysis of issues to do with the presentation of results of performance curves (like precision/recall) and the advocacy of the use of recall/fallout rather than recall/precision.

Stephen left Aslib to study full time with Brookes, on award of a Royal Society Information Science Fellowship, a signal honor. The only other people to have been awarded this Fellowship have been Karen Sparck Jones and Keith van Rijsbergen!

After obtaining his Ph.D. in 1976, Stephen moved from UCL back to City where he took up Lectureship in the Department of Information Science, a department with which he has had a continuous association for more than forty years, and indeed continues through his holding of Emeritus Professorial title at City, although in a somewhat different organizational context.

In 1998 he took up a position at the then very new Microsoft Cambridge Research Laboratory, whilst continuing part time at City. Being the modest and charming man he is, Stephen always said he was hired as a playmate for Karen Sparck Jones, the wife of the founding Microsoft Laboratory Director, Roger Needham. We are not so sure: we rather suspect prescience on Roger’s part over the emerging importance of search, and the difficulty of hiring his own wife!

In 2008 Stephen took on a third role: returning to University College London as Visiting Professor, a role which he will continue in his retirement.

Stephen Robertson’s research at University College London, City University, London, and Microsoft Research has had a significant and sustained impact on both the theory and practice of search engine technology and information retrieval. His international status and significance for the research and practice in information retrieval was recognized in 2000 when he received the ACM SIGIR Gerard Salton Award. He also received the Tony Kent Strix Award in 1998 from the Chartered Institute of Library and Information Professionals, and many other awards and honors.

Stephen Robertson’s work in the 1970’s on models for probabilistic term weighting [2] and his probabilistic principle for ranking documents [3] had substantial impact on the theory of information retrieval. Robertson’s probabilistic model allows for reasoning about the weighting of search terms, something that the vector space models of that time – that needed so-called tf.idf term weighting – were unable to do. The model also allowed re-weighting of search terms based on user feedback, which was hard to obtain at the time, but now available in abundance as user click-through data of large search engines. His model and term weighting approaches are discussed in all major educational books on information retrieval, and are cited thousands of times in Google Scholar.

Stephen Robertson is one of the main architects, with Stephen Walker, of the experimental search system Okapi, one of the first information retrieval systems that allowed for natural language queries (without boolean operators AND, OR and NOT). Okapi was one of the most successful search systems at the early Text Retrieval Conferences (TREC) that are organized by the US National Institute of Standards and Technology (NIST). The Okapi TREC reports are cited over 2500 times in Google Scholar. At TREC, the Okapi system was not only applied to ad hoc search, but to various other tasks too, including interactive search and information routing and filtering. The TREC adaptive filtering task that was organized by Robertson from 2000 until 2002, researched approaches to filter information from vast streams of documents, a scenario very similar to analysis of Twitter streams today.

With the Okapi team, Robertson aimed at building pragmatic best match term weighting implementations of the probabilistic models that he proposed earlier. One of these approaches, coined BM25 [4], was very successful in research studies, and found its way in many information retrieval systems and products, including open source search systems like Lucene/Solr, Lemur/Indri, Galago, GrapeShot, Xapian, and Terrier. BM25 is used as one of the most important signals in large web search engines, certainly in Microsoft’s Bing, and probably in other web search engines too. BM25 is also used in various other Microsoft products such as Microsoft Sharepoint and Microsoft SQL Server. Robertson and his colleagues at Microsoft developed a version of BM25 for fielded search queries specifically for Microsoft products like Sharepoint in 2004 [5]. Arguably, Robertson’s BM25 is the most influential information retrieval term weighting and ranking algorithm that exists today. Certainly Bruce Croft, of the University of Massachusetts Amherst, probably the leading academic expert on Information Retrieval in the world, is of the view that the performance of BM25 can only be exceeded by using various forms of machine learning or modeling algorithms on the collection, the query set, click streams and so on.

He continues to publish actively, having had an important paper published as recently as SIGIR 2012.

Stephen Robertson has served as a lively and informative keynote speaker and panel member at many conferences, including ACM SIGIR, ECIR (European Conference on Information Retrieval organized amongst others by the BCS), MIREX (International Conference on Music Information Retrieval), IIiX, (Information Interaction in Context), and NTCIR (NII Testbeds and Community for Information access Research). Robertson has given numerous invited talks, for instance at the S.R. Ranganathan Memorial Lectures, and at UCLA. These are invariably well received.

Stephen has not only been an important intellectual and practical contributor to the emergence of modern Information Retrieval. He has also been an important contributor at a personal level to the development of the IR community.

He is charming and helpful, and always has been. He has an ability to point out what is wrong with someone’s thinking in a way which is supportive and encouraging, rather than with any element of point scoring. This is particularly appreciated by his former research students. He deals with all members of the community with patience and equity, whether they are research student at the beginning of their career, or the most senior professor. One of the present authors (Tait) remembers Stephen most firmly, courteously and politely pointing in a program committee meeting that a paper could not be rejected on the basis the ideas had been discussed many years before in a Cambridge Coffee Room, unless one could identify where they were written up. Others would have left the object of the criticism feeling wounded: not Stephen!

Stephen plans to continue to be active in Information Retrieval in his retirement: but “only doing the bits I like”.

I’m sure the whole of the IR community will join the authors in wishing Stephen a long and happy retirement.

[1] S.E. Robertson, The parametric description of retrieval tests. Part 1: the basic parameters; Part 2: overall measures. Journal of Documentation 25, 1-27, 93-107 (1969).
[2] Stephen Robertson and Karen Sparck-Jones, Relevance weighting of search terms. Journal of the American Society of Information Science 27, pp. 129-146, 1976.
[3] Stephen Robertson, The probability ranking principle in information retrieval, Journal of Documentation 29(6), 481-485, 1977.
[4] Stephen Robertson and Stephen Walker. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In Proceedings of the 17th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 232-241, 1994. (900 citations in Google Scholar)
[5] Stepen Robertson, Hugo Zaragoza and Michael Taylor, Simple BM25 extension to multiple weighted fields. In Proceedings of the ACM Conference on Knowledge and Information Management, CIKM, pp 42-49, 2004. (385 citations in Google Scholar)

 

Authors: Djoerd Hiemstra, John Tait, Andrew MacFarlane, Nick Belkin

 

 

About John Tait
John Tait

John Tait obtained a Ph.D. from the University of Cambridge in 1983 for a thesis entitled “Automatic Summarising of English Text”. He subsequently followed a career in industry, mainly working on problems of large scale information retrieval and management, before taking up a post at the University of Sunderland in 1991, where he eventually became Professor of Intelligent Information Systems and Associate Dean of Computing And Technology, as well as leading the University of Sunderland Information Retrieval Group. He now runs his own boutique consultancy, specialising in patent search and other areas of specialised information management. John is Chair of the BCS IRSG, a past Programme Committee chair of the ACM SIGIR conference (2005), an Associate Editor of ACM Transaction on Information Systems, joint Editor of Natural Language Engineering and has published over 100 refereed conference and journal papers. His current research focuses on problems of multi-lingual search and on patent retrieval.

« Previous Next »

Search

Recent comments

  • Tony Russell-Rose on And finally….from the Editor
  • New Informer – Winter 2021 | Information Interaction on ECIR 2022 and ECIR 2023 – locations confirmed
  • New Informer – Winter 2021 | Information Interaction on Strix Lecture 2020 event – 26 November 2020
  • New Informer – Winter 2021 | Information Interaction on Editorial
  • New Informer – Autumn 2020 | Information Interaction on ECIR 2021 – planning for a virtual conference

Categories

  • Autumn 2022
  • News and alerts
  • ECIR2023
  • Summer 2022
  • Spring 2022
  • IRSG management
  • Winter 2022
  • Autumn 2021
  • ECIR 2022
  • Summer 2021
  • ECIR 2021 Conference Supplement
  • Awards
  • Spring 2021
  • Winter 2021
  • Autumn 2020
  • Summer 2020
  • Spring 2020
  • Winter 2020
  • Autumn 2019
  • Summer 2019
  • Spring 2019
  • Winter 2019
  • Autumn 2018
  • Summer 2018
  • Spring 2018
  • Winter 2018
  • Autumn 2017
  • Summer 2017
  • Spring 2017
  • Winter 2017
  • Autumn 2016
  • Summer 2016
  • Spring 2016
  • Winter 2016
  • Autumn 2015
  • Promotion
  • Summer 2015
  • Spring 2015
  • Winter 2015
  • Autumn 2014
  • Summer 2014
  • Spring 2014
  • Winter 2014
  • Autumn 2013
  • Summer 2013
  • Org Overview
  • Spring 2013
  • Winter 2013
  • Conference Review
  • Feature Article
  • Editorial
  • Events
  • Book Review
  • Autumn 2012
  • Summer 2012
  • Spring 2012
  • Winter 2012
  • Uncategorized

Tags

awards BCS Boolean City University clustering conference conferences design ECIR editorial enterprise seach enterprise search events Faceted search facets HCIR information architecture information discovery Information Retrieval information seeking interaction design IR IR practice IRSG log analysis MSR multimedia retrieval navigation recruitment search Search Solutions search strategies sensemaking site search ss12 survey taxonomy text analytics tutorial user experience user study wayfinding web search weka workshop

Authors

  • Agnes Molnar (1)
  • Alberto Purpura (1)
  • Aldo Lipani (1)
  • Alejandra Gonzalez-Beltran (1)
  • Allan Hanbury (1)
  • Amit Kumar Jaiswal (1)
  • Andy Macfarlane (50)
  • Benjamin Kille (1)
  • Benno Stein (1)
  • Birger Larsen (1)
  • Carsten Eickhoff (1)
  • Cathal Gurrin (8)
  • Charlie Hull (2)
  • Chris Madge (1)
  • Thomas Mandl (1)
  • Claudia Hauff (1)
  • Colin Wilkie (1)
  • David Elsweiler (1)
  • David Haynes (1)
  • David Maxwell (1)
  • Deirdre Lungley (1)
  • Dennis Aumiller (2)
  • Djoerd Hiemstra (1)
  • Franco Maria Nardini (1)
  • Frank Hopfgartner (13)
  • Gabriel Tanase (1)
  • Gabriella Kazai (5)
  • Giorgio Maria Di Nunzio (1)
  • Haiming Liu (2)
  • Helen Clegg (1)
  • Helen Lippell (1)
  • Iadh Ounis (1)
  • Ingo Frommholz (9)
  • Joao Magalheis (4)
  • Jochen L. Leidner (3)
  • John Tait (7)
  • Jolanta Pietraszko (1)
  • Jon Chamberlain (4)
  • Jose Alberto Equivel (1)
  • Julie Glanville (1)
  • Kamran Abbasi (1)
  • Katherine Allen (3)
  • Kurt Kragh Sørensen (1)
  • Linda Achilles (1)
  • Luca Soldaini (1)
  • Marc Sloan (2)
  • Marco Palomino (2)
  • Marianne Sweeny (1)
  • Marina Santini (1)
  • Markus Schedl (1)
  • Martin White (144)
  • Mateusz Dubiel (1)
  • Michael Oakes (1)
  • Mike Salampasis (1)
  • Mohammad Aliannejadi (1)
  • Morgan Harvey (1)
  • Nandita Tripathi (1)
  • Natasha Chowdory (1)
  • Norbert Fuhr (1)
  • Olivia Foulds (1)
  • Parth Mehta (2)
  • Paul Cleverley (1)
  • Paul Matthews (2)
  • Pedro Ruas (1)
  • Philipp Mayr (1)
  • Roland Roller (1)
  • Roman Kern (1)
  • Ronan Cummins (2)
  • Sam Marshall (1)
  • Samuel Dodson (1)
  • Selina Meyer (1)
  • Silviu Paun (1)
  • Song Chen (1)
  • Stefan Rueger (1)
  • Stephane Goldstein (1)
  • Stephanie Segura Rodas (1)
  • Steven Zimmerman (8)
  • Thanh Vu (1)
  • Tony Russell-Rose (31)
  • Trung Huynh (1)
  • Tu Bui (1)
  • Tyler Tate (8)
  • Udo Kruschwitz (35)
  • Val Gillet (1)

Copyright © 2023 Informer.

Powered by WordPress and Hybrid.