A Longitudinal Analysis of Search Engine Index Size

Antal Van den Bosch, Toine Bogers, Maurice De Kunder

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

3 Citationer (Scopus)

Abstrakt

One of the determining factors of the quality of Web search engines is the size of their index. In addition to its influence on search result quality, the size of the indexed Web can also tell us something about which parts of the WWW are directly accessible to the everyday user. We propose a novel method of estimating the size of a Web search engine’s index by extrapolating from document frequencies of words observed in a large static corpus of Web pages. In addition, we provide a unique longitudinal perspective on the size of Google and Bing’s indexes over a nine-year period, from March 2006 until January 2015. We find that index size estimates of these two search engines tend to vary dramatically over time, with Google generally possessing a larger index than Bing. This result raises doubts about the reliability of previous one-off estimates of the size of the indexed Web. We find that much if not all of this variability can be explained by changes in the indexing and ranking infrastructure of Google and Bing. This casts further doubt on whether Web search engines can be used reliably for cross-sectional webometric studies.
OriginalsprogEngelsk
TitelProceedings of ISSI 2015 Istanbul : 15th International Society of Scientometrics and Informetrics Conference
RedaktørerAlbert Ali Salah, Yasar Tonta, Alkim Almila Akdag Salah, Cassidy Sugimoto, Umut Al
Antal sider12
ForlagBoğaziçi University Printhouse
Publikationsdatojun. 2015
Sider71-82
ISBN (Trykt)978-975-518-381-7
StatusUdgivet - jun. 2015
Begivenhed15th International Society of Scientometrics and Informetrics Conference (ISSI 2015) - Istanbul, Tyrkiet
Varighed: 29 jun. 20153 jul. 2015
http://www.issi2015.org/en/

Konference

Konference15th International Society of Scientometrics and Informetrics Conference (ISSI 2015)
LandTyrkiet
ByIstanbul
Periode29/06/201503/07/2015
Internetadresse

Fingeraftryk Dyk ned i forskningsemnerne om 'A Longitudinal Analysis of Search Engine Index Size'. Sammen danner de et unikt fingeraftryk.

  • Aktiviteter

    • 1 Organisering af eller deltagelse i konference

    15th International Society of Scientometrics and Informetrics Conference (ISSI 2015)

    Toine Bogers (Deltager)

    29 jun. 20153 jul. 2015

    Aktivitet: Deltagelse i faglig begivenhedOrganisering af eller deltagelse i konference

    Citationsformater

    Van den Bosch, A., Bogers, T., & De Kunder, M. (2015). A Longitudinal Analysis of Search Engine Index Size. I A. A. Salah, Y. Tonta, A. A. A. Salah, C. Sugimoto, & U. Al (red.), Proceedings of ISSI 2015 Istanbul : 15th International Society of Scientometrics and Informetrics Conference (s. 71-82). Boğaziçi University Printhouse. http://www.issi2015.org/files/downloads/all-papers/0071.pdf