Abstract
The emergence and success of web platforms nurtured a trend
within social studies: “Hyperlink is dead!”. Capturing their users
into mobile applications and specialised web interface to propose
them a specific user experience (and business model), the platforms
indeed created new information silos in the open World Wide Web
space. The simplified availability of user behavioural data through
these platforms APIs reinforced this idea in academic communities
by providing scholars with an easy way to collect rich user centric
data for their research. After discussing the methodological aspects
of the web divide between platforms and classical websites, we will
argue that although it becomes more and more invisible, the hyperlink, modern incarnation of intertextual links between documents,
is still a central and structural element of the web. Hyperlinks remain an invaluable resource to turn the web into a research field
in spite of the complexity to collect, manipulate and curate them.
We will illustrate those methodological challenges by describing
the choices we made in designing Hyphe, a tool dedicated to the
creation of web corpora tailored for mining hypertexts.
within social studies: “Hyperlink is dead!”. Capturing their users
into mobile applications and specialised web interface to propose
them a specific user experience (and business model), the platforms
indeed created new information silos in the open World Wide Web
space. The simplified availability of user behavioural data through
these platforms APIs reinforced this idea in academic communities
by providing scholars with an easy way to collect rich user centric
data for their research. After discussing the methodological aspects
of the web divide between platforms and classical websites, we will
argue that although it becomes more and more invisible, the hyperlink, modern incarnation of intertextual links between documents,
is still a central and structural element of the web. Hyperlinks remain an invaluable resource to turn the web into a research field
in spite of the complexity to collect, manipulate and curate them.
We will illustrate those methodological challenges by describing
the choices we made in designing Hyphe, a tool dedicated to the
creation of web corpora tailored for mining hypertexts.
Original language | English |
---|---|
Title of host publication | WS.2 2018 - Proceedings of the 2nd International Conference on Web Studies : Seeing Through the Web |
Editors | Everardo Reyes, Mark Bernstein, Giancarlo Ruffo, Imad Saleh |
Number of pages | 7 |
Volume | 2 |
Publisher | Association for Computing Machinery |
Publication date | 3 Oct 2018 |
Pages | 12-18 |
ISBN (Electronic) | 9781450364386 |
DOIs | |
Publication status | Published - 3 Oct 2018 |
Externally published | Yes |
Event | 2nd International Conference on Web Studies: Seeing Through the Web, WS.2 2018 - Paris, France Duration: 3 Oct 2018 → 5 Oct 2018 |
Conference
Conference | 2nd International Conference on Web Studies: Seeing Through the Web, WS.2 2018 |
---|---|
Country/Territory | France |
City | Paris |
Period | 03/10/2018 → 05/10/2018 |
Sponsor | Université Paris 8 |
Series | ACM International Conference Proceeding Series |
---|
Keywords
- Corpus
- Crawler
- Curation
- Hyperlink
- Hypertext
- Network analysis
- Web mining