Skip to main content

Table 1 Overview of data files/data sets

From: Crowdsourced dataset to study the generation and impact of text highlighting in classification tasks

Label

Name of data file/data set

File types (file extension)

Data repository and identifier (DOI or accession number)

Crowd highlights

crowdsourced_highlights.csv: the dataset containing highlighted passages provided by workers from Figure Eight

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

ML highlights

ml_highlights.csv: the dataset containing the highlighted passages produced by automatic techniques

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification OA crowd highlights

classification_oa-crowd-highlights.csv: first dataset from Experiment 1. OA predicate using crowd-generated highlights

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification tech crowd highlights

classification_tech-crowd-highlights.csv: second dataset from Experiment 1. Tech predicate using crowd-generated highlights

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification Amazon crowd highlights

classification_amazon-crowd-highlights.csv: third dataset from Experiment 1. AMZ predicate using crowd-generated highlights

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification tech 3 × 12 crowd highlights

classification_tech-3 × 12-crowd-highlights.csv: first dataset from Experiment 2. tech predicate using crowd-generated highlights. Layout 3 × 12

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification tech 6 × 6 crowd highlights

classification_tech-6 × 6-crowd-highlights.csv: second dataset from Experiment 2. tech predicate using crowd-generated highlights. layout 6 × 6

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification OA ML highlights

classification_oa-ML-highlights.csv: first dataset from Experiment 3. OA predicate using machine-generated highlights

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification Tech ML highlights

classification_tech-ML-highlights.csv: second dataset from Experiment 3. Tech predicate using machine-generated highlights

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4

Classification Amazon ML highlights

classification_amazon-ML-highlights.csv: third dataset from Experiment 3. AMZ predicate using machine-generated highlights

Comma-separated values (.csv)

https://0-doi-org.brum.beds.ac.uk/10.6084/m9.figshare.9917162.v4