WG1: E-Mail Retrieval for Contacting Survey Respondents

Task:

scrape e-mails from department websites so we can send out survey to wide variety of departments all around the world

Method:

R packages {rvest} and {Rselenium} - retrieve all texts and hyperlinks on website that look like e-mails incl. on subpages

Caveat:

might include some non-wanted E-mails (e.g. administration), data still needs more cleaning


Overall Numbers

Succesfully scraped 159,811 e-mails from:

49

Countries

394

Universities

2933

Departments

This represents (of total planned sample):

100%

of Countries

97.77%

of all Universities (403)

81.77%

of all Departments (3587)

Geographic Distribution of Scraped E-Mails

Number of E-Mails per Country

Coverage (Table)