WG1: E-Mail Retrieval for Contacting Survey Respondents
Task:
Scrape e-mail addresses from department websites so that the survey can be sent to a wide variety of departments around the world.
Method:
R packages {rvest} and {RSelenium}: retrieve all text and hyperlinks that look like e-mail addresses from each website, including its subpages.
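The extraction step can be sketched roughly as follows. This is a minimal illustration, not the project's actual script: the helper name `extract_emails`, the regular expression, and the inline sample page are assumptions made for the example. It only uses {rvest} on already-parsed HTML; following subpages and rendering JavaScript-driven pages with {RSelenium} are not shown.

```r
library(rvest)

# Hypothetical helper: collect e-mail-like strings from one parsed page,
# looking at both the visible text and the targets of mailto: links.
extract_emails <- function(page) {
  # Visible text of the whole page
  text <- html_text2(page)

  # Targets of all hyperlinks; keep only mailto: links
  hrefs <- html_attr(html_elements(page, "a"), "href")
  hrefs <- hrefs[!is.na(hrefs)]
  mailto <- hrefs[grepl("^mailto:", hrefs, ignore.case = TRUE)]
  # Drop the "mailto:" scheme and any "?subject=..." query part
  mailto <- sub("^mailto:", "", mailto, ignore.case = TRUE)
  mailto <- sub("\\?.*$", "", mailto)

  # Assumed, deliberately simple pattern for "looks like an e-mail address"
  pattern <- "[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Za-z]{2,}"
  candidates <- c(text, mailto)
  found <- unlist(regmatches(candidates, gregexpr(pattern, candidates)))

  # Normalise case and remove duplicates
  unique(tolower(found))
}

# Made-up sample page standing in for a department website
page <- minimal_html('<p>Contact: Jane.Doe@example.edu</p>
  <a href="mailto:office@example.edu?subject=Hi">Office</a>
  <a href="/staff">Staff</a>')

emails <- extract_emails(page)
print(emails)
```

Relative links such as `/staff` in the sample are where a crawler would continue to subpages; how deep to follow them is a design choice the source does not specify.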
Caveat:
The results may include some unwanted e-mail addresses (e.g. administration); the data still need further cleaning.
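One possible cleaning step is to flag addresses whose local part looks like a role account rather than a person. The sketch below is only an assumption about how this could be done: the prefix list `role_prefixes` and the helper `flag_role_address` are hypothetical and would need to be adapted to the actual data.

```r
# Assumed, illustrative list of role-based local parts (not the project's list)
role_prefixes <- c("admin", "administration", "office", "info",
                   "secretary", "webmaster")

# Hypothetical helper: TRUE where the part before "@" is a known role prefix
flag_role_address <- function(emails, prefixes = role_prefixes) {
  local_part <- sub("@.*$", "", tolower(emails))
  local_part %in% prefixes
}

# Made-up sample addresses
emails <- c("jane.doe@example.edu", "office@example.edu", "Info@example.edu")

is_role <- flag_role_address(emails)
cleaned <- emails[!is_role]
print(cleaned)
```

Flagging rather than deleting keeps the decision reversible, which helps when a short local part such as `info` turns out to belong to a real person.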
Overall Numbers
Successfully scraped 159,811 e-mail addresses from:
49 countries
394 universities
2,933 departments
This represents (of the total planned sample):
100% of countries
97.77% of all universities (403)
81.77% of all departments (3,587)
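The coverage percentages follow directly from the counts above, as this short check shows. The variable names are illustrative; the planned number of countries (49) is inferred from the reported 100% coverage rather than stated explicitly.

```r
# Counts reported in the summary
scraped <- c(countries = 49, universities = 394, departments = 2933)

# Planned sample sizes; the country total is inferred from the 100% figure
planned <- c(countries = 49, universities = 403, departments = 3587)

# Share of the planned sample that was scraped, in percent
coverage <- round(100 * scraped / planned, 2)
print(coverage)
# countries: 100, universities: 97.77, departments: 81.77
```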