How I scraped data from Google Scholar


Nature spoke to Alberto Martín-Martín, a PhD student in Spain, about how he and his colleagues scraped data from the academic search engine Google Scholar, a platform that is notoriously difficult to mine. The team spent months collecting data on 2.3 million papers listed on Google Scholar to find out how often the popular service points readers to free-to-read versions of research papers.