Do text mining / retrieving full text

As mentioned on our Open Access Subset page, most articles in PMC are subject to traditional copyright restrictions and are not available for bulk download. For articles in the Open Access Subset, however, we provide several ways to retrieve the full text programmatically.

Use the E-utilities EFetch service to get the full-text XML of a PMC article in the OA subset:

https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=pmc&id=4304705
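
The request above could be issued from a script along the following lines. This is only a minimal sketch using the Python standard library: the helper names efetch_url and fetch_pmc_xml are hypothetical (they are not part of any NCBI client), the 30-second timeout is an arbitrary choice, and the only endpoint and parameters used (db and id) are the ones shown in the example URL.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the example URL above.
EFETCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_url(pmc_id):
    """Build the EFetch URL for a numeric PMC ID (hypothetical helper)."""
    return EFETCH_BASE + "?" + urlencode({"db": "pmc", "id": str(pmc_id)})


def fetch_pmc_xml(pmc_id, timeout=30):
    """Download the EFetch response and return it as raw bytes.

    Assumption: the caller handles network errors (urllib raises
    URLError/HTTPError) and decides how to parse the XML.
    """
    with urlopen(efetch_url(pmc_id), timeout=timeout) as response:
        return response.read()
```

Calling fetch_pmc_xml(4304705) would perform the same request as the example URL; efetch_url(4304705) only builds the URL and makes no network call.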

You can also use the OAI-PMH service to retrieve the full text of articles in the OA subset:

https://www.ncbi.nlm.nih.gov/pmc/oai/oai.cgi?verb=GetRecord&identifier=oai:pubmedcentral.nih.gov:4304705&metadataPrefix=pmc
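
A similar sketch for the OAI-PMH request, again using only the Python standard library. The helper names oai_getrecord_url, fetch_oai_record, and oai_error_code are hypothetical, and the timeout is an arbitrary choice. The identifier format and the pmc metadata prefix are copied from the example URL; the error check relies on the OAI-PMH 2.0 convention of reporting problems in an error element with a code attribute.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the example URL above.
OAI_BASE = "https://www.ncbi.nlm.nih.gov/pmc/oai/oai.cgi"
# XML namespace defined by the OAI-PMH 2.0 specification.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def oai_getrecord_url(pmc_id, metadata_prefix="pmc"):
    """Build a GetRecord URL for a numeric PMC ID (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "identifier": "oai:pubmedcentral.nih.gov:" + str(pmc_id),
        "metadataPrefix": metadata_prefix,
    }
    # safe=":" keeps the colons in the identifier unescaped, as in the example.
    return OAI_BASE + "?" + urlencode(params, safe=":")


def oai_error_code(xml_bytes):
    """Return the OAI-PMH error code in a response, or None if there is none."""
    root = ET.fromstring(xml_bytes)
    error = root.find(OAI_NS + "error")
    return None if error is None else error.get("code")


def fetch_oai_record(pmc_id, timeout=30):
    """Download a GetRecord response as bytes; raise if the service reports an error."""
    with urlopen(oai_getrecord_url(pmc_id), timeout=timeout) as response:
        body = response.read()
    code = oai_error_code(body)
    if code is not None:
        raise RuntimeError("OAI-PMH error: " + code)
    return body
```

Checking for an error element before using the response is worthwhile because an OAI-PMH service can report a problem, such as an unknown identifier, in the XML body rather than through the HTTP status.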

Additionally, all of the full-text source files for the Open Access Subset, including PDFs, images, and supplementary material, are available from our FTP site, as described on our FTP Service page.

Last updated: Mon, 10 Apr 2017