The right to read is the right to mine.
Openness inspires innovation, and PLOS is committed to making scientific work easily shared on as many platforms as possible—for human and machine readers alike.
As the volume of published literature grows, Text and Data Mining (TDM) is an increasingly important research methodology that derives insights from the automated analysis of text and data. To advance science, PLOS encourages all publishers to open their content stores to TDM efforts with minimal barriers.
Our approach to TDM is simple: PLOS articles may be mined, reused, and shared by anyone, anywhere, for any purpose.
200,000+ articles and growing.
Technical details: This zip file contains JATS-standard XML content of every PLOS article, including all Articles and Front Matter. It does not include Figures or Supplemental Data. The file is just under 5 GB and is updated every day with new articles. We also make our articles available through PubMed Central and our API.
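As a minimal sketch of how the archive might be mined, the Python below walks a zip file of JATS XML and pulls out each article's title and abstract. It assumes that every `.xml` member is a single JATS `<article>` document without XML namespaces on the core elements, and that the title and abstract sit at the standard JATS paths `front/article-meta/title-group/article-title` and `front/article-meta/abstract`. The function names `text_of` and `iter_articles` are hypothetical helpers written for this illustration, not part of any PLOS tooling.

```python
import zipfile
import xml.etree.ElementTree as ET


def text_of(element):
    """Return all text inside an element with whitespace collapsed."""
    if element is None:
        return ""
    return " ".join("".join(element.itertext()).split())


def iter_articles(zip_path):
    """Yield (member name, title, abstract) for each XML file in the archive.

    Assumption: each .xml member is one JATS <article> document.
    """
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if not name.lower().endswith(".xml"):
                continue  # skip anything that is not an article file
            # Members are read one at a time, so the archive is never
            # fully extracted to disk or loaded into memory.
            with archive.open(name) as handle:
                root = ET.parse(handle).getroot()
            title = text_of(root.find("front/article-meta/title-group/article-title"))
            abstract = text_of(root.find("front/article-meta/abstract"))
            yield name, title, abstract
```

Calling `iter_articles("plos_corpus.zip")` (a placeholder file name, not the real download name) in a `for` loop would stream one tuple per article. Articles that lack a title or abstract yield empty strings rather than raising an error.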
PLOS participates in industry efforts to further facilitate TDM research, including the drafting of The Hague Declaration on Knowledge Discovery in the Digital Age. The declaration lays out a set of five core principles and a roadmap to enable researchers to mine digital content on the web without legal repercussions.
Add your voice.