Tuesday, May 1, 2012

2 GB corpus of biomedical open research indexed for text analysis / data mining

As of 1 May 2012, BioMed Central (together with Chemistry Central and SpringerOpen) has published 123,400 peer-reviewed research articles. All of them are covered by our open access license agreement, which allows free distribution and re-use of the full-text article, including the highly structured XML version.
As a result, BioMed Central's open access corpus is ideally suited for use by text mining researchers.
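
To give a rough idea of what working with structured XML full text can look like, here is a minimal Python sketch. It is only an illustration under stated assumptions: the sample document and its element names (article, title, abstract, body, sec, p) are hypothetical and are not taken from the actual corpus schema, and the helper names extract_text and term_counts are made up for this example. It flattens one XML article to plain text and counts word frequencies, a common first step in text mining.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample article. The element names are assumptions made for
# illustration only; they are NOT the real schema of the corpus.
SAMPLE_XML = """<article>
  <title>Gene expression in yeast</title>
  <abstract><p>We study gene expression in yeast cells.</p></abstract>
  <body>
    <sec><p>Gene expression varies across yeast strains.</p></sec>
  </body>
</article>"""


def extract_text(xml_string):
    """Return all text content of the XML document as one normalized string."""
    root = ET.fromstring(xml_string)
    # itertext() yields every text fragment in document order;
    # split/join collapses the whitespace left over from the markup layout.
    return " ".join(" ".join(root.itertext()).split())


def term_counts(text):
    """Count lowercase alphabetic tokens in the given text."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


text = extract_text(SAMPLE_XML)
counts = term_counts(text)
print(counts.most_common(3))
```

For a real article, the same two functions could be applied to the contents of a downloaded XML file instead of SAMPLE_XML, after checking which elements the actual schema uses.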


http://www.biomedcentral.com/about/datamining