Do you work on the project producing this data? Can you help us keep this page up to date? If so, why not update the package information.
Neurocommons text mining pilot (neurocommons_text_mining_pilot)
http://sw.neurocommons.org/2007/text-mining.html
About
The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by following this link to the mailing list. The tar.gz format is a tarred and gzipped file containing triples given in the N-Triples syntax.
Data exposed: extracted from Temis software applied to 7% of Medline records Size of dump and data set: 24 MB Notes: released without contract
Openness
Data is comprised of other datasets - most of which are open.
Resources
| URL | Format | Description | Hash |
|---|---|---|---|
| http://purl.org/hcls/2007/kb-sources/neurocommons-text-mining.tgz |

An 