dataset   3295

« earlier    

CNN Transcript Collection (2000-2012) : CNN : Free Download & Streaming : Internet Archive
"For over a decade, CNN (Cable News Network) has been providing transcripts of shows, events and newscasts from its broadcasts. The archive has been maintained and the text transcripts have been dependably available at transcripts.cnn.com. This is a just-in-case grab of the years of transcripts for later study and historical research."
data  history  archive  dataset  news  from delicious
16 days ago by ssn
SEED: Software Engineering Evidence Database
"Welcome to SEED: Software Engineering Evidence Database"
programming  reference  research  dataset 
17 days ago by emerose
Google Ngram Viewer
Here is the motherload of datasets. Word frequency in books over last 200 years from the GOOG.
brilliant  to  make  game  word  association  google  data  dataset  from delicious
17 days ago by playfulsystems
Research: Text Segmentation and Classification in Email Messages
Our dataset consists of 11881 annotated lines from almost 400 email messages drawn at random from the Enron email corpus. We use the database dump of the Enron corpus (219Mb) released by Andrew Fiore and Jeff Heer. This version of the corpus has been processed to remove duplicate messages and to normalise sender and recipient names, resulting in just over 250,000 email messages. No attachments are included. Our annotations are made by a single annotator.
email  functional  zones  data  dataset 
23 days ago by davidar
The Miso Project :: Dataset
Dataset is a JavaScript client-side data transformation and management library. Dataset makes managing client-side data easy by handling loading, parsing, sorting, querying & manipulating data from all sorts of sources.
parser  js  dataset  tools  library  visualization  data  javascript  from delicious
24 days ago by adycoles

« earlier    

Copy this bookmark:



description:


tags: