CoffeeLint - Lint your CoffeeScript
24 days ago by rybesh
CoffeeLint is a style checker that helps keep CoffeeScript code clean and consistent. CoffeeScript does a great job at insulating programmers from many of JavaScript's bad parts, but it won't help enforce a consistent style across a code base. CoffeeLint can help with that.
coffeescript
tools
24 days ago by rybesh
Domeo - The Annotation Toolkit
28 days ago by rybesh
Domeo is an extensible web application enabling users to visually and efficiently create and share ontology-based stand-off annotation on HTML or XML document targets. The tool supports manual, fully automated, and semi-automated annotation with complete provenance records, as well as personal or community annotation with access authorization and control.
science
annotation
tools
scholarlycommunication
28 days ago by rybesh
ImageOptim — make websites and apps load faster (Mac app)
4 weeks ago by rybesh
ImageOptim optimizes images — so they take up less disk space and load faster — by finding best compression parameters and by removing unnecessary comments and color profiles. It handles PNG, JPEG and GIF animations.
web
image
tools
4 weeks ago by rybesh
Modern Web Development
5 weeks ago by rybesh
Great deep dive into the WebKit developer tools: #webinf #web #development #tools #html #css #javascript
css
web
html
tools
development
webinf
javascript
from twitter
5 weeks ago by rybesh
dotjs — hack the web
6 weeks ago by rybesh
dotjs is a Google Chrome extension that executes JavaScript files in ~/.js based on their filename.
jquery
interface
tools
6 weeks ago by rybesh
SaltwaterC/http-get
7 weeks ago by rybesh
Simple to use node.js HTTP / HTTPS client for fetching remote resources. Supports transparent gzip decoding via gzbz2.
The client sends GET requests for fetching the remote objects. You may send HEAD requests if you just need to check the availability of a remote resource. The error reporting is implemented with care. The module itself is used in production for background data processing of thousands of remote resources, therefore it is not your average HTTP / HTTPS node.js client. It is in use for both of the transfer modes: buffered responses or streamed to the disk responses. Most of the decisions that made their way into the http-get are based onto the experience of working with a large URL database where a lot of things can go wrong.
nodejs
http
tools
The client sends GET requests for fetching the remote objects. You may send HEAD requests if you just need to check the availability of a remote resource. The error reporting is implemented with care. The module itself is used in production for background data processing of thousands of remote resources, therefore it is not your average HTTP / HTTPS node.js client. It is in use for both of the transfer modes: buffered responses or streamed to the disk responses. Most of the decisions that made their way into the http-get are based onto the experience of working with a large URL database where a lot of things can go wrong.
7 weeks ago by rybesh
DITA For Publishers | Free software downloads at SourceForge.net
8 weeks ago by rybesh
Provides general-purpose DITA map, topic, and domain specializations for the Publishing industry, as well as supporting processors (e.g., DITA Open Toolkit plugins).
publishing
xml
tools
8 weeks ago by rybesh
Apache Stanbol - Welcome to Apache Stanbol (incubating)
8 weeks ago by rybesh
Apache Stanbol (currently in incubation) is an open source modular software stack and reusable set of components for semantic content management.
Apache Stanbol components are meant to be accessed over RESTful interfaces to provide semantic services for content management. Thus, one application is to extend traditional content management systems with (internal or external) semantic services.
nlp
semweb
CMS
tools
editorsnotes
Apache Stanbol components are meant to be accessed over RESTful interfaces to provide semantic services for content management. Thus, one application is to extend traditional content management systems with (internal or external) semantic services.
8 weeks ago by rybesh
RDFa 1.1 Distiller and Parser
9 weeks ago by rybesh
This version accompanies the developement of RDFa 1.1 Core. As that document is not final yet, this service, and the underlying code, will change frequently until the development of RDFa 1.1 is finalized. The implementation may actually run ahead of the “official” version and implement the version in the editors’ draft already… Also, the package available for download may be out of sync with the code running this service.
rdfa
tools
webinfo
9 weeks ago by rybesh
timjurka/RTextTools
11 weeks ago by rybesh
RTextTools is a free, open source machine learning package for automatic text classification that makes it simple for both novice and advanced users to get started with supervised learning. The package includes nine algorithms for ensemble classification (svm, slda, boosting, bagging, random forests, glmnet, decision trees, neural networks, maximum entropy), comprehensive analytics, and thorough documentation.
textanalysis
classification
tools
research
11 weeks ago by rybesh
Octopress
11 weeks ago by rybesh
Octopress is a framework designed by Brandon Mathis for Jekyll, the blog aware static site generator powering Github Pages. To start blogging with Jekyll, you have to write your own HTML templates, CSS, Javascripts and set up your configuration. But with Octopress All of that is already taken care of. Simply clone or fork Octopress, install dependencies and the theme, and you’re set.
blog
tools
11 weeks ago by rybesh
An HTML Notebook IPython — IPython 0.13.dev documentation
11 weeks ago by rybesh
The IPython Notebook consists of two related components:
An JSON based Notebook document format for recording and distributing Python code and rich text.
A web-based user interface for authoring and running notebook documents.
python
programming
teaching
tools
An JSON based Notebook document format for recording and distributing Python code and rich text.
A web-based user interface for authoring and running notebook documents.
11 weeks ago by rybesh
alimanfoo/petl
12 weeks ago by rybesh
petl is a tentative Python module for extracting, transforming and loading tables of data.
python
data
tools
12 weeks ago by rybesh
ivarch.com: Pipe Viewer
12 weeks ago by rybesh
pv - Pipe Viewer - is a terminal-based tool for monitoring the progress of data through a pipeline. It can be inserted into any normal pipeline between two processes to give a visual indication of how quickly data is passing through, how long it has taken, how near to completion it is, and an estimate of how long it will be until completion.
unix
tools
cli
dataflow
dataprocessing
12 weeks ago by rybesh
Piazza – Ask. Answer. Explore. Whenever.
march 2012 by rybesh
Welcome to Piazza—a place where students can come together to ask, answer, and explore under the guidance of their instructor. It'll save you time, and your students will love using it.
course
discussion
education
teaching
tools
march 2012 by rybesh
Natural Language Software Registry
february 2012 by rybesh
The Natural Language Software Registry (NLSR) is a concise summary of the capabilities and sources of a large amount of natural language processing (NLP) software available to the NLP community. It comprises academic, commercial and proprietary software with specifications and terms on which it can be acquired clearly indicated.
nlp
linguistics
tools
february 2012 by rybesh
any23 - Anything to Triples - Google Project Hosting
february 2012 by rybesh
Anything To Triples (Any23) is a library, a Web service and a set of command line tools for extracting structured data in RDF format from a variety of Web documents.
rdf
semweb
tools
scraping
february 2012 by rybesh
Automatic text analytics using DBpedia and PoolParty – A Live Demo |The Semantic Puzzle
february 2012 by rybesh
Let me show you which steps have to be taken to generate a high-quality text mining application, ready to be used to annotate and to categorize any kind of text or documents covering nearly any domain. With our approach of thesaurus based text mining your documents can also be linked to the world of linked (open) data; enrich your documents with data from the LOD cloud!
webinfo
inls520
semweb
textanalysis
classification
skos
tools
february 2012 by rybesh
Olivier Labs | Jason
february 2012 by rybesh
Jason is a JSON viewer & editor for Mac OS X. It can open local documents as well as download JSON data via HTTP and, in case of invalid data, an error message is presented and the line containing the error is highlighted.
json
tools
february 2012 by rybesh
Blacksmith
january 2012 by rybesh
A static site generator built with Node.js, JSDOM, and Weld.
nodejs
web
tools
blog
january 2012 by rybesh
Mr. Data Converter
january 2012 by rybesh
I will convert your Excel data into one of several web-friendly formats, including HTML, JSON and XML.
data
json
xml
tools
january 2012 by rybesh
mattweber/elasticsearch-mocksolrplugin - GitHub
december 2011 by rybesh
This plugin will allow you to use tools that were built to interact with Solr with ElasticSearch.
solr
search
tools
december 2011 by rybesh
REST service :: CollateX
november 2011 by rybesh
This is the REST service of CollateX. To call it, you can post witness data as specified below and get the collation result back in a number of formats.
collation
editing
digitalhumanities
tools
webservices
november 2011 by rybesh
Britt, Perfetti, Van Dyke, & Gabrys - The Sourcer's Apprentice
november 2011 by rybesh
A tool for document-supported history instruction.
history
education
discourse
tools
november 2011 by rybesh
Open Source - DocumentCloud
november 2011 by rybesh
As we work on DocumentCloud, we're constantly building pieces of infrastructure that could be useful for other organizations that work with similar kinds of data. We're releasing as we go by extracting useful components as standalone open source projects. Please follow our work if you're interested in what lies under the hood.
documentation
annotation
research
journalism
history
tools
november 2011 by rybesh
script-cover - Javascript code coverage detector for web pages in Chrome - Google Project Hosting
october 2011 by rybesh
SCRIPTCOVER is a Chrome extension for Javascript coverage analysis.
It reports which Javascript statements/instructions from internal and external scripts have been executed (and how many times) while a web page loads and then when a user interacts with the page without any user modifications to the code.
javascript
performance
tools
testing
It reports which Javascript statements/instructions from internal and external scripts have been executed (and how many times) while a web page loads and then when a user interacts with the page without any user modifications to the code.
october 2011 by rybesh
Grep the Web
september 2011 by rybesh
Submit a series of strings or patterns and we will show you the urls on which they appear (in rank order).
search
tools
september 2011 by rybesh
Getting Started with Roo
september 2011 by rybesh
RAD tool for Java.
java
development
tools
spring
cloud
september 2011 by rybesh
Closure Tools - Google Code
july 2011 by rybesh
The Closure tools help developers to build rich web applications with JavaScript that is both powerful and efficient.
google
javascript
code
library
tools
july 2011 by rybesh
AKSW : Projects / FOX
july 2011 by rybesh
FOX is a framework that integrates the Linked Data Cloud and makes uses of the diversity of NLP algorithms to extract RDF triples of high accuracy out of NL. In its current version, it integrates and merges the results of Named Entity Recognition, Keyword Extraction and Relation Extraction tools.
semweb
extraction
nlp
tools
ner
july 2011 by rybesh
Scientific workflow management and the Kepler system - Ludäscher - 2005 - Concurrency and Computation: Practice and Experience - Wiley Online Library
june 2011 by rybesh
Many scientific disciplines are now data and information driven, and new scientific knowledge is often gained by scientists putting together data analysis and knowledge discovery ‘pipelines’. A related trend is that more and more scientific communities realize the benefits of sharing their data and computational services, and are thus contributing to a distributed data and computational community infrastructure (a.k.a. ‘the Grid’). However, this infrastructure is only a means to an end and ideally scientists should not be too concerned with its existence. The goal is for scientists to focus on development and use of what we call scientific workflows. These are networks of analytical steps that may involve, e.g., database access and querying steps, data analysis and mining steps, and many other steps including computationally intensive jobs on high-performance cluster computers. In this paper we describe characteristics of and requirements for scientific workflows as identified in a number of our application projects. We then elaborate on Kepler, a particular scientific workflow system, currently under development across a number of scientific data management projects. We describe some key features of Kepler and its underlying Ptolemy II system, planned extensions, and areas of future research. Kepler is a community-driven, open source project, and we always welcome related projects and new contributors to join.
science
dataflow
workflow
tools
june 2011 by rybesh
IEEE Xplore - Scientific workflow systems - can one size fit all?
june 2011 by rybesh
The past decade has witnessed a growing trend in designing and using workflow systems with a focus on supporting the scientific research process in bioinformatics and other areas of life sciences. The aim of these systems is mainly to simplify access, control and orchestration of remote distributed scientific data sets using remote computational resources, such as EBI web services. In this paper we present the state of the art in the field by reviewing six such systems: Discovery Net, Taverna, Triana, Kepler, Yawl and BPEL. We provide a high-level framework for comparing the systems based on their control flow and data flow properties with a view of both informing future research in the area by academic researchers and facilitating the selection of the most appropriate system for a specific application task by practitioners.
science
dataflow
workflow
tools
june 2011 by rybesh
A taxonomy of scientific workflow systems for grid computing
june 2011 by rybesh
With the advent of Grid and application technologies, scientists and engineers are building more and more complex applications to manage and process large data sets, and execute scientific experiments on distributed resources. Such application scenarios require means for composing and executing complex workflows. Therefore, many efforts have been made towards the development of workflow management systems for Grid computing. In this paper, we propose a taxonomy that characterizes and classifies various approaches for building and executing workflows on Grids. The taxonomy not only highlights the design and engineering similarities and differences of state-of-the-art in Grid workflow systems, but also identifies the areas that need further research.
science
dataflow
workflow
tools
june 2011 by rybesh
Scientific workflow system - Wikipedia, the free encyclopedia
june 2011 by rybesh
A Scientific Workflow Systems is a specialized form of a workflow management system designed specifically to compose and execute a series of computational or data manipulation steps, or a workflow, in a scientific application.
science
workflow
dataflow
tools
june 2011 by rybesh
Bioinformatics workflow management systems - Wikipedia, the free encyclopedia
june 2011 by rybesh
A bioinformatics workflow management system is a specialized form of workflow management system designed specifically to compose and execute a series of computational or data manipulation steps, or a workflow, in a specific domain of science, bioinformatics.
science
dataflow
workflow
tools
june 2011 by rybesh
Workflow software survey
june 2011 by rybesh
A survey of existing (Python) libraries, frameworks, and tools for managing semi-automated workflows and pipelines of tasks. There are a number of loose, overlapping terms for these kinds of systems: BPM (business process management), workflow, pipeline, flowchart, state machine, visual programming, dataflow, flow-based programming, dependency management, task management, etc.
dataflow
workflow
pipeline
tools
python
june 2011 by rybesh
LinkedDataSail - GitHub
june 2011 by rybesh
LinkedDataSail gathers RDF data incrementally, dereferencing URIs in response to queries.
semweb
linkeddata
tools
java
june 2011 by rybesh
maui-indexer - Maui - Multi-purpose automatic topic indexing - Google Project Hosting
june 2011 by rybesh
Maui automatically identifies main topics in text documents. Depending on the task, topics are tags, keywords, keyphrases, vocabulary terms, descriptors, index terms or titles of Wikipedia articles.
Maui performs the following tasks:
term assignment with a controlled vocabulary (or thesaurus)
subject indexing
topic indexing with terms from Wikipedia
keyphrase extraction
terminology extraction
automatic tagging
It can also be used for terminology extraction and semi-automatic topic indexing.
indexing
vocabulary
tools
nlp
machinelearning
java
Maui performs the following tasks:
term assignment with a controlled vocabulary (or thesaurus)
subject indexing
topic indexing with terms from Wikipedia
keyphrase extraction
terminology extraction
automatic tagging
It can also be used for terminology extraction and semi-automatic topic indexing.
june 2011 by rybesh
Wikipedia Miner - Home
may 2011 by rybesh
Wikipedia Miner is a toolkit for navigating and making use of the structure and content of Wikipedia. It aims to make it easy for you to integrate Wikipedia's knowledge into your own applications, by:
providing simplified, object-oriented access to Wikipedia's structure and content.
measuring how terms and concepts in Wikipedia are connected to each other.
detecting and disambiguating Wikipedia topics when they are mentioned in documents.
wikipedia
textmining
nlp
webservices
tools
datamining
providing simplified, object-oriented access to Wikipedia's structure and content.
measuring how terms and concepts in Wikipedia are connected to each other.
detecting and disambiguating Wikipedia topics when they are mentioned in documents.
may 2011 by rybesh
CRAN - Package SPARQL
may 2011 by rybesh
Load SPARQL result table from an end-point as a data.frame
sparql
R
tools
statistics
visualization
RDF
may 2011 by rybesh
PDFMiner
may 2011 by rybesh
PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes than text analysis.
pdf
python
tools
may 2011 by rybesh
Wolfram|Alpha Blog : Must-Have Genealogy Tools from Wolfram|Alpha
april 2011 by rybesh
Wolfram|Alpha is a powerful tool for finding information about the universe at large, but sometimes we are interested in a much smaller universe: our families. Genealogical research is an increasingly popular hobby, and one which Wolfram|Alpha can make easier using features across several of its subject areas.
genealogy
tools
digitalhumanities
april 2011 by rybesh
Languages - Accentuate.us - Really Easy Computer Input
april 2011 by rybesh
Accentuate.us uses statistics to predict where special characters are needed on a language-by-language basis.
language
input
python
tools
webservices
api
machinelearning
april 2011 by rybesh
OData Validator
april 2011 by rybesh
The goal of this tool is to enable OData service authors to validate their implementation against the OData specification to ensure the service interoperates well with any OData client.
odata
tools
validation
testing
april 2011 by rybesh
main.py
april 2011 by rybesh
Pycco is a Python port of Docco: the original quick-and-dirty, hundred-line-long, literate-programming-style documentation generator. It produces HTML that displays your comments alongside your code. Comments are passed through Markdown, and code is passed through Pygments syntax highlighting.
python
documentation
tools
april 2011 by rybesh
txt2re: headache relief for programmers :: regular expression generator
april 2011 by rybesh
This is incredibly awesome.
regex
tools
april 2011 by rybesh
Analyzing Social Media Networks: Learning by Doing with NodeXL
april 2011 by rybesh
The NodeXL Template for Microsoft Excel 2007 is a free and open source extension to the widely used spreadsheet application that provides a range of basic network analysis and visualization features. NodeXL uses a highly structured workbook template that includes multiple worksheets to store all the information needed to represent a network graph. Network relationships (i.e., graph edges) are represented as an “edge list”, which contains all pairs of vertices that are connected in the network. Other worksheets contain information about each vertex (i.e., node) and cluster. Visualization features allow users to display a range of network graph representations and map data attributes to visual properties including shape, color, size, transparency, and location.
social
networks
analysis
tools
april 2011 by rybesh
Underscore.js
april 2011 by rybesh
Underscore is a utility-belt library for JavaScript that provides a lot of the functional programming support that you would expect in Prototype.js (or Ruby), but without extending any of the built-in JavaScript objects. It's the tie to go along with jQuery's tux.
javascript
tools
functional
april 2011 by rybesh
TiMBL: Tilburg Memory-Based Learner
april 2011 by rybesh
TiMBL is an open source software package implementing several memory-based learning algorithms, among which IB1-IG, an implementation of k-nearest neighbor classification with feature weighting suitable for symbolic feature spaces, and IGTree, a decision-tree approximation of IB1-IG. All implemented algorithms have in common that they store some representation of the training set explicitly in memory. During testing, new cases are classified by extrapolation from the most similar stored cases.
For the past decade, TiMBL has been mostly used in natural language processing as a machine learning classifier component, but its use extends to virtually any supervised machine learning domain. Due to its particular decision-tree-based implementation, TiMBL is in many cases far more efficient in classification than a standard k-nearest neighbor algorithm would be.
nlp
machinelearning
tools
For the past decade, TiMBL has been mostly used in natural language processing as a machine learning classifier component, but its use extends to virtually any supervised machine learning domain. Due to its particular decision-tree-based implementation, TiMBL is in many cases far more efficient in classification than a standard k-nearest neighbor algorithm would be.
april 2011 by rybesh
Reverted Indexing
march 2011 by rybesh
Traditional interactive information retrieval systems function by creating inverted lists, or term indexes. For every term in the vocabulary, a list is created that contains the documents in which that term occurs and its frequency within each document. Retrieval algorithms then use these term frequencies alongside other collection statistics to identify matching documents for a query.
Term-based search, however, is just one example of interactive information seeking. Other examples include offering suggestions of documents similar to ones already found, or identifying effective query expansion terms that the user might wish to use. More generally, these fall into several categories: query term suggestion, relevance feedback, and pseudo-relevance feedback.
We can combine the inverted index with the notion of retrievability to create an efficient query expansion algorithm that is useful for a number of applications, such as query expansion and relevance (and pseudo-relevance) feedback. We call this kind of index a reverted index because rather than mapping terms onto documents, it maps document ids onto queries that retrieved the associated documents.
IR
tools
search
lucene
Term-based search, however, is just one example of interactive information seeking. Other examples include offering suggestions of documents similar to ones already found, or identifying effective query expansion terms that the user might wish to use. More generally, these fall into several categories: query term suggestion, relevance feedback, and pseudo-relevance feedback.
We can combine the inverted index with the notion of retrievability to create an efficient query expansion algorithm that is useful for a number of applications, such as query expansion and relevance (and pseudo-relevance) feedback. We call this kind of index a reverted index because rather than mapping terms onto documents, it maps document ids onto queries that retrieved the associated documents.
march 2011 by rybesh
Data Science Toolkit
march 2011 by rybesh
A collection of the best open data sets and open-source tools for data science, wrapped in an easy-to-use REST/JSON API with command line, Python and Javascript interfaces. Available as a self-contained VM or EC2 AMI that you can deploy yourself.
data
tools
nlp
ec2
webservices
march 2011 by rybesh
Green Unicorn - Welcome
march 2011 by rybesh
Gunicorn 'Green Unicorn' is a Python WSGI HTTP Server for UNIX. It's a pre-fork worker model ported from Ruby's Unicorn project. The Gunicorn server is broadly compatible with various web frameworks, simply implemented, light on server resources, and fairly speedy.
python
web
tools
django
march 2011 by rybesh
Dotspotting
march 2011 by rybesh
Getting dots on maps, without legacy code or any baggage. Just that, to start. Dots on maps.
maps
infoviz
tools
march 2011 by rybesh
Lippmannian Device
march 2011 by rybesh
Lippmannian device is named after Lippmann, and provides a coarse means of showing actor partisanship.
research
tools
analysis
nlp
rhetoric
march 2011 by rybesh
CorporaCamp
march 2011 by rybesh
The tool we’ve built is code-named Woodchipper. It allows the user to search and select text from participating collections and display them as a visualization which shows relationships among texts.
digitalhumanities
tools
infoviz
corpora
march 2011 by rybesh
RStudio
march 2011 by rybesh
RStudio™ is a new integrated development environment (IDE) for R. RStudio combines an intuitive user interface with powerful coding tools to help you get the most out of R.
statistics
tools
march 2011 by rybesh
Deducer - A graphical data analysis system for use with JGR - RForge.net
february 2011 by rybesh
An intuitive, cross-platform graphical data analysis system. It uses menus and dialogs to guide the user efficiently through the data manipulation and analysis process, and has an excel like spreadsheet for easy data frame visualization and editing.
R
statistics
tools
february 2011 by rybesh
ToolDatabase < Dmi
february 2011 by rybesh
List of tools for doing research into the "natively digital".
web
research
tools
digitalhumanities
february 2011 by rybesh
Finding trending topics using Google Books n-grams data and Apache Hive on Elastic MapReduce : Articles & Tutorials : Amazon Web Services
february 2011 by rybesh
Finding trending topics using Google Books n-grams data and Apache Hive on Elastic MapReduce
hadoop
digitalhumanities
amazon
cloud
howto
textmining
tools
february 2011 by rybesh
elasticsearch - - Open Source, Distributed, RESTful, Search Engine
february 2011 by rybesh
It is an Open Source (Apache 2), Distributed, RESTful, Search Engine built on top of Lucene.
search
ir
tools
rest
java
json
february 2011 by rybesh
Django Packages : Wikis
february 2011 by rybesh
A comparison of various Django-based wiki applications and projects.
django
wiki
tools
february 2011 by rybesh
Beyond the PDF
january 2011 by rybesh
The goal of the workshop was not to produce a white paper! Rather it was to identify a set of requirements, and a group of willing participants to develop a mandate, open source code and a set of deliverables to be used by scholars to accelerate data and knowledge sharing and discovery . Our starting point, and the only prerequisite to participating, was the belief that we need to move Beyond the PDF (meant to capture a common philosophy, not necessarily to be taken literally).
In a heady moment we might also describe our efforts as the desire to contribute to the development of a free and open digital printing press for the 21st century. A platform, when utilized, moves us beyond a static and disparate data and knowledge representation to a rich integrated content which grows and changes the more we learn. A system (content plus platform) from which a scholar can interact and once evaluated shows improved understanding and interest.
publishing
data
scholarship
tools
KR
In a heady moment we might also describe our efforts as the desire to contribute to the development of a free and open digital printing press for the 21st century. A platform, when utilized, moves us beyond a static and disparate data and knowledge representation to a rich integrated content which grows and changes the more we learn. A system (content plus platform) from which a scholar can interact and once evaluated shows improved understanding and interest.
january 2011 by rybesh
TemaTres Vocabulary Server | The way to manage formal representations of knowledge
january 2011 by rybesh
TemaTres is an open source vocabulary server, web application to manage and exploit vocabularies, thesauri, taxonomies and formal representations of knowledge.
vocabulary
tools
inls520
january 2011 by rybesh
#grid
january 2011 by rybesh
It inserts a layout grid in web pages, allows you to hold it in place, and toggle between displaying it in the foreground or background.
css
design
grid
javascript
tools
january 2011 by rybesh
Modular toolkit for Data Processing (MDP)
december 2010 by rybesh
Modular toolkit for Data Processing (MDP) is a Python data processing framework.
From the user's perspective, MDP is a collection of supervised and unsupervised learning algorithms and other data processing units that can be combined into data processing sequences and more complex feed-forward network architectures.
datamining
machinelearning
python
tools
From the user's perspective, MDP is a collection of supervised and unsupervised learning algorithms and other data processing units that can be combined into data processing sequences and more complex feed-forward network architectures.
december 2010 by rybesh
Stanford CoreNLP
december 2010 by rybesh
Stanford CoreNLP provides a set of natural language analysis tools which can take raw English language text input and give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, and mark up the structure of sentences in terms of phrases and word dependencies, and indicate which noun phrases refer to the same entities. It provides the foundational building blocks for higher level text understanding applications.
Stanford CoreNLP integrates all our NLP tools for the English language, including the part-of-speech (POS) tagger, the named entity recognizer (NER), the parser, and the coreference resolution system. The goal of this project is to enable people to quickly and painlessly get complete linguistic annotations of natural language texts. It is designed to be highly flexible and extensible, i.e., with a single option you can change which tools should be enabled and which should be disabled.
nlp
research
tools
java
nlproc
Stanford CoreNLP integrates all our NLP tools for the English language, including the part-of-speech (POS) tagger, the named entity recognizer (NER), the parser, and the coreference resolution system. The goal of this project is to enable people to quickly and painlessly get complete linguistic annotations of natural language texts. It is designed to be highly flexible and extensible, i.e., with a single option you can change which tools should be enabled and which should be disabled.
december 2010 by rybesh
Maperitive
november 2010 by rybesh
Maperitive is a FREE desktop application for drawing maps based on OpenStreetMap and GPS data. You can define what gets on the map and how it is painted. You can also export these maps into bitmaps and SVG files and print them.
svg
maps
cartography
locative
tools
november 2010 by rybesh
tm - Text Mining Package
october 2010 by rybesh
tm (shorthand for Text Mining Infrastructure in R) provides a framework for text mining applications within R.
The tm package offers functionality for managing text documents, abstracts the process of document manipulation and eases the usage of heterogeneous text formats in R. The package has integrated database backend support to minimize memory demands. An advanced meta data management is implemented for collections of text documents to alleviate the usage of large and with meta data enriched document sets.
R
textmining
datamining
nlp
tools
statistics
The tm package offers functionality for managing text documents, abstracts the process of document manipulation and eases the usage of heterogeneous text formats in R. The package has integrated database backend support to minimize memory demands. An advanced meta data management is implemented for collections of text documents to alleviate the usage of large and with meta data enriched document sets.
october 2010 by rybesh
FlexPaper - the open source document viewer solution for pdf, doc, ..
october 2010 by rybesh
FlexPaper displays documents in your favorite browser using flash. Its way of reusing display containers makes it possible to view large documents and books.
pdf
flex
flash
tools
interface
web
october 2010 by rybesh
GEOLocate - Software for Georeferencing Natural History Data
october 2010 by rybesh
The GEOLocate project is an effort to develop software and services for translating textual locality descriptions associated with biodiversity collections data into geographic coordinates.
locative
tools
georeferencing
nlp
october 2010 by rybesh
MALLET homepage
october 2010 by rybesh
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
datamining
java
machinelearning
nlp
tools
october 2010 by rybesh
Online Stickies - lino
october 2010 by rybesh
lino is a free sticky & canvas service that requires nothing but a Web browser.
collaboration
tools
infoviz
october 2010 by rybesh
Timeline « HyperStudio – Digital Humanities at MIT
september 2010 by rybesh
Chronos Timeline is designed specifically for needs in the humanities and social sciences to represent time-based data. Chronos allows scholars and students to dynamically present historical data in a flexible online environment. Switching easily between vertical and horizontal orientations, researchers can quickly scan large number of events, highlight and filter events based on subject matter or tags, and recontextualize historical data.
timeline
infoviz
tools
jquery
events
september 2010 by rybesh
Online javascript beautifier
september 2010 by rybesh
This little beautifier will reformat and reindent bookmarklets, ugly javascript, unpack scripts packed by the popular Dean Edward's packer, as well as deobfuscate scripts processed by javascriptobfuscator.com.
javascript
tools
september 2010 by rybesh
TileStache
august 2010 by rybesh
TileStache is a Python-based server application that can serve up map tiles based on rendered geographic data.
geo
gis
framework
tools
locative
august 2010 by rybesh
Journal of Statistical Software — Show
august 2010 by rybesh
This user guide describes a Python package, PyMC, that allows users to efficiently code a probabilistic model and draw samples from its posterior distribution using Markov chain Monte Carlo techniques.
statistics
tools
python
august 2010 by rybesh
related tags
3d ⊕ academia ⊕ actionscript ⊕ advertising ⊕ aesthetics ⊕ ai ⊕ ajax ⊕ amazon ⊕ analysis ⊕ analytics ⊕ anime ⊕ annotation ⊕ AP170 ⊕ api ⊕ architecture ⊕ archives ⊕ arctic ⊕ argumentation ⊕ art ⊕ atom ⊕ attention ⊕ audio ⊕ authoring ⊕ authority ⊕ automation ⊕ backup ⊕ bayes ⊕ berkeley ⊕ bibliography ⊕ biblography ⊕ biography ⊕ blog ⊕ books ⊕ bugtracking ⊕ c++ ⊕ cache ⊕ camera ⊕ cartography ⊕ cataloging ⊕ charts ⊕ cinema ⊕ citation ⊕ citations ⊕ classification ⊕ cli ⊕ clipart ⊕ cloud ⊕ CMS ⊕ code ⊕ coffeescript ⊕ cogsci ⊕ collaboration ⊕ collation ⊕ collectiveaction ⊕ color ⊕ comics ⊕ commercial ⊕ commons ⊕ communication ⊕ community ⊕ computervision ⊕ concepts ⊕ conference ⊕ consumer ⊕ content ⊕ contentanalysis ⊕ convergence ⊕ corpora ⊕ course ⊕ creative ⊕ creativity ⊕ cs294project ⊕ css ⊕ culture ⊕ custom ⊕ dam ⊕ data ⊕ database ⊕ dataflow ⊕ datamining ⊕ dataprocessing ⊕ datawrangling ⊕ debugging ⊕ decisionmaking ⊕ delivery ⊕ deployment ⊕ design ⊕ development ⊕ digital ⊕ digitalcamera ⊕ digitalhumanities ⊕ digitalyouth ⊕ discourse ⊕ discussion ⊕ display ⊕ dissertation ⊕ distributed ⊕ django ⊕ documentary ⊕ documentation ⊕ documents ⊕ dom ⊕ drawing ⊕ drm ⊕ drupal ⊕ ec2 ⊕ economics ⊕ editing ⊕ editorsnotes ⊕ education ⊕ effects ⊕ election ⊕ emacs ⊕ entitydetection ⊕ entityrecognition ⊕ erlang ⊕ events ⊕ exercise ⊕ expertise ⊕ extraction ⊕ fans ⊕ fiction ⊕ filesystem ⊕ firefox ⊕ flash ⊕ flex ⊕ flickr ⊕ framework ⊕ functional ⊕ future ⊕ game ⊕ games ⊕ genealogy ⊕ geo ⊕ geocoding ⊕ georeferencing ⊕ gis ⊕ google ⊕ government ⊕ graphics ⊕ graphs ⊕ grid ⊕ hadoop ⊕ hardware ⊕ HCI ⊕ history ⊕ hosting ⊕ howto ⊕ html ⊕ html5 ⊕ http ⊕ humanities ⊕ hypermedia ⊕ hypertext ⊕ i-school ⊕ i18n ⊕ ideas ⊕ identity ⊕ IM ⊕ image ⊕ imagination ⊕ indexing ⊕ indiana ⊕ inference ⊕ information ⊕ infoviz ⊕ inls520 ⊕ input ⊕ interactive ⊕ interface ⊕ intermediation ⊕ internet ⊕ interpretation ⊕ ir ⊕ japan ⊕ java ⊕ javascript ⊕ jitter ⊕ journalism ⊕ jquery ⊕ json ⊕ key-value ⊕ kr ⊕ labs ⊕ language ⊕ latex ⊕ library ⊕ life ⊕ linguistics ⊕ linkeddata ⊕ lisp ⊕ local ⊕ locative ⊕ logic ⊕ logreg ⊕ logs ⊕ lucene ⊕ lyrics ⊕ machinelearning ⊕ machinima ⊕ management ⊕ mapreduce ⊕ maps ⊕ marketing ⊕ math ⊕ matlab ⊕ max/msp ⊕ maxent ⊕ media ⊕ memory ⊕ messaging ⊕ metadata ⊕ methods ⊕ migration ⊕ mit ⊕ mobile ⊕ modeling ⊕ mono ⊕ mp3 ⊕ mpeg-4 ⊕ mpeg-7 ⊕ mpeg-21 ⊕ msmdx ⊕ multimedia ⊕ museum ⊕ music ⊕ narrative ⊕ ner ⊕ network ⊕ networking ⊕ networks ⊕ newmedia ⊕ news ⊕ nlp ⊕ nlproc ⊕ nltk ⊕ nodejs ⊕ ocaml ⊕ odata ⊕ ontology ⊕ opendata ⊕ opensource ⊕ opinion ⊕ organization ⊕ osx ⊕ owl ⊕ p2p ⊕ packaging ⊕ parsing ⊕ pdf ⊕ people ⊕ performance ⊕ perl ⊕ philosophy ⊕ photography ⊕ php ⊕ pipeline ⊕ planning ⊕ platform ⊕ playlist ⊕ plugin ⊕ podcast ⊕ postgresql ⊕ presentation ⊕ process ⊕ programming ⊕ prolog ⊕ proxy ⊕ public ⊕ publicsphere ⊕ publishing ⊕ python ⊕ qa ⊕ qualitative ⊕ quality ⊕ quantitative ⊕ quicktime ⊕ quote ⊕ R ⊕ radio ⊕ rails ⊕ rdf ⊕ rdfa ⊕ readme ⊕ reasoner ⊕ reasoning ⊕ recognition ⊕ reference ⊕ regex ⊕ registry ⊕ relaxng ⊕ remix ⊕ research ⊕ rest ⊕ reviews ⊕ rhetoric ⊕ ruby ⊕ s3 ⊕ schematron ⊕ scholarlycommunication ⊕ scholarship ⊕ science ⊕ scraping ⊕ screencast ⊕ search ⊕ searchengine ⊕ security ⊕ semiotics ⊕ semweb ⊕ sfbayarea ⊕ sharing ⊕ simulation ⊕ skos ⊕ smil ⊕ sms ⊕ social ⊕ socialscience ⊕ sociology ⊕ software ⊕ SoI ⊕ solr ⊕ sources ⊕ sparql ⊕ spreadsheet ⊕ spring ⊕ standards ⊕ stanford ⊕ statistics ⊕ storage ⊕ storyboard ⊕ strategy ⊕ streaming ⊕ style ⊕ subtitle ⊕ svg ⊕ swrl ⊕ syllabus ⊕ symbian ⊕ syndication ⊕ sysadmin ⊕ tagging ⊕ tangible ⊕ teaching ⊕ technology ⊕ teenagers ⊕ tei ⊕ telephony ⊕ temporality ⊕ testing ⊕ textanalysis ⊕ textmining ⊕ theory ⊕ time ⊕ timeline ⊕ timetags ⊕ tools ⊖ toys ⊕ transcription ⊕ travel ⊕ triplestore ⊕ trust ⊕ tv ⊕ typography ⊕ ubicomp ⊕ ui ⊕ unix ⊕ unmediated ⊕ urban ⊕ usability ⊕ validation ⊕ versioning ⊕ video ⊕ vim ⊕ virtualreality ⊕ visualization ⊕ vj ⊕ vocabulary ⊕ voice ⊕ voip ⊕ web ⊕ web2.0 ⊕ webinf ⊕ webinfo ⊕ webpub ⊕ webserver ⊕ webservices ⊕ website ⊕ wiki ⊕ wikipedia ⊕ windows ⊕ wireless ⊕ wishlist ⊕ workflow ⊕ writing ⊕ xml ⊕ xpath ⊕ xquery ⊕ xslt ⊕ yahoo ⊕ YRB ⊕ yui ⊕Copy this bookmark: