Introduction to text manipulation on UNIX-based systems
10 weeks ago by sweeney
This introduction to text manipulation on UNIX platforms gives an overview of common commands that are installed as standard on most UNIX-based releases. These standard utilities are often ignored in favor of more modern text-processing languages such as Perl, Python, or Ruby, which are not always installed on a system. An introductory review of these tools helps practitioners who are learning UNIX or Linux, as well as those looking to refresh forgotten knowledge.
linux
shell
text
Extractomatic
january 2010 by sweeney
A very simple web-based API that detects and extracts the main content from a web page, removing clutter such as headers, footers, and advertising. I guess it’s somewhat similar to Readability or Instapaper, but better suited to building into your own applications. Watch this space.
web
api
extract
text
html
Sphinx - Free open-source SQL full-text search engine
march 2009 by sweeney
Sphinx documentation
php
mysql
search
text
engine
sphinx
manual