rybesh + python   242

mimeparse - Basic functions for handling mime-types in Erlang, JavaScript, Perl, PHP, Python, Ruby, Java - Google Project Hosting
This module provides basic functions for parsing mime-type names and matching them against a list of media-ranges.

See section 14.1 of RFC 2616 (the HTTP specification) for a complete explanation.
http  mime  python  javascript 
2 days ago by rybesh
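The matching described in this entry can be sketched in a few lines of plain Python. This is a minimal illustration of media-range matching in the spirit of RFC 2616 section 14.1, not the mimeparse module's own code; the function names `parse_media_range` and `quality` are chosen here for illustration, and the sketch assumes that the most specific matching range decides the q-value.

```python
def parse_media_range(media_range):
    """Split 'type/subtype;q=0.5' into (type, subtype, params)."""
    parts = [p.strip() for p in media_range.split(";")]
    full_type = parts[0] or "*/*"
    if full_type == "*":
        full_type = "*/*"
    main_type, _, subtype = full_type.partition("/")
    params = {}
    for part in parts[1:]:
        key, sep, value = part.partition("=")
        if sep:
            params[key.strip()] = value.strip()
    # A missing or malformed q-value is treated as 1.0 (an assumption of this sketch).
    try:
        q = float(params.get("q", 1))
    except ValueError:
        q = 1.0
    params["q"] = q if 0 <= q <= 1 else 1.0
    return main_type.strip(), subtype.strip(), params


def quality(mime_type, accept_header):
    """Return the q-value of the most specific media-range matching mime_type."""
    target_type, target_subtype, _ = parse_media_range(mime_type)
    best_specificity, best_q = -1, 0.0
    for media_range in accept_header.split(","):
        r_type, r_subtype, params = parse_media_range(media_range)
        type_ok = r_type in (target_type, "*")
        subtype_ok = r_subtype in (target_subtype, "*")
        if type_ok and subtype_ok:
            # Count the non-wildcard parts: text/html (2) beats text/* (1) beats */* (0).
            specificity = (r_type != "*") + (r_subtype != "*")
            if specificity > best_specificity:
                best_specificity, best_q = specificity, params["q"]
    return best_q
```

A server could call `quality()` once per representation it can produce and serve the one with the highest score; a result of 0.0 means no range in the header matched.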
Welcome | Flask (A Python Microframework)
Flask is a microframework for Python based on Werkzeug, Jinja 2 and good intentions.
python  web  framework 
10 days ago by rybesh
gutenbergToTei.py
Reformats and renames etexts downloaded from Project Gutenberg.
tei  python  digitalhumanities 
8 weeks ago by rybesh
An HTML Notebook IPython — IPython 0.13.dev documentation
The IPython Notebook consists of two related components:

A JSON-based Notebook document format for recording and distributing Python code and rich text.
A web-based user interface for authoring and running notebook documents.
python  programming  teaching  tools 
11 weeks ago by rybesh
alimanfoo/petl
petl is a tentative Python module for extracting, transforming and loading tables of data.
python  data  tools 
12 weeks ago by rybesh
web2py Web Framework
Free open source full-stack framework for rapid development of fast, scalable, secure and portable database-driven web-based applications. Written and programmable in Python.
python  web  framework 
12 weeks ago by rybesh
wcauchois / pysvmlight / overview — Bitbucket
A Python binding to the popular "SVM-Light" support vector machine library.
svm  machinelearning  python 
12 weeks ago by rybesh
splitta - statistical sentence boundary detection
Sentence tokenizer written in python. Includes proper tokenization and models for very high accuracy sentence boundary detection (English only for now). The models are trained from Wall Street Journal news combined with the Brown Corpus which is intended to be widely representative of written English. Error rates on test news data are near 0.25%.
nlp  python 
january 2012 by rybesh
The Hitchhiker’s Guide to Python! — pythonguide 0.0.1 documentation
This opinionated guide exists to provide both novice and expert Python developers a best-practice handbook to the installation, configuration, and usage of Python on a daily basis.
python  reference 
january 2012 by rybesh
chromium-compact-language-detector - C++ library and Python bindings for detecting language from UTF8 text, extracted from the Chromium browser - Google Project Hosting
This is a straight port from the CLD (Compact Language Detector) library embedded in Google's Chromium browser. The library detects the language from provided UTF8 text (plain text or HTML). It's implemented in C++, with very basic Python bindings.
language  detection  nlp  python 
november 2011 by rybesh
python-patch - patch utility in python - Google Project Hosting
Cross-platform alternative to the Unix patch utility, capable of applying unified diffs.
python  diff 
october 2011 by rybesh
Requests: HTTP for Humans — Requests 0.7.4 documentation
Requests is an elegant and simple HTTP library for Python, built for human beings.
python  http 
october 2011 by rybesh
gevent: A coroutine-based network library for Python
gevent is a coroutine-based Python networking library that uses greenlet to provide a high-level synchronous API on top of the libevent event loop.
python  networking  async 
october 2011 by rybesh
Hyde - A Python Static Website Generator
Hyde is a static website generator powered by Python & Django. Hyde supports all the Django template tags & filters and even has a few of its own. The built-in web server + auto-generator provide instant refresh and unlimited flexibility.
python  django 
october 2011 by rybesh
digitalbazaar/pyld - GitHub
JSON-LD processor written in Python.
json  linkeddata  python 
october 2011 by rybesh
Pylons Project : Pyramid : About
Pyramid is a very general open source Python web framework. As a framework, its primary job is to make it easier for a developer to create an arbitrary web application. The type of application being created isn’t really important; it could be a spreadsheet, a corporate intranet, or an “oh-so-Web-2.0” social networking platform. Pyramid is general enough that it can be used in a wide variety of circumstances.
python  web  framework 
september 2011 by rybesh
jgorset/facepy - GitHub
Facepy is a client for Facebook's Graph API that doesn't suck.
facebook  python  api  webservices 
august 2011 by rybesh
pandas: a python data analysis library — pandas v0.4.0dev documentation
pandas is a python package providing convenient data structures for time series, cross-sectional, or any other form of “labeled” data, with tools for building statistical and econometric models.
python  statistics  dataprocessing  analysis 
august 2011 by rybesh
Bulbflow: a New Python Framework for Graph Databases
Bulbs is an open-source Python persistence framework for graph databases and the first piece of a larger Web-development toolkit that will be released in the upcoming weeks.

It’s like an ORM for graphs, but instead of SQL, you use the graph-traversal language Gremlin to query the database.

You can use it to connect to any Blueprints-enabled database, including TinkerGraph, Neo4j, OrientDB, Dex, and OpenRDF (and there is an InfiniteGraph implementation in development).

You can use Bulbs from within any Python Web-development framework, including Flask, Pyramid, and Django.
python  graph  database  framework 
july 2011 by rybesh
urschrei/pyzotero - GitHub
This is a first pass at implementing a Python wrapper for the Zotero API.
python  zotero  webservices 
june 2011 by rybesh
Akara
Akara is a platform for developing data services available on the Web, using REST architecture. Akara is open source software written in Python and C.
python  rest  webservices  pipeline  composition 
june 2011 by rybesh
Workflow software survey
A survey of existing (Python) libraries, frameworks, and tools for managing semi-automated workflows and pipelines of tasks. There are a number of loose, overlapping terms for these kinds of systems: BPM (business process management), workflow, pipeline, flowchart, state machine, visual programming, dataflow, flow-based programming, dependency management, task management, etc.
dataflow  workflow  pipeline  tools  python 
june 2011 by rybesh
difflib — SequenceMatcher
SequenceMatcher is a flexible class for comparing pairs of sequences of any type, so long as the sequence elements are hashable.
python  textanalysis 
may 2011 by rybesh
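A short sketch of what SequenceMatcher offers, using only the standard library. The helper names `similarity` and `common_blocks` are made up for this example; `ratio()` and `get_matching_blocks()` are the difflib calls doing the work.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Return a score in [0, 1]: 2 * matches / (len(a) + len(b))."""
    return SequenceMatcher(None, a, b).ratio()


def common_blocks(a, b):
    """Return the matching slices of a, in order, skipping the empty sentinel block."""
    matcher = SequenceMatcher(None, a, b)
    return [a[m.a:m.a + m.size] for m in matcher.get_matching_blocks() if m.size]
```

Because the elements only need to be hashable, the same helpers work on lists of words or tokens as well as on strings.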
Python Package Index : python-Levenshtein 0.10.2
Python extension computing string distances and similarities.
python  textanalysis  search 
may 2011 by rybesh
Store 'virtuoso'
The Virtuoso Redland RDF Provider is an implementation of the Storage API, Model and Query interfaces of the Redland framework for RDF. This provider enables the execution of queries via the Redland Rasqal query engine or via Virtuoso query engine directly against the Virtuoso OpenSource Quad Store.
rdf  database  python  api 
may 2011 by rybesh
PDFMiner
PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes than text analysis.
pdf  python  tools 
may 2011 by rybesh
imaplib - IMAP4 client library - Python Module of the Week
imaplib implements a client for communicating with Internet Message Access Protocol (IMAP) version 4 servers. The IMAP protocol defines a set of commands sent to the server and the responses delivered back to the client. Most of the commands are available as methods of the IMAP4 object used to communicate with the server.
python  imap  reference 
april 2011 by rybesh
HTML/CSS to PDF converter written in Python - HTML2PDF Converter
XHTML2PDF is a converter for HTML/XHTML and CSS to PDF and a Python package.
css  html  pdf  python  django 
april 2011 by rybesh
Using Subdomains in Django « ThingsILearned
Django’s url dispatcher doesn’t handle subdomains.  If you want/need to use subdomains you’ve got to use session middleware.
django  python  howto  urls 
april 2011 by rybesh
Languages - Accentuate.us - Really Easy Computer Input
Accentuate.us uses statistics to predict where special characters are needed on a language-by-language basis.
language  input  python  tools  webservices  api  machinelearning 
april 2011 by rybesh
main.py
Pycco is a Python port of Docco: the original quick-and-dirty, hundred-line-long, literate-programming-style documentation generator. It produces HTML that displays your comments alongside your code. Comments are passed through Markdown, and code is passed through Pygments syntax highlighting.
python  documentation  tools 
april 2011 by rybesh
brainsik/virtualenv-burrito - GitHub
With one command, have a working virtualenv + virtualenvwrapper environment.
python  development 
march 2011 by rybesh
Green Unicorn - Welcome
Gunicorn 'Green Unicorn' is a Python WSGI HTTP Server for UNIX. It's a pre-fork worker model ported from Ruby's Unicorn project. The Gunicorn server is broadly compatible with various web frameworks, simply implemented, light on server resources, and fairly speedy.
python  web  tools  django 
march 2011 by rybesh
Dumbo - GitHub
Dumbo is a project that allows you to easily write and run Hadoop programs in Python. More generally, Dumbo can be considered to be a convenient Python API for writing MapReduce programs.
python  hadoop  mapreduce 
march 2011 by rybesh
Python Interface to Stanford Parser
A python interface to the Stanford Parser. It uses JPype to create a Java virtual machine, instantiate the parser, and call methods on it. Most of the code is focused on getting the Stanford Dependencies, but it's easy to add API to call any method on the parser.
java  python  nlp 
february 2011 by rybesh
Modular toolkit for Data Processing (MDP)
Modular toolkit for Data Processing (MDP) is a Python data processing framework.

From the user's perspective, MDP is a collection of supervised and unsupervised learning algorithms and other data processing units that can be combined into data processing sequences and more complex feed-forward network architectures.
datamining  machinelearning  python  tools 
december 2010 by rybesh
HotQueue User Documentation — HotQueue v0.2.1 documentation
HotQueue is a Python library that allows you to use Redis as a message queue within your Python programs.
python  messaging  queue 
december 2010 by rybesh
MongoEngine User Documentation — MongoEngine v0.3 documentation
MongoEngine is an Object-Document Mapper, written in Python for working with MongoDB.
mongodb  orm  django  python  database 
october 2010 by rybesh
Software Maniacs blog » ijson
There is a certain value in processing JSON as a stream in SAX-like manner when the size of the payload starts to be counted in megabytes and if you don't need to actually store the whole object in memory.
json  python 
september 2010 by rybesh
django-treebeard
django-treebeard is a library that implements efficient tree implementations for the Django Web Framework 1.0+. It includes 3 different tree implementations: Adjacency List, Materialized Path and Nested Sets.
django  python 
september 2010 by rybesh
Text Processing APIs and Python NLTK Demos | Text Mining | Stemming | Tagging | Python NLTK Demo
The Text Processing API supports the following functionality:

Stemming & Lemmatization
Sentiment Analysis
Tagging and Chunk Extraction
nlp  api  webservices  python 
august 2010 by rybesh
Object Description Mapper — Open Knowledge Foundation RDF v0.14 documentation
It provides constructs that will seem familiar to those used to ORMs like SQLAlchemy or Django but which are backed by an RDF store containing multiple graphs and are ultimately expressed in OWL.
python  semweb  rdf  owl  database  code 
august 2010 by rybesh
Journal of Statistical Software — Show
This user guide describes a Python package, PyMC, that allows users to efficiently code a probabilistic model and draw samples from its posterior distribution using Markov chain Monte Carlo techniques.
statistics  tools  python 
august 2010 by rybesh
Hookbox
Hookbox is a Comet server and message queue that tightly integrates with your existing web application via web hooks and a REST interface.
comet  messaging  python  queue  ajax  REST  api 
july 2010 by rybesh
Pyjamas - Python Javascript Compiler, Desktop Widget Set and RIA Web Framework
Pyjamas is a Rich Internet Application (RIA) Development Platform for both Web and Desktop.

It contains a Python-to-Javascript compiler, an AJAX framework and a Widget Set API. Pyjamas started life as a Python port of Google Web Toolkit, the Java-to-Javascript compiler.
python  ajax  javascript  framework 
july 2010 by rybesh
SANE - Introduction
SANE stands for "Scanner Access Now Easy" and is an application programming interface (API) that provides standardized access to any raster image scanner hardware (flatbed scanner, hand-held scanner, video- and still-cameras, frame-grabbers, etc.). The SANE API is public domain and its discussion and development is open to everybody. The current source code is written for UNIX (including GNU/Linux) and is available under the GNU General Public License (the SANE API is available to proprietary applications and backends as well, however). More details about the license can be found on our license page. Ports to MacOS X, OS/2 and Microsoft Windows are either already done or in progress.
scanner  code  python 
july 2010 by rybesh
Ian Bicking: a blog :: WebTest HTTP testing
If you are running local tests against your application using WebTest, with just a little tweaking you can turn those tests into HTTP tests (i.e., actually connect to a socket).
python  testing  web 
april 2010 by rybesh
google-diff-match-patch - Project Hosting on Google Code
The Diff Match and Patch libraries offer robust algorithms to perform the operations required for synchronizing plain text.
text  code  diff  collaboration  python 
march 2010 by rybesh
htmlfill — FormEncode v1.2.2 documentation
formencode.htmlfill is a library to fill out forms, both with default values and error messages. It’s like a template library, but more limited, and it can be used with the output from other templates.
python  web  forms 
march 2010 by rybesh
PiCloud | Cloud Computing. Simplified.
“import cloud; cloud.call(my_function, arguments)” serialises my_function and its arguments, pushes it up to one of their EC2 servers and hands you back a job ID which you can poll (or block on) for a response.
cloud  python 
february 2010 by rybesh
Python Package Index : Unidecode 0.04.1
What Unidecode provides is a function, 'unidecode(...)' that takes Unicode data and tries to represent it in ASCII characters (i.e., the universally displayable characters between 0x00 and 0x7F). The representation is almost always an attempt at transliteration -- i.e., conveying, in Roman letters, the pronunciation expressed by the text in some other writing system.
python  unicode  i18n 
february 2010 by rybesh
python-colormath - Project Hosting on Google Code
The python-colormath is a python module that abstracts common color math operations. For example, converting from CIE L*a*b to XYZ, or from RGB to CMYK.
python  color 
february 2010 by rybesh
libcloud python library - a unified interface to cloud server providers
libcloud is a pure python client library for interacting with many of the popular cloud server providers. It was created to make it easy for developers to build products that work between any of the services that it supports.
python  cloud  tools  api 
february 2010 by rybesh
TOPP Cloud — toppcloud v0.0 documentation
This is a script that:
1. Can create and destroy virtual servers, using a Cloud service API (something supported by libcloud).
2. Sets up an Ubuntu server to a known configuration.
3. Deploys Python web applications to these servers.
python  cloud  tools  deployment 
february 2010 by rybesh
Unit testing a Tornado web application - Tornado Web Server | Google Groups
A fully-functional Python unittest example where we have a Tornado web application that is being tested.
python  tornado  testing 
january 2010 by rybesh
rlisagor's freshen at master - GitHub
Freshen is an acceptance testing framework for Python.
python  django  testing 
january 2010 by rybesh
Beaker
Beaker is a library for caching and sessions for use with web applications and stand-alone Python scripts and applications. It comes with WSGI middleware for easy drop-in use with WSGI based web applications, and caching decorators for ease of use with any Python based application.
python  cache  performance 
december 2009 by rybesh
The OS X Spatial Stack :: Drive-by Digressions
Seth explains how to get all the necessary spatial hoo-hah working on Mac OS X without using MacPorts. GDAL, Mapnik, NumPy, Boost, Cascadenik and other necessary bits and pieces all included.
python  maps  software  osx  howto 
december 2009 by rybesh
Debugging in Python « Python Conquers The Universe
The missing manual for Python’s powerful pdb debugger.
python  debugging  howto 
november 2009 by rybesh
How to Make a US County Thematic Map Using Free Tools | FlowingData
How to make a county-specific choropleth map using only free tools.
maps  svg  cartography  infoviz  python  howto 
november 2009 by rybesh
Diesel: How Python Does Comet
diesel is a framework for writing network applications using asynchronous I/O in Python.
python  comet  code  networking  asynchronous 
september 2009 by rybesh
Tornado Web Server
Tornado is a scalable, non-blocking web server and tools. Because it is non-blocking and uses epoll, it can handle thousands of simultaneous standing connections, which means it is ideal for real-time web services.
python  webserver  comet  ajax  realtime 
september 2009 by rybesh
Python Package Index : topia.termextract 1.1.0
This package determines important terms within a given piece of content. It uses linguistic tools such as Parts-Of-Speech (POS) and some simple statistical analysis to determine the terms and their strength.
python  nlp  extraction 
august 2009 by rybesh
Deploying with Fabric & Mercurial / Steve Losh
After you’ve gotten all of this set up the first time it will start saving you time every time you deploy. It also prevents stupid mistakes like FTP’ing your files to the wrong directory on the server. It frees you from those headaches and lets you concentrate on the real work to be done instead of the busywork.
python  django  mercurial  versioning  deployment  fabric  howto 
august 2009 by rybesh
Improved DEBUG logging only when it's needed
This is a little snippet that'll let you sprinkle DEBUG level logging throughout your code and only have it appear in your log file when it's needed (for example when there's an exception.)
python  debugging  code 
august 2009 by rybesh
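One way to get this behaviour with the standard library alone is `logging.handlers.MemoryHandler`, which buffers records and passes them on to a target handler once a record at or above `flushLevel` arrives. This is a sketch of the idea, not the bookmarked snippet itself; the helper name `make_buffered_logger` and the logger name `demo.buffered` are invented for the example.

```python
import io
import logging
from logging.handlers import MemoryHandler


def make_buffered_logger(name, stream, capacity=100):
    """Return a logger that holds records in memory until an ERROR occurs."""
    target = logging.StreamHandler(stream)
    target.setFormatter(logging.Formatter("%(levelname)s %(message)s"))
    # Records are buffered and written to `target` only when a record at
    # ERROR or above arrives, or when `capacity` records have accumulated.
    buffered = MemoryHandler(capacity, flushLevel=logging.ERROR, target=target)
    logger = logging.getLogger(name)
    logger.setLevel(logging.DEBUG)
    logger.addHandler(buffered)
    logger.propagate = False  # keep the demo output out of the root logger
    return logger


stream = io.StringIO()
log = make_buffered_logger("demo.buffered", stream)
log.debug("connecting")      # held in memory, nothing written yet
log.error("request failed")  # triggers a flush of everything buffered so far
```

Note that the buffer is also flushed when it fills up, so `capacity` should be larger than the number of DEBUG records expected between errors if the goal is to keep quiet runs out of the log.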
10.7. itertools — Functions creating iterators for efficient looping — Python v2.6.2 documentation
The module standardizes a core set of fast, memory efficient tools that are useful by themselves or in combination. Together, they form an “iterator algebra” making it possible to construct specialized tools succinctly and efficiently in pure Python.
python 
july 2009 by rybesh
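A small illustration of the "iterator algebra" idea: two tiny tools built by combining itertools primitives. The helper names `take` and `run_lengths` are illustrative and not part of the module itself.

```python
from itertools import chain, count, groupby, islice


def take(n, iterable):
    """Return the first n items of any iterable, including infinite ones."""
    return list(islice(iterable, n))


def run_lengths(iterable):
    """Collapse consecutive equal items into (item, run length) pairs."""
    return [(key, sum(1 for _ in group)) for key, group in groupby(iterable)]


# chain() glues a finite sequence onto an infinite counter; take() keeps it lazy.
first_items = take(4, chain("ab", count(10)))
```

Nothing here builds an intermediate list until `take` or `run_lengths` asks for one, which is the memory-efficiency point the description makes.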
Fabric, Django, Git, Apache, mod_wsgi, virtualenv and pip deployment | Morethanseven
This particular fabric file makes setting up and deploying a django application much easier, but it does make a few assumptions. Namely that you’re using Git, Apache and mod_wsgi and you’re using Debian or Ubuntu.
python  development  django  automation  tools 
july 2009 by rybesh
Fabric — Fabric v0.9 beta 1 documentation
Fabric is a Python library and command-line tool designed to streamline deploying applications or performing system administration tasks via the SSH protocol. It provides tools for running arbitrary shell commands (either as a normal login user, or via sudo), uploading and downloading files, and so forth.
python  django  tools 
july 2009 by rybesh
galvez's xmlwitch at master - GitHub
Python library for building XML trees.
python  xml  code 
july 2009 by rybesh
Tools of the Modern Python Hacker: Virtualenv, Fabric and Pip
For the modern Python programmer, some of the most important tools to aid in reduced complexity and repetition are virtualenv, Fabric, and pip.
python  code  development  management  howto 
july 2009 by rybesh
Geniusql - Trac
Geniusql is a public domain, low-level Object-Relational Mapper for Python applications. It primarily uses a generic Table Data Gateway architecture (as opposed to the more tightly-coupled Active Record architecture recently popularized by Ruby On Rails and Django).
python  database  sql  code  orm 
june 2009 by rybesh
Supervisor
Supervisor is a client/server system that allows its users to monitor and control a number of processes on UNIX-like operating systems.
unix  distributed  management  tools  python 
june 2009 by rybesh
simonw's geocoders at master - GitHub
Ultra simple API for geocoding a single string against various web services.
python  locative  code  geocoding 
may 2009 by rybesh
davisp's python-spidermonkey at master - GitHub
Execute arbitrary JavaScript code from Python. Allows you to reference arbitrary Python objects and functions in the JavaScript VM.
javascript  python  tools 
may 2009 by rybesh
disturbyte's zenqueue at master - GitHub
ZenQueue is an incredibly simple message queueing system.
python  messaging  json  queue 
may 2009 by rybesh
python-sqlparse - Google Code
sqlparse is a non-validating SQL parser for Python. It provides support for parsing, splitting and formatting SQL statements.
python  database  sql  presentation  sparql  parsing 
april 2009 by rybesh
Haystack - Search for Django
Haystack provides modular search for Django. It features a unified, familiar API that allows you to plug in different search backends (such as Solr, Whoosh, etc.) without having to modify your code.
django  python  search  framework  searchengine  tools 
april 2009 by rybesh
Developing Django apps with zc.buildout
It’s an exceedingly civilized way to develop an app.
django  python  packaging  howto  development 
april 2009 by rybesh