Python Storlet Writing and Deployment Guideline

This is the Python specific storlet writing and deploying guide. This guide complements the more general guide for writing and deploying storlets which should be read first.

A python module implementing a storlet looks like this:

class <Class name>(object):
    def __init__(self, logger):
        self.logger = logger

    def __call__(self, in_files, out_files, params):
        """
        The function called for storlet invocation
        :param in_files: a list of StorletInputFile
        :param out_files: a list of StorletOutputFile
        :param params: a dict of request parameters
        """

Below is a class diagram illustrating the classes behind the in_files, out_files, and logger. The diagram lists only the methods that the storlet writer is expected to work with.

Python Programming Model Class Diagram
  1. The StorletInputFile is used to stream object’s data into the storlet. StorletInputFile has the same read methods as python FileObject. Trying to write to a StorletInputFile yields NotImplemented error. Whenever a storlet is invoked, an instance of this class is provided. To consume the metadata call the StorletInputFile.get_metadata method.
  2. The StorleOutputFile is used for writing the storlet output. StorletOutputFile has the same write methods as python FileObject. Trying to read from a StorletOutputFile yields NotImplemented error. Whenever a storlet is invoked, an instance of this class is provided. Use the StorletInputFile.set_metadata method to set the Object’s metadata. Note that the storlet must call the StorletInputFile set_metadata method. Moreowver, StorletInputFile.set_metadata must be called before writing the data.
  3. StorletLogger. The StorletLogger class implements the same log methods as the Python logger.

When invoked via the Swift REST API the __call__ method will be called as follows:

  1. The in_files list would include one or more element(s) of type StorleInputFile representing the object appearing in the request’s URI (and possibly extra resources).
  2. The out_files would include a single element of type StorleOutputFile representing the response returned to the user.
  3. The parameters is a dictionary with the execution parameters sent. These parameters can be specified in the storlet execution request.
  4. A StorletLogger instance.

Deploying a Python Storlet

Below are specific guidelines for deploying a Python storlet:

  1. The object name of the python module containing the storlet class implemetation must end with .py
  2. Any python modules that the class implementation is dependent on should be uploaded as separate .py(s).
  3. The ‘X-Object-Meta-Storlet-Main’ metadata key shold be of the form: <module_name>.<class_name>. For example, if the storlet name is SimpleStorlet and it resides in simple_storlet.py, then the ‘X-Object-Meta-Storlet-Main’ metadata key shold be “simple_storlet.SimpleStorlet”

Deploying a Python Dependency

  1. Currently, there is no limitation as to what is being uploaded as a dependency.

Writing Storlet App with IPython/Jupyter Notebook

Storlets supports IPython/Jupyter Notebookd extension to upload your own storlet apps with the following steps:

Note

To upload a storlet app to Swift one needs to provide the authentication information of the Storlet enabled Swift account. This is done by setting environment variables similar to those used by swift client. The exact variables that need to be set are dependent on the auth middleware used and the auth protocol version. For more details please refer to python-swiftclient docs.

In case you are working with an s2aio,sh installation just add a new cell with the following:

import os
os.environ['OS_AUTH_VERSION'] = '3'
os.environ['OS_AUTH_URL'] = 'http://127.0.0.1:5000/v3'
os.environ['OS_USERNAME'] = 'tester'
os.environ['OS_PASSWORD'] = 'testing'
os.environ['OS_USER_DOMAIN_NAME'] = 'default'
os.environ['OS_PROJECT_DOMAIN_NAME'] = 'default'
os.environ['OS_PROJECT_NAME'] = 'test'
  1. Enables storlets extension on your app
%reload_ext storlets.tools.extensions.ipython
  1. Add another cell for your app and add storletapp command to the top of the cell
%%storletapp test.TestStorlet

class TestStorlet(object):
    def __init__(self, logger):
        self.logger = logger

    def __call__(self, in_files, out_files, params):
        """
        The function called for storlet invocation
        :param in_files: a list of StorletInputFile
        :param out_files: a list of StorletOutputFile
        :param params: a dict of request parameters
        """
        self.logger.debug('Returning metadata')
        metadata = in_files[0].get_metadata()
        metadata['test'] = 'simple'
        out_files[0].set_metadata(metadata)

        self.logger.debug('Start to return object data')
        while True:
            buf = in_files[0].read(16)
            if not buf:
                break
        self.logger.debug('Recieved %d bytes' % len(buf))
        self.logger.debug('Writing back %d bytes' % len(buf))
        out_files[0].write(buf)
        self.logger.debug('Complete')
        in_files[0].close()
        out_files[0].close()
  1. If you want to run the app with actual data set, please specify some options like:
%%storletapp test.TestStorlet --with-invoke --input path:/<container>/<object> --print-result

N.B. the useful commands like ‘dry-run’, etc... is under development. And more details for options are in the next section.

Extension docs

Implementation of magic funcs for interaction with the OpenStack Storlets.

This extension is desined to use os environment variables to set authentication and storage target host. (for now)

class storlets.tools.extensions.ipython.StorletMagics(shell=None, **kwargs)

Bases: IPython.core.magic.Magics

Magics to interact with OpenStack Storlets

storletapp(line, cell)
%storletapp [-c CONTAINER] [-d DEPENDENCIES] [--with-invoke]
                [--input INPUT] [--print-result]
                module_class
positional arguments:
module_class module and class name to upload
optional arguments:
-c CONTAINER, --container CONTAINER
 Storlet container name, “storlet” in default
-d DEPENDENCIES, --dependencies DEPENDENCIES
 Storlet container name, “storlet” in default
--with-invoke An option to run storlet for testing. This requires –input option
--input INPUT Specifiy input object path that must be of the form “path:/<container>/<object>”
--print-result Print result objet to stdout. Note that this may be a largebinary depends on your app
uploadfile(line, cell)
%uploadfile container_obj

Upload the contents of the cell to OpenStack Swift.

positional arguments:
container_obj container/object path to upload