Inline Scripts ============== **mitmproxy** has a powerful scripting API that allows you to modify flows on-the-fly or rewrite previously saved flows locally. The mitmproxy scripting API is event driven - a script is simply a Python module that exposes a set of event methods. Here's a complete mitmproxy script that adds a new header to every HTTP response before it is returned to the client: .. literalinclude:: ../../examples/add_header.py :caption: examples/add_header.py :language: python The first argument to each event method is an instance of :py:class:`~libmproxy.script.ScriptContext` that lets the script interact with the global mitmproxy state. The **response** event also gets an instance of :py:class:`~libmproxy.script.ScriptContext`, which we can use to manipulate the response itself. We can now run this script using mitmdump or mitmproxy as follows: >>> mitmdump -s add_header.py The new header will be added to all responses passing through the proxy. Examples -------- mitmproxy comes with a variety of example inline scripts, which demonstrate many basic tasks. We encourage you to either browse them locally or on `GitHub`_. Events ------ .. TODO: Split this into Connection, HTTP and TCP events once we have TCP events. The ``context`` argument passed to each event method is always a :py:class:`~libmproxy.script.ScriptContext` instance. It is guaranteed to be the same object for the scripts lifetime and is not shared between multiple inline scripts. You can safely use it to store any form of state you require. Events are listed in the order they usually occur. .. py:function:: start(context, argv) Called once on startup, before any other events. :param List[str] argv: The inline scripts' arguments. For example, ``mitmproxy -s 'example.py --foo 42'`` sets argv to ``["--foo", "42"]``. .. py:function:: clientconnect(context, root_layer) Called when a client initiates a connection to the proxy. Note that a connection can correspond to multiple HTTP requests. .. versionchanged:: 0.14 :param Layer root_layer: The root layer (see :ref:`protocols` for an explanation what the root layer is), which provides transparent access to all attributes of the :py:class:`~libmproxy.proxy.RootContext`. For example, ``root_layer.client_conn.address`` gives the remote address of the connecting client. .. py:function:: request(context, flow) Called when a client request has been received. The ``flow`` object is guaranteed to have a non-None ``request`` attribute. :param HTTPFlow flow: The flow containing the request which has been received. The object is guaranteed to have a non-None ``request`` attribute. .. py:function:: serverconnect(context, server_conn) Called before the proxy initiates a connection to the target server. Note that a connection can correspond to multiple HTTP requests. :param ServerConnection server_conn: The server connection object. It is guaranteed to have a non-None ``address`` attribute. .. py:function:: responseheaders(context, flow) Called when the headers of a server response have been received. This will always be called before the response hook. :param HTTPFlow flow: The flow containing the request and response. The object is guaranteed to have non-None ``request`` and ``response`` attributes. ``response.content`` will be ``None``, as the response body has not been read yet. .. py:function:: response(context, flow) Called when a server response has been received. :param HTTPFlow flow: The flow containing the request and response. The object is guaranteed to have non-None ``request`` and ``response`` attributes. ``response.body`` will contain the raw response body, unless response streaming has been enabled. .. py:function:: error(context, flow) Called when a flow error has occurred, e.g. invalid server responses, or interrupted connections. This is distinct from a valid server HTTP error response, which is simply a response with an HTTP error code. :param HTTPFlow flow: The flow containing the error. It is guaranteed to have non-None ``error`` attribute. .. py:function:: serverdisconnect(context, server_conn) Called when the proxy has closed the server connection. .. versionadded:: 0.14 :param ServerConnection server_conn: see :py:func:`serverconnect` .. py:function:: clientdisconnect(context, root_layer) Called when a client disconnects from the proxy. .. versionchanged:: 0.14 :param Layer root_layer: see :py:func:`clientconnect` .. py:function:: done(context) Called once on script shutdown, after any other events. API --- The canonical API documentation is the code, which you can browse here, locally or on `GitHub`_. *Use the Source, Luke!* The main classes you will deal with in writing mitmproxy scripts are: :py:class:`~libmproxy.script.ScriptContext` - A handle for interacting with mitmproxy's Flow Master from within scripts. :py:class:`~libmproxy.models.ClientConnection` - Describes a client connection. :py:class:`~libmproxy.models.ServerConnection` - Describes a server connection. :py:class:`~libmproxy.models.HTTPFlow` - A collection of objects representing a single HTTP transaction. :py:class:`~libmproxy.models.HTTPRequest` - An HTTP request. :py:class:`~libmproxy.models.HTTPResponse` - An HTTP response. :py:class:`~libmproxy.models.Error` - A communications error. :py:class:`netlib.odict.ODDict` - A dictionary-like object for managing sets of key/value data. There is also a variant called ODictCaseless that ignores key case for some calls (used mainly for headers). :py:class:`netlib.certutils.SSLCert` - Exposes information SSL certificates. :py:class:`libmproxy.flow.FlowMaster` - The "heart" of libmproxy, usually subclassed as :py:class:`libmproxy.dump.DumpMaster` or :py:class:`libmproxy.console.ConsoleMaster`. Script Context -------------- .. autoclass:: libmproxy.script.ScriptContext :members: :undoc-members: Running scripts in parallel --------------------------- We have a single flow primitive, so when a script is blocking, other requests are not processed. While that's usually a very desirable behaviour, blocking scripts can be run threaded by using the :py:obj:`libmproxy.script.concurrent` decorator. **If your script does not block, you should avoid the overhead of the decorator.** .. literalinclude:: ../../examples/nonblocking.py :caption: examples/nonblocking.py :language: python Make scripts configurable with arguments ---------------------------------------- Sometimes, you want to pass runtime arguments to the inline script. This can be simply done by surrounding the script call with quotes, e.g. ```mitmdump -s 'script.py --foo 42'``. The arguments are then exposed in the start event: .. literalinclude:: ../../examples/modify_response_body.py :caption: examples/modify_response_body.py :language: python Running scripts on saved flows ------------------------------ Sometimes, we want to run a script on :py:class:`~libmproxy.models.Flow` objects that are already complete. This happens when you start a script, and then load a saved set of flows from a file (see the "scripted data transformation" example `here `_). It also happens when you run a one-shot script on a single flow through the ``|`` (pipe) shortcut in mitmproxy. In this case, there are no client connections, and the events are run in the following order: **start**, **request**, **responseheaders**, **response**, **error**, **done**. If the flow doesn't have a **response** or **error** associated with it, the matching events will be skipped. Spaces in the script path ------------------------- By default, spaces are interpreted as a separator between the inline script and its arguments (e.g. ``-s 'foo.py 42'``). Consequently, the script path needs to be wrapped in a separate pair of quotes if it contains spaces: ``-s '\'./foo bar/baz.py\' 42'``. .. _GitHub: https://github.com/mitmproxy/mitmproxy