NAME

   Sub::Throttler - Rate limit sync and async function calls

VERSION

   This document describes Sub::Throttler version v0.2.1

SYNOPSIS

       # Load throttling engine
       use Sub::Throttler qw( throttle_it );

       # Enable throttling for existing sync/async functions/methods
       throttle_it('Mojo::UserAgent::get');
       throttle_it('Mojo::UserAgent::post');

       # Load throttling algorithms
       use Sub::Throttler::Limit;
       use Sub::Throttler::Rate::AnyEvent;
       use Sub::Throttler::Periodic::EV;

       # Configure throttling algorithms
       my $throttle_parallel_requests
           = Sub::Throttler::Limit->new(limit => 5);
       my $throttle_request_rate
           = Sub::Throttler::Rate::AnyEvent->new(period => 0.1, limit => 10);

       # Apply configured limits to selected functions/methods with
       # throttling support
       $throttle_parallel_requests->apply_to_methods(Mojo::UserAgent => qw( get ));
       $throttle_request_rate->apply_to_methods(Mojo::UserAgent => qw( get post ));

DESCRIPTION

   This module provide sophisticated throttling framework which let you
   delay execution of any sync and async functions/methods based on any
   rules you need.

   You can use core features of this framework with usual sync application
   or with application based on any event loop, but some throttling
   algorithms may require specific event loop (like
   Sub::Throttler::Periodic::EV).

   The "SYNOPSIS" shows basic usage example, but there are a lot of
   advanced features: define which and how many resources each
   function/method use depending not only on it name but also on it
   params, normal and high-priority queues for delayed functions/methods,
   custom wrappers with ability to free unused resources, write your own
   functions/methods with smart support for throttling, implement own
   throttling algorithms, save and restore current limits and used
   resources between different executions of your app.

   Basic use case: limit rate for downloading urls. Advanced use case:
   apply complex limits for using remote API, which depends on how many
   items you process (i.e. on some params of API call), use high-priority
   queue to ensure login/re-login API call will be executed ASAP before
   any other delayed API calls, cancel unused limits in case of failed API
   call to avoid needless delay for next API calls, save/restore used
   resources to avoid occasional exceeding the quota because of
   crash/restart of your app.

ALGORITHMS / PLUGINS

   These algorithms are included in this module, but there are may be
   other algorithms available as separate modules on CPAN.

   Sub::Throttler::Limit implement algorithm to throttle based on quantity
   of used resources/limits. For example, it will let you limit an amount
   of simultaneous tasks.

   Sub::Throttler::Rate::AnyEvent implement algorithm to throttle based on
   rate (quantity of used resources/limits per some period of time). For
   example, it will let you control maximum calls/sec and burst rate (by
   choosing between "1000 calls per 1 second" and "10 calls per 0.01
   second" limits).

   Sub::Throttler::Periodic::EV is similar to
   Sub::Throttler::Rate::AnyEvent, but it treat "period" differently,
   using absolute wall-clock time instead of relative time (for ex. if
   period is set to 1 hour then it begins at 00m00s and ends at 59m59s of
   every hour).

HOW THROTTLING WORKS

   To be able to throttle some function/method it should support
   throttling. This mean it either should be implemented using this module
   (see "throttle_me", "throttle_me_asap" and "throttle_me_sync"), or you
   should replace original function/method with special wrapper. Simple
   (but limited) way to do this is use "throttle_it", "throttle_it_asap"
   and "throttle_it_sync" helpers. If simple way doesn't work for you then
   you can implement "custom wrapper" yourself.

   Next, not all functions/methods with throttling support should be
   actually throttled - you should configure which of them and how exactly
   should be throttled by using throttling algorithm objects.

   Then, when some function/method (which has throttling support and some
   limits applied using throttling algorithms) is called, it will try to
   acquire some resources (see below) - on success it will be immediately
   executed, otherwise it execution will be delayed until these resources
   will be available. When it's finished it should explicitly free used
   resources (if it was unable to do it work - for example because of
   network error - it may cancel resources as unused) - to let some other
   (delayed until these resources will be available) function/method runs.

   How exactly function/method will be delayed depends on it type:

   async

     Async function/method will be delayed by adding it into queue (normal
     or high-priority one) and returning. It will be run later, when
     resources needed for it will become available: when another async
     function/method (which is already running) will finish and release
     used resources, or when some resources will become available
     automatically as time passes, or if you reconfigure algorithm objects
     and increase amount of available resources, or if you manually
     release resources which you've manually acquired before. In most
     cases your application should use some event loop to have these
     events happens.

     As side effect, you won't get value returned by async function/method
     (if any) because execution of this function/method may be delayed by
     adding it into queue instead of running it immediately. In most cases
     this isn't a problem because async functions usually return results
     using user's callback when done and don't return anything useful when
     started.

   sync

     Sync function/method will be delayed by calling sleep() if some
     resources it needs isn't available yet. Of course, this will work
     only with algorithms which automatically release some resources as
     time passes (like Sub::Throttler::Rate::AnyEvent or
     Sub::Throttler::Periodic::EV). In all other cases - used algorithm
     doesn't release resources with time (like Sub::Throttler::Limit) or
     needed amount of resources is over algorithm's maximal limit - it
     doesn't make sense to sleep() and thus exception will be thrown (such
     exception indicate bug in your code and should never happens
     otherwise).

 HOW TO CONTROL LIMITS / RESOURCES

   When you configure throttling for some function/method you define which
   "resources" and how much of them it needs to run. The "resource" is
   just any string, and in simple case all throttled functions/methods
   will use same string (say, "default") and same quantity of this
   "resource": 1.

       # this algorithm allow using up to 5 "resources" of same name
       $throttle = Sub::Throttler::Limit->new(limit => 5);

       # this is same:
       $throttle->apply_to_functions('Package::func');
       # as this:
       $throttle->apply_to(sub {
           my ($this, $name, @params) = @_;
           if (!$this && $name eq 'Package::func') {
               return { default=>1 };  # require 1 resource named "default"
           }
           return;                     # do not throttle other functions/methods
       });

   But in complex cases you can (based on name of function or class name
   or exact object and method name, and their parameters) define several
   "resources" with different quantities of each.

       $throttle->apply_to(sub {
           my ($this, $name, @params) = @_;
           if (ref $this && $this eq $target_object && $name eq 'method') {
               # require 2 "some" and 10 "other" resources
               return { some=>2, other=>10 };
           }
           return;                     # do not throttle other functions/methods
       });

   It's allowed to "apply" same $throttle instance to same
   functions/methods more than once if you won't return same resource name
   more than once for same function/method.

   How exactly these "resources" will be acquired and released depends on
   used algorithm.

 PRIORITY / QUEUES

   There are two separate queues for delayed async functions/methods:
   normal and high-priority "asap" queue. Functions/methods in "asap"
   queue will be executed before any (even delayed before them)
   function/method in normal queue which require same resources to run.
   But if there are not enough resources to run function/method from
   high-priority queue and enough to run from normal queue -
   function/method from normal queue will be run.

   Which function/method will use normal and which "asap" queue is defined
   by that function/method (or it wrapper) implementation.

   Delayed methods in queue use weak references to their objects, so these
   objects doesn't kept alive only because of these delayed method calls.
   If these objects will be destroyed then their delayed methods will be
   silently removed from queue.

EXPORTS

   Nothing by default, but all documented functions can be explicitly
   imported.

   Use tag :ALL to import all of them.

INTERFACE

Enable throttling for existing functions/methods

   throttle_it

         my $orig_func = throttle_it('func');
         my $orig_func = throttle_it('Some::func2');

     This helper is able to replace with wrapper either sync
     function/method or async function/method which receive callback in
     last parameter.

     That wrapper will call $done->() (release used resources, see
     "throttle_me" for details about it) after sync function/method
     returns or just before callback of async function/method will be
     called.

     If given function name without package it will look for that function
     in caller's package.

     Return reference to original function or throws if given function is
     not exists.

   throttle_it_asap

         my $orig_func = throttle_it_asap('func');
         my $orig_func = throttle_it_asap('Some::func2');

     Same as "throttle_it" but use high-priority "asap" queue for async
     function/method calls.

   throttle_it_sync

         my $orig_func = throttle_it_sync('func');
         my $orig_func = throttle_it_sync('Some::func2');

     Same as "throttle_it" but doesn't try to handle given function/method
     as async even if it's called with CODEREF in last parameter.

 custom wrapper

   If you want to call $done->() after async function/method callback or
   before sync function/method will be actually called or want to cancel
   unused resources in some cases by calling $done->(0) you should
   implement custom wrapper instead of using "throttle_it",
   "throttle_it_asap" or "throttle_it_sync" helpers.

   Throttling anonymous function is not supported, that's why you need to
   use string eval instead of *Some::func = sub { 'wrapper' }; here
   (actually, you can avoid string eval using "set_subname" in Sub::Util).

       # Example wrapper for sync function which called in scalar context and
       # return false when it failed to do it work (we want to cancel
       # unused resources in this case).
       my $orig_func = \&Some::func;
       eval <<'EOW';
       no warnings 'redefine';
       sub Some::func {
           my $done = &throttle_me_sync;
           my $result = $orig_func->(@_);
           if ($result) {
               $done->();
           } else {
               $done->(0);
           }
           return $result;
       }
       EOW

       # Example wrapper for sync function/method which can be called in any
       # context, don't affect call stack and release resources when start.
       # Also let's use Sub::Util to avoid string eval.
       use Sub::Util 1.40 qw( set_subname );
       my $orig_sub = \&Some::sub;
       no warnings 'redefine';
       *Some::sub = set_subname 'Some::sub', sub {
           my $done = &throttle_me_sync;
           $done->();
           goto &$orig_sub;
       };

       # Example wrapper for async method which receive callback in first
       # parameter and call it with error message when it failed to do it
       # work (we want to cancel unused resources in this case); we also want
       # to call $done->() after callback and use "asap" queue.
       my $orig_method = \&Class::method;
       eval <<'EOW';
       no warnings 'redefine';
       sub Class::method {
           my $done = &throttle_me_asap || return;
           my $self = shift;
           my $orig_cb = shift;
           my $cb = sub {
               my ($error) = @_;
               $orig_cb->(@_);
               if ($error) {
                   $done->(0);
               } else {
                   $done->();
               }
           };
           $self->$orig_method($cb, @_);
           return;
       }
       EOW

Writing functions/methods with support for throttling

   To have maximum control over some function/method throttling you should
   write that function yourself (in some cases it's enough to write a
   "custom wrapper" for existing function/method). This will let you
   control when exactly it should release used resources or cancel unused
   resources to let next delayed function/method run as soon as possible.

   throttle_me

         sub async_func {
             my $done = &throttle_me || return;
             my (@params) = @_;
             if ('unable to do the work') {
                 $done->(0);
                 return;
             }
             ...
             $done->();
             return;
         }
         sub async_method {
             my $done = &throttle_me || return;
             my ($self, @params) = @_;
             if ('unable to do the work') {
                 $done->(0);
                 return;
             }
             ...
             $done->();
             return;
         }

     Support only async function/method, which can't return anything to
     caller (because it call may be delayed because of throttling). When
     this function/method will be executed it won't get anything useful
     from caller() or wantarray() because it will be called from internals
     of this module.

     You should use it exactly as it shown in these examples: it should be
     called using form &throttle_me because it needs to modify your
     function/method's @_.

     If your function/method should be delayed because of throttling it
     will return false, and you should interrupt your function/method.
     Otherwise it'll acquire "resources" needed to run your
     function/method and return callback which you should call later to
     release these resources.

     If your function/method has done it work (and thus "used" these
     resources) $done should be called without parameters $done->() or
     with one true param $done->(1); if it hasn't done it work (and thus
     "not used" these resources) it's better to call it with one false
     param $done->(0) to cancel these unused resources and give a chance
     for another function/method to reuse them.

     If you forget to call $done - you'll get a warning (and chances are
     you'll soon run out of resources because of this and new throttled
     functions/methods won't be run anymore), if you call it more than
     once - exception will be thrown.

     Anonymous functions are not supported.

   throttle_me_asap

     Same as "throttle_me" except use &throttle_me_asap:

             my $done = &throttle_me_asap || return;

     This will make this async function/method use high-priority "asap"
     queue instead of normal queue.

   throttle_me_sync

     Similar to "throttle_me" but for sync function/method:

             my $done = &throttle_me_sync;

     Delaying sync function/method is implemented using sleep(), so
     caller() and wantarray() will work as expected, so only difference
     from usual function/method is needs to call &throttle_me_sync on
     start and then later release resources.

   done_cb

         my $cb = done_cb($done, sub {
             my (@params) = @_;
             ...
         });

         my $cb = done_cb($done, sub {
             my ($extra1, $extra2, @params) = @_;
             ...
         }, $extra1, $extra2);

         my $cb = done_cb($done, $class_or_object, 'method');
         sub Class::Of::That::Object::method {
             my ($self, @params) = @_;
             ...
         }

         my $cb = done_cb($done, $class_or_object, 'method', $extra1, $extra2);
         sub Class::Of::That::Object::method {
             my ($self, $extra1, $extra2, @params) = @_;
             ...
         }

     This is a simple helper function used to make sure you won't forget
     to call $done->() in your async function/method with throttling
     support.

     First parameter must be  $done  callback, then either callback
     function or object (or class name) and name of it method, and then
     optionally any extra params for that callback function/object's
     method.

     Returns callback, which when called will first call $done->() and
     then given callback function or object's method with any extra params
     (if any) followed by it own params.

     Example:

         # use this:
         sub download {
             my $done = &throttle_me || return;
             my ($url) = @_;
             $ua->get($url, done_cb($done, sub {
                 my ($ua, $tx) = @_;
                 ...
             }));
         }
         # instead of this:
         sub download {
             my $done = &throttle_me || return;
             my ($url) = @_;
             $ua->get($url, sub {
                 my ($ua, $tx) = @_;
                 $done->();
                 ...
             });
         }

Implementing throttle algorithms/plugins

   It's recommended to inherit your algorithm from Sub::Throttler::algo.

   Each plugin must provide these methods (they'll be called by throttling
   engine):

       sub acquire {
           my ($self, $id, $key, $quantity) = @_;
           if ('resource temporary unavailable') {
               # wait for resource (don't use external event loop or
               # anything else which may call user's code/callbacks!)
               sleep(...);
           }
           # acquire $quantity of resources named $key for
           # function/method identified by $id
           if ('failed to acquire') {
               croak "$self: unable to acquire $quantity of resource '$key'";
           }
           return $self;
       }
       sub try_acquire {
           my ($self, $id, $key, $quantity) = @_;
           # try to acquire $quantity of resources named $key for
           # function/method identified by $id
           if ('failed to acquire') {
               return;
           }
           return 1;
       }
       sub release {
           my ($self, $id) = @_;
           # release resources previously acquired for $id
           if ('amount of available resources was increased') {
               throttle_flush();
           }
           return $self;
       }
       sub release_unused {
           my ($self, $id) = @_;
           # cancel unused resources previously acquired for $id
           if ('amount of available resources was increased') {
               throttle_flush();
           }
           return $self;
       }

   While trying to find out is there are enough resources to run some
   delayed function/method throttling engine may call release_unused()
   immediately after successful try_acquire() - if it turns out some other
   resource needed for same function/method isn't available.

   Also, usually plugins should provide few more methods, which isn't used
   by throttling engine (so they're optional in some sense), but usually
   they needed for application which uses that algorithm:

       sub new {
           my ($class, %options) = @_;
           # create new algorithm object
           return $self;
       }
       sub load {
           my ($class, $state) = @_;
           # create new algorithm object using previous state from $state
           # (which usually include both object's configuration like {limit}
           # plus information about currently acquired resources)
           return $self;
       }
       sub save {
           my ($self) = @_;
           # generate and return perl structure describing current
           # object's configuration and acquired resources
           return $state;
       }

   throttle_add

         throttle_add($throttle_plugin, sub {
             my ($this, $name, @params) = @_;
             # $this is undef or a class name or an object
             # $name is a function or method name
             # @params is function/method params
             ...
             return;                         # OR
             return {key1=>$quantity1, ...};
         });

     This function usually used to implement helper methods in algorithm
     like "apply_to" in Sub::Throttler::algo, "apply_to_functions" in
     Sub::Throttler::algo, "apply_to_methods" in Sub::Throttler::algo. But
     if algorithm doesn't implement such helpers it may be used directly
     by user to apply some algorithm instance to selected
     functions/methods.

   throttle_del

         throttle_del();
         throttle_del($throttle_plugin);

     Undo previous "throttle_add" calls with $throttle_plugin in first
     param or all of them if given no param. This is rarely useful,
     usually you setup throttling when your app initializes and then
     doesn't change it.

   throttle_flush

         throttle_flush();

     Algorithm must call it each time quantity of some resources increases
     (so there is a chance one of delayed functions/methods can be run
     now).

SUPPORT

Bugs / Feature Requests

   Please report any bugs or feature requests through the issue tracker at
   https://github.com/powerman/perl-Sub-Throttler/issues. You will be
   notified automatically of any progress on your issue.

Source Code

   This is open source software. The code repository is available for
   public review and contribution under the terms of the license. Feel
   free to fork the repository and submit pull requests.

   https://github.com/powerman/perl-Sub-Throttler

       git clone https://github.com/powerman/perl-Sub-Throttler.git

Resources

     * MetaCPAN Search

     https://metacpan.org/search?q=Sub-Throttler

     * CPAN Ratings

     http://cpanratings.perl.org/dist/Sub-Throttler

     * AnnoCPAN: Annotated CPAN documentation

     http://annocpan.org/dist/Sub-Throttler

     * CPAN Testers Matrix

     http://matrix.cpantesters.org/?dist=Sub-Throttler

     * CPANTS: A CPAN Testing Service (Kwalitee)

     http://cpants.cpanauthors.org/dist/Sub-Throttler

AUTHOR

   Alex Efros <[email protected]>

COPYRIGHT AND LICENSE

   This software is Copyright (c) 2014-2015 by Alex Efros
   <[email protected]>.

   This is free software, licensed under:

     The MIT (X11) License