NAME
   Net::DNSBL::MultiDaemon - multi DNSBL prioritization

SYNOPSIS
     use Net::DNSBL::MultiDaemon qw(
           :debug
           run
           bl_lookup
           set_extension
     );

     run($BLzone,$L,$R,$DNSBL,$STATs,$Run,$Sfile,$StatStamp,$DEBUG)
     bl_lookup($put,$mp,$rtp,$sinaddr,$alarm,$rid,$id,$rip,$type,$zone,@blist);

DESCRIPTION
   Net::DNSBL::MultiDaemon is the Perl module that implements the
   multi_dnsbl daemon.

   multi_dnsbl is a DNS emulator daemon that increases the efficacy of
   DNSBL look-ups in a mail system. multi_dnsbl may be used as a
   stand-alone DNSBL or as a plug-in for a standard BIND 9 installation.
   multi_dnsbl shares a common configuration file format with the
   Mail::SpamCannibal sc_BLcheck.pl script so that DNSBL's can be
   maintained in a common configuration file for an entire mail
   installation.

   Because DNSBL usefulness is dependent on the nature and source of spam
   sent to a specific site and because sometimes DNSBL's may provide
   intermittant service, multi_dnsbl interrogates them sorted in the order
   of greatest successful hits. DNSBL's that do not respond within the
   configured timeout period are not interrogated at all after 6
   consecutive failures, and thereafter will be retried not more often than
   once every hour until they come back online. This eliminates the need to
   place DNSBL's in a particular order in your MTA's config file or
   periodically monitor the DNSBL statistics and/or update the MTA config
   file.

   In addition to optimizing DNSBL interrogation, multi_dnsbl may be
   configured to locally accept or reject specified IP's, IP ranges and to
   reject specified countries by 2 character country code. By adding a
   DNSBL entry of in-addr.arpa, IP's will be rejected that do not return
   some kind of valid reverse DNS lookup. In addition, IP's can be rejected
   that have a PTR record that matchs a configurable GENERIC 'regexp' set.

   Reject codes are as follows:

     query 2.0.0.127.{zonename}    127.0.0.2
     blocked by configured DNSBL   127.0.0.2
     no reverse DNS                127.0.0.4
     BLOCKED (local blacklist)     127.0.0.5
     Blocked by Country            127.0.0.6
     Blocked GENERIC               127.0.0.7

OPERATION
   The configuration file for multi_dnsbl contains optional IGNORE (always
   pass), optional BLOCK (always reject), and optional BBC (block by
   country) entries against which all received queries are checked before
   external DNSBL's are queried. IP's which pass IGNORE, BLOCK, and BBC
   test are then checked against the prioritized list of DNSBL's to try
   when looking up an IP address for blacklisting. Internally, multi_dnsbl
   maintains this list in sorted order (including 'in-addr.arpa') based on
   the number of responses that resulted in an acceptable A record being
   returned from the DNSBL query. For each IP address query sent to
   multi_dnsbl, a query is sent to each configured DNSBL sequentially until
   all DNSBL's have been queried or an acceptable A record is returned.

   Let us say for example that blackholes.easynet.nl (below) will return an
   A record and list.dsbl.org, bl.spamcop.net, dynablock.easynet.nl, will
   not.

                   LIST
           9451    list.dsbl.org
           6516    bl.spamcop.net
           2350    dynablock.easynet.nl
           575     blackholes.easynet.nl
           327     cbl.abuseat.org
           309     dnsbl.sorbs.net
           195     dnsbl.njabl.org
           167     sbl.spamhaus.org
           22      spews.dnsbl.net.au
           6       relays.ordb.org
           1       proxies.blackholes.easynet.nl
           0       dsbl.org

   A query to multi_dnsbl (pseudo.dnsbl in this example) looks like this

           QUERY
     1.2.3.4.pseudo.dnsbl
             |
             V
     ####################
     #    multi_dnsbl   #
     ####################
      |                                  RESPONSE
      +--> 1.2.3.4.list.dsbl.org         NXDOMAIN
      |
      +--> 1.2.3.4.bl.spamcop.net        NXDOMAIN
      |
      +--> 1.2.3.4.dynablock.easynet.nl  NXDOMAIN
      |
      +--> 1.2.3.4.blackholes.easynet.nl A-127.0.0.2

   The A record is returned to originator of the Query and the statistics
   count on blackholes.easynet.nl is incremented by one.

INSTALLATION / CONFIGURATION / OPERATION
   multi_dnsbl can be installed as either a standalone DNSBL or as a
   plug-in to a BIND 9 installation on the same host. In either case, copy
   the rc.multi_daemon script to the appropriate startup directory on your
   host and modify the start, stop, restart scripts as required. Operation
   of the script is as follows:

     Syntax: ./rc.multi_dnsbl start    /path/to/config.file
             ./rc.multi_dnsbl start -v /path/to/config.file
             ./rc.multi_dnsbl stop     /path/to/config.file
             ./rc.multi_dnsbl restart  /path/to/config.file

     The -v switch will print the scripts
     actions verbosely to the STDERR.

 CONFIGURATION FILE

   The configuration file for multi_dnsbl shares a common format with the
   Mail::SpamCannibal sc_BLcheck.pl script, facilitating common maintenance
   of DNSBL's for your MTA installation.

   The sample configuration file multi_dnsbl.conf.sample is heavily
   commented with the details for each configuration element. If you plan
   to use a common configuration file in a SpamCannibal installation,
   simply add the following elements to the sc_BlackList.conf file:

     MDstatfile     => '/path/to/statistics/file.txt',
     MDpidpath      => '/path/to/pidfiles', # /var/run
     MDzone         => 'pseudo.dnsbl',

     # OPTIONAL
     MDstatrefresh => 300,       # seconds
     MDipaddr      => '0.0.0.0', # PROBABLY NOT WHAT YOU WANT
     MDport        => 9953,
     MDcache       => 10000,     # an entry takes ~400 bytes
                                 # default 10000 (to small)

   ### WARNING ### failure to set MDipaddr to a valid ip address will
   result in the authority section return an NS record of INADDR_ANY This
   will return an invalid NS record in stand alone operation

 STANDALONE OPERATION

   For standalone operation, simply set MDport = 53, nothing more is
   required.

   Interrogating the installation will then return the first match from the
   configured list of DNSBL servers.

     i.e.  dig 2.0.0.127.pseudo.dnsbl

           .... results

 PLUGIN to BIND 9

   multi_dnsbl may be used as a plugin helper for a standard bind 9
   installation by adding a forward zone to the configuration file as
   follows:

     //zone pseudo.dnsbl
     zone "pseudo.dnsbl" in {
           type forward;
           forward only;
           forwarders {
               127.0.0.1 port 9953;
           };
     };

   You may also wish to add one or more of the following statements with
   appropriate address_match_lists to restrict access to the facility.

           allow-notify {};
           allow-query { address_match_list };
           allow-recursion { address_match_list };
           allow-transfer {};

 MTA CONFIGURATION

   Access to DNSBL lookup is configured in the normal fashion for each MTA.
   Since MTA's generally must interrogate on port 53, multi_dnsbl must be
   installed on a stand-alone server or as a plugin for BIND 9.

   A typical configuration line for sendmail M4 configuration file is shown
   below:

     FEATURE(`dnsbl',`pseudo.dnsbl',
     `554 Rejected $&{client_addr} found in http://www.my.blacklist.org')dnl

SYSTEM SIGNALS
   multi_dnsbl responds to the following system signals:

   * TERM
       Operations the statistics file is updated with the internal counts
       and the daemon then exits.

   * HUP
       Operations are stopped including an update of the optional
       statistics file, the configuration file is re-read and operations
       are restarted.

   * USR1
       The statistics file is updated on the next second tick.

   * USR2
       The statistics file is deleted, internal statistics then a new
       (empty) statistics file is written on the next second tick.

PERL MODULE DESCRIPTION
   Net::DNSBL::MultiDaemon provides most of the functions that implement
   multi_dnsbl which is an MTA helper that interrogates a list of DNSBL
   servers in preferential order based on their success rate.

   The following describes the workings of individual functions used to
   implement multi_dnsbl.

   * run($BLzone,$L,$R,$DNSBL,$STATs,$Run,$Sfile,$StatStamp,$DEBUG);
       This function is the 'run' portion for the DNSBL multidaemon

         input:
               $BLzone zone name,
               $L      local listen socket object pointer,
               $R      remote socket object pointer,
               $DNSBL  config hash pointer,
               $STATs  statistics hash pointer
               $Run    pointer to stats refresh time,  # must be non-zero
               $Sfile  statistics file path,
               $StatStamp      stat file initial time stamp

         returns:      nothing

       * $BLzone
         The fully qualified domain name of the blacklist lookup

       * $L
         A pointer to a UDP listener object

       * $R
         A pointer to a unbound UDP socket used for interogation and
         receiving replies for the multiple DNSBL's

       * $DNSBL
         A pointer to the configuration hash of the form:

           $DNSBL = {
             # Always allow these addresses
                 'IGNORE' => [   # OPTIONAL
                    # a single address
                 '11.22.33.44',
                    # a range of ip's, ONLY VALID WITHIN THE SAME CLASS 'C'
                 '22.33.44.55 - 22.33.44.65',
                    # a CIDR range
                 '5.6.7.16/28',
                    # a range specified with a netmask
                 '7.8.9.128/255.255.255.240',
                    # you may want these
                 '10.0.0.0/8',
                 '172.16.0.0/12',
                 '192.168.0.0/16',
                    # this should ALWAYS be here
                 '127.0.0.0/8',  # ignore all test entries and localhost
                 ],

             # Do rhbl lookups only, default false
             # all other rejection classes are disabled, IGNORE, BLOCK, BBC, in-addr.arpa
             # RHBL need only be "true" for operation. If OPTIONAL URBL conditioning
             # is needed, then the parameters in the has must be added
                 RHBL    => {    # optional URBL preparation
                   urblwhite => [
                         '/path/to/cached/whitefile',
                         '/path/to/local/file'   # see format of spamassassin file
                   ],
                   urblblack => [
                         '/path/to/local/blacklist'
                   ],
         # NOTE: level 3 tld's should be first before level 2 tld's
                   urbltlds  => [
                         '/path/to/cached/tld3file',
                         '/path/to/cached/tld2file'
                   ],
                   urlwhite  => [
                         'http://spamassasin.googlecode.com/svn-history/r6/trunk/share/spamassassin/25_uribl.cf',
                         '/path/to/cached/whitefile'
                   ],
                   urltld3   => [
                         'http://george.surbl.org/three-level-tlds',
                         '/path/to/cached/tld3file'
                   ],
                   urltld2   => [
                         'http://george.surbl.org/two-level-tlds',
                         '/path/to/cached/tld2file'
                   ],
                 },

             # Authoratative answers
                 'AUTH'  => 0,

             # Always reject these addresses
                 'BLOCK' => [    # OPTIONAL
                    # same format as above
                 ],

             # Always block these countries
                 'BBC'   => [qw(CN TW RO )],

             # Check for reverse lookup failures - OPTIONAL
                 'in-addr.arpa'  => {
                     timeout     => 15,  # default timeout is 30
                 },

             # RBL zones as follows: OPTIONAL
                 'domain.name' => {
             # mark this dnsbl to require right hand side domain processing
             # requires URBL::Prepare
         # NOT IMPLEMENTED
         #           urbl        => 1,
                     acceptany   => 'comment - treat any response as valid',
             # or
                     accept      => {
                         '127.0.0.2' => 'comment',
                         '127.0.0.3' => 'comment',
                     },
             # or
             # mask the low 8 bits and accept any true result
                     acceptmask  => 0x3D,        # accepts 0011 1101

           #         timeout     => 30,  # default seconds to wait for dnsbl
                 },

                 'next.domain' = {
                     etc....
           # included but extracted external to B<run>

                 MDzone          => 'pseudo.dnsbl',
                 MDstatfile      => '/path/to/statistics/file.txt',
                 MDpidpath       => '/path/to/pidfiles
           # OPTIONAL, defaults shown
           #     MDstatrefresh   => 300, # max seconds for refresh
           #     MDipaddr        => '0.0.0.0', # PROBABLY NOT WHAT YOU WANT
           #     MDport          => 9953,
           # syslog. Specify the facility, one of:
           # LOG_EMERG LOG_ALERT LOG_CRIT LOG_ERR LOG_WARNING LOG_NOTICE LOG_INFO LOG_DEBUG
           #     MDsyslog        => 'LOG_WARNING',
           #
           #     cache lookups using the TTL of the providing DNSBL
           #     each cache entry takes about 400 bytes, minimum size = 1000
           #     MDcache         => 1000,      # 1000 is too small
           };

         Zone labels that are not of the form *.*... are ignored, making
         this hash table fully compatible with the SpamCannibal
         sc_Blacklist.conf file.

       * $STATs
         A pointer to a statistics collection array of the form:

           $STATs = {
                 'domain.name' => count,
                 etc...,
                 'CountryCode' => count,
                 etc...
           };

         Initialize this array with cntinit($DNSBL,$cp) the
         Net::DNSBL::Utilities manpage/cntinit, then list2hash($BBC,$cp)
         the Net::DNSBL::Utilities manpage/list2hash, then
         statinit($Sfile,$cp) the Net::DNSBL::Utilities manpage/statinit,
         below.

       * $Run
         A POINTER to the time in seconds to refresh the $STATs backing
         file. Even if there is not backing file used, this value must be a
         positive integer. Setting this value to zero will stop the daemon
         and force a restart. It is used by $SIG{HUP} to restart the
         daemon.

       * $Sfile
         The path to the STATISTICS backing file.

           i.e.  /some/path/to/filename.ext

         If $Sfile is undefined, then the time stamp need not be defined

       * $StatTimestamp
         Normally the value returned by statinit($Sfile,$cp) the
         Net::DNSBL::Utilities manpage/statinit, below.

   *
   bl_lookup($put,$mp,$rtp,$sinaddr,$alarm,$rid,$id,$rip,$type,$zone,@blist
   );
       Generates a query message for the first DNSBL in the @blist array.
       Creates a thread entry for the response and subsequent queries
       should the first one fail.

         input:        put,
                       message pointer,
                       remote thread pointer,
                       sockinaddr,
                       connection timeout,
                       remote id or undef to create
                       id of question,
                       reverse IP address in text
                       type of query received, (used in response)
                       ORIGINAL zone (case preserved),
                       array of remaining DNSBL's in sorted order
         returns:      nothing, puts stuff in thread queue

         extra:        if URBL processing is required,
                       $remoteThreads{$rid}->{urbl}
                       is set to the domain to look up

   * set_extension($pointer);
       This function sets a pointer to user defined extensions to
       Net::DNSBL::MultiDaemon.

       Pointer is of the form:

               $Extension ->{
                       OPCODE   => value,
                       CLASS    => subref->($Extension,internal args),
                       NAME     => subref->($Extension,internal args),
                       TYPE     => subref->($Extension,internal args),
                       LOOKUP   => subref->($Extension,internal args),
                       ANSWER   => subref->($Extension,internal args),
                       NOTFOUND => subref->($Extension,internal args)
               };

       The pointer should be blessed into the package of the caller if the
       calling package needs to store persistant variables for its own
       instance. The subref will be called with the first argument of
       $Extension.

       Care should be taken to NOT instantiate a %remoteThreads in the
       CLASS, NAME, TYPE section unless it is know that it will be found
       and expired/deleted.

       Read the code if you wish to add an extension

DEPENDENCIES
     Unix::Syslog
     Geo::IP::PurePerl [conditional for country codes]
     NetAddr::IP
     Net::DNS::Codes
     Net::DNS::ToolKit

EXPORT_OK
           run
           bl_lookup

EXPORT_TAGS :debug
     DEBUG is a set of semaphores for the 'run' function

     $D_CLRRUN    = 0x1;  # clear run flag and force unconditional return
     $D_SHRTHD    = 0x2;  # return short header message
     $D_TIMONLY   = 0x4;  # exit at end of timer section
     $D_QRESP     = 0x8;  # return query response message
     $D_NOTME     = 0x10; # return received response not for me
     $D_ANSTOP    = 0x20; # clear run OK flag if ANSWER present
     $D_VERBOSE   = 0x40; # verbose debug statements to STDERR

AUTHOR
   Michael Robinton, [email protected]

COPYRIGHT
   Copyright 2003 - 2014, Michael Robinton & BizSystems This program is
   free software; you can redistribute it and/or modify it under the terms
   as Perl itself or the GNU General Public License as published by the
   Free Software Foundation; either version 2 of the License, or (at your
   option) any later version.

   This program is distributed in the hope that it will be useful, but
   WITHOUT ANY WARRANTY; without even the implied warranty of
   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General
   Public License for more details.

   You should have received a copy of the GNU General Public License along
   with this program; if not, write to the Free Software Foundation, Inc.,
   59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.

SEE ALSO
   the URBL::Prepare manpage, the Geo::IP::PurePerl manpage, the
   Net::DNSBL::Utilities manpage, the Net::DNS::Codes manpage, the
   Net::DNS::ToolKit manpage, the Mail::SpamCannibal manpage