NAME
Text::Darts - Perl interface to DARTS by Taku Kudoh
SYNOPSIS
use Text::Darts;
my @words = qw/ALGOL ANSI ARCO ARPA ARPANET ASCII/;
my %word = map { $_ => lc $_ } @words;
my $td = Text::Darts->new(@words);
my $newstr = $td->gsub("ARPANET is a net by ARPA", sub{ "<<$_[0]>>" });
# $newstr is now "<<ARPANET>> is a net by <<ARPA>>".
my $lstr = $td->gsub("ARPANET is a net by ARPA", \%words);
# $Lstr is now "arpanet is a net by arpa".
# or
my $td = Text::Darts->open("words.darts"); # make one with mkdarts
my $newstr = $td->gsub($str, sub{
qq(<a href="
http://dictionary.com/browse/$_[0]">$_[0]</a>)
}); # link'em all!
DESCRIPTION
Darts, or Double-ARray Trie System is a C++ Template Library by Taku
Kudoh. This module makes use of Darts to implement global replace like
below;
$str = s{ (foo|bar|baz) }{ "<<$1>>" }msgex;
The problem with regexp is that it is slow with alterations. Suppose you
want to anchor all words that appear in /usr/share/dict/words with
regexp. It would be impractical with regexp. This module makes it
practical.
Even if you are using perl 5.10 or better which does TRIE optimization
internally, you still have to compile the regexp everytime you run the
script. So it is still more practical to use this module if your match
is exact -- containing no quantifier.
Since Version 0.05, Text::Darts also accepts a hash reference instead of
a code reference. In such cases gsub behaves as follows.
$str = s{ (foo|bar|baz) }{$replacement{$1}}msgx;
like "s///ge" vs "s///g", this is less flexible but faster.
REQUIREMENT
Since Version 0.07, you no longer need to have Darts installed. This
module now bundles darts.h which is needed to build this module. To get
the most out of Darts, you still need to install Darts 0.32 or above.
<
http://chasen.org/~taku/software/darts/index.html> (Japanese)
fetch
http://chasen.org/~taku/software/darts/src/darts-0.32.tar.gz
tar zxvf darts-0.32.tar.gz
cd darts-0.32
configure
make
make check
sudo make install
EXPORT
None.
Functions
Text::Darts::DARTS_VERSION
returns DARTS version. Currently 0.32
SEE ALSO
<
http://chasen.org/~taku/software/darts/index.html> (Japanese)
Regexp::Assemble
AUTHOR
Dan Kogai, <
[email protected]>
COPYRIGHT AND LICENSE
darts.h
Copyright (c) 2001-2008, Taku Kudo All rights reserved.
Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are
met:
* Redistributions of source code must retain the above
copyright notice, this list of conditions and the
following disclaimer.
* Redistributions in binary form must reproduce the above
copyright notice, this list of conditions and the
following disclaimer in the documentation and/or other
materials provided with the distribution.
* Neither the name of the Nippon Telegraph and Telegraph Corporation
nor the names of its contributors may be used to endorse or
promote products derived from this software without specific
prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
"AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABI LITY AND FITNESS FOR
A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
OWNER OR CONT RIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES ( INCLUDING, BUT NOT
LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
DATA, OR P ROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
THEORY OF LIABILITY, WHETHER IN CONTRACT , STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF TH IS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
anything else
Copyright (C) 2007-2009 by Dan Kogai
This library is free software; you can redistribute it and/or modify
it under the same terms as Perl itself, either Perl version 5.10.0 or,
at your option, any later version of Perl 5 you may have available.