diff options
author | Dmitry Marakasov <amdmi3@FreeBSD.org> | 2016-05-19 10:53:05 +0000 |
---|---|---|
committer | Dmitry Marakasov <amdmi3@FreeBSD.org> | 2016-05-19 10:53:05 +0000 |
commit | 1f8b48b772b2d0ac0ed48a8259d2117ea3236a90 (patch) | |
tree | 06f11bc7b351f3abf3bf5507405afdba5374614e /textproc/p5-Algorithm-RabinKarp | |
parent | 4e942b64191e2ef98dce2c5af31047a8640db768 (diff) | |
download | ports-1f8b48b772b2d0ac0ed48a8259d2117ea3236a90.tar.gz ports-1f8b48b772b2d0ac0ed48a8259d2117ea3236a90.zip |
- Fix trailing whitespace in pkg-descrs, categories [p-x]*
Approved by: portmgr blanket
Notes
Notes:
svn path=/head/; revision=415500
Diffstat (limited to 'textproc/p5-Algorithm-RabinKarp')
-rw-r--r-- | textproc/p5-Algorithm-RabinKarp/pkg-descr | 16 |
1 files changed, 8 insertions, 8 deletions
diff --git a/textproc/p5-Algorithm-RabinKarp/pkg-descr b/textproc/p5-Algorithm-RabinKarp/pkg-descr index 5d4ddcc02bcd..28adfaf561d7 100644 --- a/textproc/p5-Algorithm-RabinKarp/pkg-descr +++ b/textproc/p5-Algorithm-RabinKarp/pkg-descr @@ -1,17 +1,17 @@ -This is an implementation of Rabin and Karp's streaming hash, as described -in "Winnowing: Local Algorithms for Document Fingerprinting" by Schleimer, -Wilkerson, and Aiken. Following the suggestion of Schleimer, I am using +This is an implementation of Rabin and Karp's streaming hash, as described +in "Winnowing: Local Algorithms for Document Fingerprinting" by Schleimer, +Wilkerson, and Aiken. Following the suggestion of Schleimer, I am using their second equation: $H[ $c[2..$k + 1] ] = (( $H[ $c[1..$k] ] - $c[1] ** $k ) + $c[$k+1] ) * $k -The results of this hash encodes information about the next k values in -the stream (hense k-gram.) This means for any given stream of length n +The results of this hash encodes information about the next k values in +the stream (hense k-gram.) This means for any given stream of length n integer values (or characters), you will get back n - k + 1 hash values. -For best results, you will want to create a code generator that filters -your data to remove all unnecessary information. For example, in a large -english document, you should probably remove all white space, as well as +For best results, you will want to create a code generator that filters +your data to remove all unnecessary information. For example, in a large +english document, you should probably remove all white space, as well as removing all capitalization. WWW: http://search.cpan.org/dist/Algorithm-RabinKarp/ |