diff options
Diffstat (limited to 'textproc/p5-Text-Language-Guess/pkg-descr')
-rw-r--r-- | textproc/p5-Text-Language-Guess/pkg-descr | 14 |
1 files changed, 14 insertions, 0 deletions
diff --git a/textproc/p5-Text-Language-Guess/pkg-descr b/textproc/p5-Text-Language-Guess/pkg-descr new file mode 100644 index 000000000000..9ab26f30870e --- /dev/null +++ b/textproc/p5-Text-Language-Guess/pkg-descr @@ -0,0 +1,14 @@ +Text::Language::Guess guesses a document's language. Its implementation +is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN, +it determines how many of the known stopwords the document contains for +each language supported by "Lingua::StopWords". + +Each word in the document recognized as stopword of a particular +language scores one point for this language. + +The "language_guess()" function takes a document as a parameter and +returns the abbreviation of the language that it is most likely written +in. + +Author: Mike Schilli <cpan@perlmeister.com> +WWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/ |