aboutsummaryrefslogtreecommitdiff
path: root/textproc/p5-Text-Language-Guess/pkg-descr
blob: ebd6d567943bb7bcdef3e68429001c9f6182ac13 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
Text::Language::Guess guesses a document's language. Its implementation
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
it determines how many of the known stopwords the document contains for
each language supported by "Lingua::StopWords".

Each word in the document recognized as stopword of a particular
language scores one point for this language.

The "language_guess()" function takes a document as a parameter and
returns the abbreviation of the language that it is most likely written
in.

WWW: https://metacpan.org/release/Text-Language-Guess