Diffstat (limited to 'www/larbin/pkg-descr')
-rw-r--r-- | www/larbin/pkg-descr | 20 |
1 files changed, 20 insertions, 0 deletions
diff --git a/www/larbin/pkg-descr b/www/larbin/pkg-descr
new file mode 100644
index 000000000000..7f95d1f309e8
--- /dev/null
+++ b/www/larbin/pkg-descr
@@ -0,0 +1,20 @@
+Larbin is a powerful web crawler (also called [web] robot, spider...). It
+is intended to fetch a large number of web pages to fill the database of a
+search engine. With a network fast enough, Larbin is able to fetch more than
+100 million pages on a standard PC.
+
+Larbin was initially developed for the XYLEME project in the VERSO team at
+INRIA. The goal of Larbin was to go and fetch XML pages on the web to fill
+the database of an xml-oriented search engine.
+
+The following can be done with Larbin:
+
+ o A crawler for a search engine
+ o A crawler for a specialized search enginer (xml, images, mp3...)
+ o Statistics on the web (about servers or page contents)
+
+Larbin is created by: Sebastien Ailleret
+
+WWW: http://larbin.sourceforge.net
+WWW: http://www.sourceforge.net/projects/larbin
+WWW: http://www.ailleret.com