aboutsummaryrefslogtreecommitdiff
path: root/devel/py-grab/pkg-descr
blob: 70d64ce186090a2a56b2d69be9feceb08c7a79f6 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
Grab is a python web scraping framework. Grab provides tons of helpful methods
to scrape web sites and to process the scraped content:

 * Automatic cookies (session) support
 * HTTP and SOCKS proxy with and without authorization
 * Keep-Alive support
 * IDN support
 * Tools to work with web forms
 * Easy multipart file uploading
 * Flexible customization of HTTP requests
 * Automatic charset detection
 * Powerful API of extracting info from HTML documents with XPATH queries
 * Asynchronous API to make thousands of simultaneous queries. This part of
   library called Spider and it is too big to even list its features in this
   README.
 * Python 3 ready

WWW: http://grablib.org/