HTTP, SOCKS4, SOCKS5 proxies scraper and checker with rich functionality.

jaimecoj/proxy-scraper-checker

 
 

proxy-scraper-checker

HTTP, SOCKS4, SOCKS5 proxies scraper and checker.

  • Asynchronous.
  • Uses a regular expression to find proxies (in ip:port format) on a web page, so proxies can be extracted even from JSON without any changes to the code.
  • Supports determining the geolocation of the proxy exit node.
  • Can determine if a proxy is anonymous.
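
As an illustration of the regex-based extraction described above, here is a minimal sketch. The pattern and the `find_proxies` helper are assumptions made for this example and are not necessarily what the script itself uses. It shows why the same search works on plain text and on JSON: the pattern only looks for ip:port substrings and ignores the surrounding markup.

```python
import re
from typing import List

# Hypothetical pattern (not taken from the project): four IPv4 octets in the
# range 0-255, a colon, and a port of 1 to 5 digits.
_OCTET = r"(?:25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)"
PROXY_RE = re.compile(
    rf"(?<![\d.])((?:{_OCTET}\.){{3}}{_OCTET}):(\d{{1,5}})(?!\d)"
)


def find_proxies(text: str) -> List[str]:
    """Return unique ip:port strings found in text, in order of appearance."""
    found: List[str] = []
    for match in PROXY_RE.finditer(text):
        host, port = match.group(1), int(match.group(2))
        # Reject ports outside the valid TCP range.
        if not 1 <= port <= 65535:
            continue
        proxy = f"{host}:{port}"
        if proxy not in found:
            found.append(proxy)
    return found


if __name__ == "__main__":
    # The same function handles a JSON payload without any JSON parsing.
    page = '{"proxies": ["10.0.0.1:3128", "192.168.1.5:1080"]}'
    print(find_proxies(page))
```

Because the search runs over raw text, adding a new source in a different format would not require a format-specific parser, at the cost of possibly matching ip:port strings that are not proxies.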

For a version that uses Python's built-in logging instead of rich, see the simple-output branch.

Proxies obtained using this script are available in monosans/proxy-list.

Usage

  • Make sure your Python version is 3.7 or higher.
  • Install dependencies from requirements.txt (python -m pip install -U -r requirements.txt).
    • To improve performance, you can also install the optional extra dependencies described in the aiohttp documentation.
  • Edit config.py according to your preference.
  • Run main.py.
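
If you are unsure whether your interpreter meets the version requirement from the first step, a quick check along these lines can help. This is only a sketch: `MIN_VERSION` and `is_supported` are hypothetical names for this example and are not part of the project.

```python
import sys

# Minimum version stated in the usage steps above.
MIN_VERSION = (3, 7)


def is_supported(version_info=sys.version_info) -> bool:
    """Return True if the given (major, minor, ...) version is new enough."""
    return tuple(version_info[:2]) >= MIN_VERSION


if __name__ == "__main__":
    if not is_supported():
        sys.exit("Python 3.7 or higher is required.")
    print("Python version is supported.")
```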

Folders description

When the script finishes running, the following folders will be created (this behavior can be changed in the config):

  • proxies - proxies with any anonymity level.
  • proxies_anonymous - anonymous proxies.
  • proxies_geolocation - same as proxies, but including the exit node's geolocation.
  • proxies_geolocation_anonymous - same as proxies_anonymous, but including the exit node's geolocation.

Geolocation format is ip:port::Country::Region::City.
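
To consume the geolocation files from another program, each line can be split on the `::` separator. The sketch below assumes one proxy per line in exactly the format shown above. `GeoProxy` and `parse_geo_line` are hypothetical names for this example, and the sample line is made-up data.

```python
from typing import NamedTuple


class GeoProxy(NamedTuple):
    host: str
    port: int
    country: str
    region: str
    city: str


def parse_geo_line(line: str) -> GeoProxy:
    """Parse one 'ip:port::Country::Region::City' line."""
    # Split on the double-colon separator first; the single colon inside
    # ip:port is left untouched.
    address, country, region, city = line.strip().split("::", 3)
    host, port = address.rsplit(":", 1)
    return GeoProxy(host, int(port), country, region, city)


if __name__ == "__main__":
    # Made-up example line using a documentation-only IP address.
    sample = "203.0.113.7:8080::United States::California::Los Angeles"
    print(parse_geo_line(sample))
```

A line that does not have all five fields raises `ValueError`, which makes malformed entries easy to detect and skip.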

Buy me a coffee

Ask for details in Telegram or VK.

License

MIT
