Hilft Dir dabei, Deine BÜNDNIS 90/DIE GRÜNEN Website zu optimieren https://green-spider.netzbegruenung.de/
Go to file
2018-04-04 00:04:53 +02:00
.gitignore First working code and results 2018-04-03 23:15:28 +02:00
README.md First working code and results 2018-04-03 23:15:28 +02:00
requirements.txt First working code and results 2018-04-03 23:15:28 +02:00
result.json Add test results for new directory entries 2018-04-04 00:04:53 +02:00
spider.py Reduce concurrency to reduce variance in timing results 2018-04-03 23:25:14 +02:00

green-spider

Collects data on green websites and checks for things like SEO, performance, TLS.

Written and tested in Python3

Ideas

  • If the URL does not start with www., will entering www.<url> also work?
  • If the URL is HTTP, is it possible to access the site via HTTPS (recommended)?
  • If the URL is HTTPS, is it possible to access the sire via HTTP (recommended: redirect to HTTPS)
  • Check which cookies are set and with what settings (expiry, domain)
  • submit the URL against a service like Google Page Speed and retrieve the score
  • Check against our own webpagetest.org instance
  • Detect which one of the well-known CMS is used?

Usage

virtualenv -p python3 venv
source venv/bin/activate
pip install -r requirements.txt

python spider.py