green-spider/README.md

28 lines
846 B
Markdown
Raw Normal View History

2018-04-03 21:04:26 +02:00
# green-spider
2018-04-03 23:15:28 +02:00
2018-04-03 21:04:26 +02:00
Collects data on green websites and checks for things like SEO, performance, TLS.
2018-04-03 23:15:28 +02:00
Written and tested in Python3
### Ideas
- If the URL does not start with `www.`, will entering `www.<url>` also work?
- If the URL is HTTP, is it possible to access the site via HTTPS (recommended)?
- If the URL is HTTPS, is it possible to access the sire via HTTP (recommended: redirect to HTTPS)
- Check which cookies are set and with what settings (expiry, domain)
- submit the URL against a service like Google Page Speed and retrieve the score
- Check against our own webpagetest.org instance
- Detect which one of the well-known CMS is used
2018-04-04 21:14:16 +02:00
- Certificate expiry warning
- Favourite icon availability check
2018-04-03 23:15:28 +02:00
### Usage
```nohighlight
virtualenv -p python3 venv
source venv/bin/activate
pip install -r requirements.txt
python spider.py
```