mirror of
https://github.com/netzbegruenung/green-spider.git
synced 2024-05-08 03:43:41 +02:00
28 lines
846 B
Markdown
28 lines
846 B
Markdown
# green-spider
|
|
|
|
Collects data on green websites and checks for things like SEO, performance, TLS.
|
|
|
|
Written and tested in Python3
|
|
|
|
### Ideas
|
|
|
|
- If the URL does not start with `www.`, will entering `www.<url>` also work?
|
|
- If the URL is HTTP, is it possible to access the site via HTTPS (recommended)?
|
|
- If the URL is HTTPS, is it possible to access the sire via HTTP (recommended: redirect to HTTPS)
|
|
- Check which cookies are set and with what settings (expiry, domain)
|
|
- submit the URL against a service like Google Page Speed and retrieve the score
|
|
- Check against our own webpagetest.org instance
|
|
- Detect which one of the well-known CMS is used
|
|
- Certificate expiry warning
|
|
- Favourite icon availability check
|
|
|
|
### Usage
|
|
|
|
```nohighlight
|
|
virtualenv -p python3 venv
|
|
source venv/bin/activate
|
|
pip install -r requirements.txt
|
|
|
|
python spider.py
|
|
```
|