green-spider/README.md

34 lines
867 B
Markdown
Raw Normal View History

2018-04-03 21:04:26 +02:00
# green-spider
2018-04-03 23:15:28 +02:00
2018-04-03 21:04:26 +02:00
Collects data on green websites and checks for things like SEO, performance, TLS.
2018-04-03 23:15:28 +02:00
Written and tested in Python3
2018-04-05 19:39:09 +02:00
### Usage
Run the spider:
```nohighlight
make spider
```
The result will be in `webapp/dist/data`.
Build the webapp:
```nohighlight
make webapp
```
2018-04-03 23:15:28 +02:00
### Ideas
- If the URL does not start with `www.`, will entering `www.<url>` also work?
- If the URL is HTTP, is it possible to access the site via HTTPS (recommended)?
- If the URL is HTTPS, is it possible to access the sire via HTTP (recommended: redirect to HTTPS)
- Check which cookies are set and with what settings (expiry, domain)
- submit the URL against a service like Google Page Speed and retrieve the score
- Check against our own webpagetest.org instance
- Detect which one of the well-known CMS is used
2018-04-04 21:14:16 +02:00
- Certificate expiry warning
- Favourite icon availability check