My local rock climbing gym provides real-time occupancy numbers - yay! This repo attempts to capture this info over time, to find the quietest times to visit.
- Pull the repo and build the Docker image with `docker-compose build`.
- Start services with `docker-compose up -d`.
- Head to `localhost:8000` for the Graphite app (for finding metrics) and `localhost:3000` for the Grafana app (for visualising metrics).
- A dashboard will already have been configured in Grafana - head to Dashboards, Manage, and then "Gym occupancy".
- If you're not seeing data, check `docker-compose logs -f populator`.
I'm using StatsD for simple metric collection, Graphite for storage and Grafana for visualisations. This means that tracking a different website is as simple as writing a scraping script (like `fetch.py`) and printing StatsD metrics to stdout, piped to netcat. For example, this is how I use `fetch.py` (where `graphite:8125` is the network location of the `graphite` docker-compose container):

```sh
while true; do
  poetry run python fetch.py 2> /dev/null | nc -u -w1 graphite 8125;
  sleep 60;
done
```
If you're lucky, your target website will expose its occupancy data as a JSON endpoint - it's a fairly trivial matter to request that (`response = requests.get(url)`), parse the JSON (`response.json()`) and convert it into StatsD gauge metric lines (`print(f"gym.MYGYM.occupancy:{json['visitors']}|g")`).
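As a minimal sketch of that happy path (the endpoint URL and the `visitors` field are hypothetical - substitute whatever your gym's API actually returns):

```python
import requests

# Hypothetical endpoint - replace with your gym's actual API URL.
URL = "https://example-gym.com/api/occupancy"

response = requests.get(URL)
response.raise_for_status()
data = response.json()  # e.g. {"visitors": 23}

# A StatsD gauge line on stdout, ready to be piped to netcat.
print(f"gym.MYGYM.occupancy:{data['visitors']}|g")
```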
The site/service I'm scraping embeds their numbers into inline JavaScript on the page. I've employed two tools to extract these:
- Parse the `<script>` tag contents from the page. I'm using `html.parser`, which can be a bit tricky to understand if you haven't had the joy of using a streaming parser before (e.g. SAX in XML-parsing land). Essentially I'm listening for three 'events' as the HTML stream is parsed: start-tag (e.g. `<script>`), end-tag (`</script>`) and data, which is when the parser has finished reading everything inside the `<script></script>` tags. `ScriptExtractorParser` hooks into these events to collect a list of strings: the contents of every `<script></script>` tag on the page (except the empty ones). See the first sketch after this list.
- Parse the JavaScript source to extract the numbers. The simple method for this is regular expressions, though I've found these to be very brittle in the past, requiring lots of effort to keep them working. Instead, I've opted for a Python package called SlimIt - a JavaScript minifier that contains a nice JavaScript parser. This package lexes the JS source into tokens (letter, number, semicolon, etc.) and nodes (variable-declaration, string, array, null, etc.). We can then tap into this by visiting each node in the JS source to find our variable declaration `var data = {}` and scrape out our fields - see the second sketch below. It's more complicated, but if the occupancy provider decides to update their code later on, there's a smaller chance it will break our scraper (and it should be easier to fix).
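Here's a minimal sketch of the first step. It mirrors the idea behind `ScriptExtractorParser` - the repo's actual implementation may differ in detail:

```python
from html.parser import HTMLParser

class ScriptExtractorParser(HTMLParser):
    """Collects the contents of every non-empty <script> tag."""

    def __init__(self):
        super().__init__()
        self._in_script = False
        self.scripts = []

    def handle_starttag(self, tag, attrs):
        # Start-tag event: remember that we're inside a <script>.
        if tag == "script":
            self._in_script = True

    def handle_endtag(self, tag):
        # End-tag event: we've left the <script>.
        if tag == "script":
            self._in_script = False

    def handle_data(self, data):
        # Data event: keep the tag's contents, skipping empty scripts.
        if self._in_script and data.strip():
            self.scripts.append(data)

parser = ScriptExtractorParser()
parser.feed("<html><script>var data = {visitors: 23};</script></html>")
print(parser.scripts)  # ['var data = {visitors: 23};']
```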
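And a sketch of the second step, walking SlimIt's AST to find the `data` declaration. This uses SlimIt's documented `Parser`/`nodevisitor` API; the JS snippet is made up for illustration:

```python
from slimit import ast
from slimit.parser import Parser
from slimit.visitors import nodevisitor

js_source = "var data = {visitors: 23, capacity: 60};"  # made-up example

tree = Parser().parse(js_source)
for node in nodevisitor.visit(tree):
    # Look for the `var data = ...` declaration among all nodes.
    if isinstance(node, ast.VarDecl) and node.identifier.value == "data":
        # node.initializer is the object literal; to_ecma() re-serialises it.
        print(node.initializer.to_ecma())
```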
Finally, I'm emitting these numbers as StatsD gauge metrics: occupancy, capacity (probably static, but it may evolve as the Covid situation progresses) and an occupancy percentage (as Grafana doesn't seem to be able to compute a ratio of two metrics easily).
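Concretely, the emitted lines look something like this (the values are made up, and the `capacity`/`occupancy_percentage` metric names are my assumed naming, following the pattern shown earlier):

```python
occupancy, capacity = 23, 60  # hypothetical values scraped from the page

print(f"gym.MYGYM.occupancy:{occupancy}|g")
print(f"gym.MYGYM.capacity:{capacity}|g")
# Pre-computed, since Grafana can't easily divide one Graphite metric by another.
print(f"gym.MYGYM.occupancy_percentage:{100 * occupancy / capacity:.1f}|g")
```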
I've configured Grafana to boot up with a pre-configured Graphite data source and a dashboard. If you make changes to the dashboard, you can persist them by going to Dashboard Settings > JSON Model and saving the JSON to `provisioning/grafana/dashboards/your-dashboard.json`.
If you're thinking about deploying this somewhere, Grafana is configured to allow anonymous access, with automatically granted administrator rights. This is fine for me, because it quietly runs on my laptop. You should look into configuring Grafana properly :-)