https://github.com/aleksandarhaber/q-learning-algorithm-in-python-with-cart-pole-openai-gym--gymnasium-environment