Yeah thanks, I have tried, I was comparing and found reinforcement learning is not really reinforcement in this setting. As it cannot allow exploring like what if scenario. It is similar to supervised learning setting as we do in labeling large data and trying to classify etc.
In contrast if we take https://gym.openai.com/envs/Pendulum-v0/ Environment allows algorithm to play with it and learn from new conditions and actions, so is reinforcement. To me finance is like teacher trying to tell same story no matter what student does/learns, bit wierd.
however there are many factors to consider before going that path
* Low price start will be barrier for any future price changes or re-assesment.
* There is quality perception attached to price so it impacts brand perception.
* Even with price differentiation quality is something cant be compromised (in most of cases)
Yes I do understand it like measuring temperature at one point on mac and trying to predict what application its running. However thats not what I wanted (though I dont know what I wanted), I was curious how activity changes and If am not wrong these activity comes at cost both energy and capacity to think / focus. Who knows if you can wear fitbit for mind tomorrow :)