How do humans decipher reward in an uncertain state and environment?
Imitation seems the most likely, supported by extended solitude usually leading to a depressed state.
Feels like a question to run a human Monte Carlo Tree Search on!
#BeingHuman #ReinforcementLearning #InverseReinforcementLearning