You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Some agents only adapt after receiving several doses of incentive. An agent who once observes some behavior might not change theirs directly, but only after subsequently observing the same behavior, i.e. after accumulating a certain number of doses. Observing contrary behavior on the other hand would reduce the stack of doses again.
This essentially means introducing a memory and could also be implemented as remembering a fixed number of time steps and changing behavior if a personal threshold is exceeded.
The text was updated successfully, but these errors were encountered:
leander-j
added
the
user story
marks issues that are formulated in the style of a user story, e.g. "A model composer can..."
label
Dec 17, 2021
Some agents only adapt after receiving several doses of incentive. An agent who once observes some behavior might not change theirs directly, but only after subsequently observing the same behavior, i.e. after accumulating a certain number of doses. Observing contrary behavior on the other hand would reduce the stack of doses again.
This essentially means introducing a memory and could also be implemented as remembering a fixed number of time steps and changing behavior if a personal threshold is exceeded.
The text was updated successfully, but these errors were encountered: