TY - JOUR
T1 - Cost Function Learning in Memorized Social Networks With Cognitive Behavioral Asymmetry
AU - Mao, Yanbing
AU - Li, Jinning
AU - Hovakimyan, Naira
AU - Abdelzaher, Tarek
AU - Lebiere, Christian
N1 - This work was supported in part by DoD Basic Research Office under Award HQ00342110002 and in part by the Defense Advanced Research Projects Agency under Award HR001121C0165 and Award HR00112290105.
PY - 2024/2/1
Y1 - 2024/2/1
N2 - This article investigates the cost function learning in social information networks, wherein human memory and cognitive bias are explicitly taken into account. We first propose a model for social information-diffusion dynamics, with a focus on the systematic modeling of asymmetric cognitive bias represented by confirmation bias and novelty bias. Building on the dynamics model, we then propose the M3IRL - a memorized model and maximum-entropy-based inverse reinforcement learning - for learning cost functions. Compared with the existing model-free IRLs, the characteristics of M3IRL are significantly different here: no dependence on the Markov decision process principle, the need for only a single finite-time trajectory sample, and bounded decision variables. Finally, the effectiveness of the proposed social information-diffusion model and the M3IRL algorithm is validated by the online social media data.
AB - This article investigates the cost function learning in social information networks, wherein human memory and cognitive bias are explicitly taken into account. We first propose a model for social information-diffusion dynamics, with a focus on the systematic modeling of asymmetric cognitive bias represented by confirmation bias and novelty bias. Building on the dynamics model, we then propose the M3IRL - a memorized model and maximum-entropy-based inverse reinforcement learning - for learning cost functions. Compared with the existing model-free IRLs, the characteristics of M3IRL are significantly different here: no dependence on the Markov decision process principle, the need for only a single finite-time trajectory sample, and bounded decision variables. Finally, the effectiveness of the proposed social information-diffusion model and the M3IRL algorithm is validated by the online social media data.
KW - Asymmetric confirmation bias
KW - asymmetric novelty bias
KW - cost function learning
KW - human memory
KW - inverse reinforcement learning (IRL)
KW - social information-diffusion dynamics
UR - http://www.scopus.com/inward/record.url?scp=85141588000&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85141588000&partnerID=8YFLogxK
U2 - 10.1109/TCSS.2022.3218485
DO - 10.1109/TCSS.2022.3218485
M3 - Article
AN - SCOPUS:85141588000
SN - 2329-924X
VL - 11
SP - 418
EP - 430
JO - IEEE Transactions on Computational Social Systems
JF - IEEE Transactions on Computational Social Systems
IS - 1
ER -