TY - GEN
T1 - Mining diverse opinions
AU - Srivatsa, Mudhakar
AU - Lee, Sihyung
AU - Abdelzaher, Tarek
PY - 2012
Y1 - 2012
N2 - Network operations that support tactical missions are often characterized by evolving information that needs to be delivered over bandwidth constrained communication networks and presented to a social/cognitive network with limited human attention span and high stress. Most past research efforts on data dissemination examined syntactic redundancy between data items (e.g., common bit strings, entropy coding and compression, etc.), but only limited work has examined the problem of reducing semantic redundancy with the goal of providing higher quality information to end users. In this paper we propose to measure semantic redundancy in large volume text streams using online topic models and opinion analysis (e.g., topic = Location X and opinion = possible-hazard +, safe-zone-). By suppressing semantically redundant content one can better utilize bottleneck resources such as bandwidth on a resource constrained network or attention time of a human user. However, unlike syntactic redundancy (e.g., lossless compression, lossy compression with small reconstruction errors), a semantic redundancy based approach is faced with the challenge of having to deal with larger inaccuracies (e.g., false positive and false negative probabilities in an opinion classifier). This paper seeks to quantify the effectiveness of a semantic redundancy based approach (over its syntactic counterparts) as a function of such inaccuracies and present a detailed experimental evaluation using realistic information flows collected from an enterprise network with about 1500 users1.
AB - Network operations that support tactical missions are often characterized by evolving information that needs to be delivered over bandwidth constrained communication networks and presented to a social/cognitive network with limited human attention span and high stress. Most past research efforts on data dissemination examined syntactic redundancy between data items (e.g., common bit strings, entropy coding and compression, etc.), but only limited work has examined the problem of reducing semantic redundancy with the goal of providing higher quality information to end users. In this paper we propose to measure semantic redundancy in large volume text streams using online topic models and opinion analysis (e.g., topic = Location X and opinion = possible-hazard +, safe-zone-). By suppressing semantically redundant content one can better utilize bottleneck resources such as bandwidth on a resource constrained network or attention time of a human user. However, unlike syntactic redundancy (e.g., lossless compression, lossy compression with small reconstruction errors), a semantic redundancy based approach is faced with the challenge of having to deal with larger inaccuracies (e.g., false positive and false negative probabilities in an opinion classifier). This paper seeks to quantify the effectiveness of a semantic redundancy based approach (over its syntactic counterparts) as a function of such inaccuracies and present a detailed experimental evaluation using realistic information flows collected from an enterprise network with about 1500 users1.
UR - http://www.scopus.com/inward/record.url?scp=84874318030&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84874318030&partnerID=8YFLogxK
U2 - 10.1109/MILCOM.2012.6415602
DO - 10.1109/MILCOM.2012.6415602
M3 - Conference contribution
AN - SCOPUS:84874318030
SN - 9781467317290
T3 - Proceedings - IEEE Military Communications Conference MILCOM
BT - MILCOM 2012 - 2012 IEEE Military Communications Conference
T2 - 2012 IEEE Military Communications Conference, MILCOM 2012
Y2 - 1 November 2012 through 1 November 2012
ER -