TY - GEN
T1 - Towards Dark Jargon Interpretation in Underground Forums
AU - Seyler, Dominic
AU - Liu, Wei
AU - Wang, Xiao Feng
AU - Zhai, Cheng Xiang
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Dark jargons are benign-looking words that have hidden, sinister meanings and are used by participants of underground forums for illicit behavior. For example, the dark term “rat” is often used in lieu of “Remote Access Trojan”. In this work we present a novel method towards automatically identifying and interpreting dark jargons. We formalize the problem as a mapping from dark words to “clean” words with no hidden meaning. Our method makes use of interpretable representations of dark and clean words in the form of probability distributions over a shared vocabulary. In our experiments we show our method to be effective in terms of dark jargon identification, as it outperforms another baseline on simulated data. Using manual evaluation, we show that our method is able to detect dark jargons in a real-world underground forum dataset.
AB - Dark jargons are benign-looking words that have hidden, sinister meanings and are used by participants of underground forums for illicit behavior. For example, the dark term “rat” is often used in lieu of “Remote Access Trojan”. In this work we present a novel method towards automatically identifying and interpreting dark jargons. We formalize the problem as a mapping from dark words to “clean” words with no hidden meaning. Our method makes use of interpretable representations of dark and clean words in the form of probability distributions over a shared vocabulary. In our experiments we show our method to be effective in terms of dark jargon identification, as it outperforms another baseline on simulated data. Using manual evaluation, we show that our method is able to detect dark jargons in a real-world underground forum dataset.
KW - Dark jargon
KW - Hidden meaning interpretation
KW - NLP
UR - http://www.scopus.com/inward/record.url?scp=85107384853&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85107384853&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-72240-1_40
DO - 10.1007/978-3-030-72240-1_40
M3 - Conference contribution
AN - SCOPUS:85107384853
SN - 9783030722395
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 393
EP - 400
BT - Advances in Information Retrieval - 43rd European Conference on IR Research, ECIR 2021, Proceedings
A2 - Hiemstra, Djoerd
A2 - Moens, Marie-Francine
A2 - Mothe, Josiane
A2 - Perego, Raffaele
A2 - Potthast, Martin
A2 - Sebastiani, Fabrizio
PB - Springer
T2 - 43rd European Conference on Information Retrieval, ECIR 2021
Y2 - 28 March 2021 through 1 April 2021
ER -