TY - GEN
T1 - Detecting and modeling local text reuse
AU - Smith, David A.
AU - Cordell, Ryan
AU - Dillon, Elizabeth Maddock
AU - Stramp, Nick
AU - Wilkerson, John
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/12
Y1 - 2014/12
N2 - Texts propagate through many social networks and provide evidence for their structure. We describe and evaluate efficient algorithms for detecting clusters of reused passages embedded within longer documents in large collections. We apply these techniques to two case studies: analyzing the culture of free reprinting in the nineteenth-century United States and the development of bills into legislation in the U.S. Congress. Using these divergent case studies, we evaluate both the efficiency of the approximate local text reuse detection methods and the accuracy of the results. These techniques allow us to explore how ideas spread, which ideas spread, and which subgroups shared ideas.
AB - Texts propagate through many social networks and provide evidence for their structure. We describe and evaluate efficient algorithms for detecting clusters of reused passages embedded within longer documents in large collections. We apply these techniques to two case studies: analyzing the culture of free reprinting in the nineteenth-century United States and the development of bills into legislation in the U.S. Congress. Using these divergent case studies, we evaluate both the efficiency of the approximate local text reuse detection methods and the accuracy of the results. These techniques allow us to explore how ideas spread, which ideas spread, and which subgroups shared ideas.
UR - http://www.scopus.com/inward/record.url?scp=84919386196&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84919386196&partnerID=8YFLogxK
U2 - 10.1109/JCDL.2014.6970166
DO - 10.1109/JCDL.2014.6970166
M3 - Conference contribution
AN - SCOPUS:84919386196
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 183
EP - 192
BT - 2014 IEEE/ACM Joint Conference on Digital Libraries, JCDL 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 14th IEEE/ACM Joint Conference on Digital Libraries, JCDL 2014
Y2 - 8 September 2014 through 12 September 2014
ER -