TY - GEN
T1 - Using citation bias to guide better sampling of scientific literature
AU - Fu, Yuanxi
AU - Yuan, Jasmine
AU - Schneider, Jodi
N1 - Funding Information:
This research is supported by the Campus Research Board of the University of Illinois at Urbana-Champaign Grant RB21012. We acknowledge Dr. Ymkje Anna de Vries and Prof. Marcus Munafò for making their dataset available under the Non-Commercial Government Licence v2.0 and Zhonghe Wan for her assistance in statistical calculations.
Funding Information:
This research is supported by the Campus eR search Board of the University of Illinois at Urbana-Champaign Grant RB21012. We acknowledge Dr. Ymkje Anna de rV ies and Prof. Marcus Munafò for making their dataset available under the onN - Commercial Government Licence v2.0 and Zhonghe Wan for her assistance in statistical calculations.
Publisher Copyright:
© 2021 18th International Conference on Scientometrics and Informetrics, ISSI 2021. All rights reserved.
PY - 2021
Y1 - 2021
N2 - It is rarely possible to cite every relevant work on a topic. When controversy exists in a field, work holding the same opinion as the citing paper (i.e., homophily) is more likely to be cited. Thus, readers may inadvertently select a non-representative sample of articles to read. Here, we begin to develop a method that guides better sampling of scientific literature by designing and testing two new network metrics. The first metric, the ratio between real and expected citation counts, guides users to papers that were cited many fewer times than expected and may represent marginalized findings. The second metric, the relative evidence coupling strength, guides users to papers that may present a unique view of the field. We test our metrics on a known case of citation bias: a network of 73 papers about whether stress is a risk factor for depression. Our metrics select a cross-section of 21 papers. The intersection of the two metrics selects 3 papers that represent all 3 positions of this claim network. In future work we will test our metrics on more datasets, and we will partner with domain experts to verify whether our metrics do identify a diverse sample of research articles.
AB - It is rarely possible to cite every relevant work on a topic. When controversy exists in a field, work holding the same opinion as the citing paper (i.e., homophily) is more likely to be cited. Thus, readers may inadvertently select a non-representative sample of articles to read. Here, we begin to develop a method that guides better sampling of scientific literature by designing and testing two new network metrics. The first metric, the ratio between real and expected citation counts, guides users to papers that were cited many fewer times than expected and may represent marginalized findings. The second metric, the relative evidence coupling strength, guides users to papers that may present a unique view of the field. We test our metrics on a known case of citation bias: a network of 73 papers about whether stress is a risk factor for depression. Our metrics select a cross-section of 21 papers. The intersection of the two metrics selects 3 papers that represent all 3 positions of this claim network. In future work we will test our metrics on more datasets, and we will partner with domain experts to verify whether our metrics do identify a diverse sample of research articles.
UR - http://www.scopus.com/inward/record.url?scp=85112617632&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112617632&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85112617632
T3 - 18th International Conference on Scientometrics and Informetrics, ISSI 2021
SP - 419
EP - 424
BT - 18th International Conference on Scientometrics and Informetrics, ISSI 2021
A2 - Glanzel, Wolfgang
A2 - Heeffer, Sarah
A2 - Chi, Pei-Shan
A2 - Rousseau, Ronald
PB - International Society for Scientometrics and Informetrics
T2 - 18th International Conference on Scientometrics and Informetrics Conference, ISSI 2021
Y2 - 12 July 2021 through 15 July 2021
ER -