Reliability in performance assessment creates a potential application of artificial intelligence in veterinary education: evaluation of suturing skills at a single institution

Jennifer Kuzminsky, Heidi Phillips, Hajar Sharif, Clara Moran, Hadley E. Gleason, Sophia P. Topulos, Kathryn Pitt, Leslie Klis McNeil, Annette M. McCoy, Thenkurussi Kesavadas

Research output: Contribution to journalArticlepeer-review

Abstract

OBJECTIVES To evaluate suturing skills of veterinary students using 3 common performance assessments (PAs) and to compare findings to data obtained by an electromyographic armband. SAMPLE 16 second-year veterinary students. PROCEDURES Students performed 4 suturing tasks on synthetic tissue models 1, 3, and 5 weeks after a surgical skills course. Digital videos were scored by 4 expert surgeons using 3 PAs (an Objective Structured Clinical Examination [OSCE]-style surgical binary checklist, an Objective Structured Assessment of Technical Skill [OSATS] checklist, and a surgical Global Rating Scale [GRS]). Surface electromyography (sEMG) data collected from the dominant forearm were input to machine learning algorithms. Performance assessment scores were compared between experts and correlated to task completion times and sEMG data. Inter-rater reliability was calculated using the intraclass correlation coefficient (ICC). Inter-rater agreement was calculated using percent agreement with varying levels of tolerance. RESULTS Reliability was moderate for the OSCE and OSATS checklists and poor for the GRS. Agreement was achieved for the checklists when moderate tolerance was applied but remained poor for the GRS. sEMG signals did not correlate well with checklist scores or task times, but features extracted from signals permitted task differentiation by routine statistical comparison and correct task classification using machine learning algorithms. CLINICAL RELEVANCE Reliability and agreement of an OSCE-style checklist, OSATS checklist, and surgical GRS assessment were insufficient to characterize suturing skills of veterinary students. To avoid subjectivity associated with PA by raters, further study of kinematics and EMG data is warranted in the surgical skills evaluation of veterinary students.

Original languageEnglish (US)
JournalAmerican journal of veterinary research
Volume84
Issue number8
DOIs
StatePublished - Aug 2023

ASJC Scopus subject areas

  • General Veterinary

Fingerprint

Dive into the research topics of 'Reliability in performance assessment creates a potential application of artificial intelligence in veterinary education: evaluation of suturing skills at a single institution'. Together they form a unique fingerprint.

Cite this