tailwiz: Empowering Domain Experts with Easy-to-Use, Task-Specific Natural Language Processing Models

Timothy Dai, Austin Peters, Jonah B. Gelbach, David Freeman Engstrom, Daniel Kang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Experts outside the field of machine learning (ML) are interested in using ML techniques to analyze their textual data, but they are inhibited by a lack of convenient natural language processing (NLP) tools. To address this issue, we present tailwiz, an easy-to-use Python tool, powered by supervised fine-tuning of NLP models. tailwiz caters to domain experts by abstracting away technical ML knowledge and running conveniently on personal computers, the preferred mode of computation among domain experts. We show that tailwiz outperforms domain experts' current textual analysis techniques on a majority of real-world tasks, up to a 384.8% F1 increase (46.18% absolute increase). tailwiz consistently outperforms GPT-3.5-Turbo on such tasks, showing the need for fine-tuned NLP models to perform domain-specific tasks that meet the analytical demands of domain experts.

Original languageEnglish (US)
Title of host publicationProceedings of the 8th Workshop on Data Management for End-to-End Machine Learning, DEEM 2024 - In conjunction with the 2024 ACM SIGMOD/PODS Conference
PublisherAssociation for Computing Machinery
Pages12-22
Number of pages11
ISBN (Electronic)9798400706110
DOIs
StatePublished - Jun 9 2024
Event8th Workshop on Data Management for End-to-End Machine Learning, DEEM 2024 - Santiago, Chile
Duration: Jun 9 2024Jun 9 2024

Publication series

NameProceedings of the 8th Workshop on Data Management for End-to-End Machine Learning, DEEM 2024 - In conjunction with the 2024 ACM SIGMOD/PODS Conference

Conference

Conference8th Workshop on Data Management for End-to-End Machine Learning, DEEM 2024
Country/TerritoryChile
CitySantiago
Period6/9/246/9/24

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Hardware and Architecture
  • Sociology and Political Science

Fingerprint

Dive into the research topics of 'tailwiz: Empowering Domain Experts with Easy-to-Use, Task-Specific Natural Language Processing Models'. Together they form a unique fingerprint.

Cite this