3arif: A corpus of modern standard and Egyptian Arabic tweets annotated for epistemic modality using Interactive Crowdsourcing

Rania Al-Sabbagh, Corina R Girju, Jana Diesner

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We present 3arif, a large-scale corpus of Modern Standard and Egyptian Arabic tweets annotated for epistemic modality. To create 3arif, we design an interactive crowdsourcing annotation procedure that splits up the annotation process into a series of simplified questions, dispenses with the requirement for expert linguistic knowledge and captures nested modality triggers and their attributes semiautomatically.

Original language: English (US)
Title of host publication: 25th International Conference on Computational Linguistics
Subtitle of host publication: Technical Papers
Place of Publication: Dublin
Publisher: Association for Computational Linguistics, ACL Anthology
Pages: 1521-1532
Number of pages: 12
ISBN (Electronic): 9781941643266
State: Published - Jan 1 2014
Event: 25th International Conference on Computational Linguistics, COLING 2014 - Dublin, Ireland
Duration: Aug 23 2014 - Aug 29 2014

Other

Other: 25th International Conference on Computational Linguistics, COLING 2014
Country: Ireland
City: Dublin
Period: 8/23/14 - 8/29/14

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Al-Sabbagh, R., Girju, C. R., & Diesner, J. (2014). 3arif: A corpus of modern standard and Egyptian Arabic tweets annotated for epistemic modality using Interactive Crowdsourcing. In 25th International Conference on Computational Linguistics: Technical Papers (pp. 1521-1532). Dublin: Association for Computational Linguistics, ACL Anthology.

@inproceedings{5db21c84baa1492fbdb91f2502cbdf3f,
title = "3arif: A corpus of modern standard and Egyptian Arabic tweets annotated for epistemic modality using Interactive Crowdsourcing",
abstract = "We present 3arif, a large-scale corpus of Modern Standard and Egyptian Arabic tweets annotated for epistemic modality. To create 3arif, we design an interactive crowdsourcing annotation procedure that splits up the annotation process into a series of simplified questions, dispenses with the requirement for expert linguistic knowledge and captures nested modality triggers and their attributes semiautomatically.",
author = "Rania Al-Sabbagh and Girju, {Corina R} and Jana Diesner",
year = "2014",
month = "1",
day = "1",
language = "English (US)",
pages = "1521--1532",
booktitle = "25th International Conference on Computational Linguistics",
publisher = "Association for Computational Linguistics, ACL Anthology",

}

TY - GEN

T1 - 3arif

T2 - A corpus of modern standard and Egyptian Arabic tweets annotated for epistemic modality using Interactive Crowdsourcing

AU - Al-Sabbagh, Rania

AU - Girju, Corina R

AU - Diesner, Jana

PY - 2014/1/1

Y1 - 2014/1/1

N2 - We present 3arif, a large-scale corpus of Modern Standard and Egyptian Arabic tweets annotated for epistemic modality. To create 3arif, we design an interactive crowdsourcing annotation procedure that splits up the annotation process into a series of simplified questions, dispenses with the requirement for expert linguistic knowledge and captures nested modality triggers and their attributes semiautomatically.

UR - http://www.scopus.com/inward/record.url?scp=84942619791&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84942619791&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84942619791

SP - 1521

EP - 1532

BT - 25th International Conference on Computational Linguistics

PB - Association for Computational Linguistics, ACL Anthology

CY - Dublin

ER -