Editors

Tirşe Erbaysal Filibeli, Tunga Güngör

Language

English

March 2025
72 pages
Free of charge

The report titled "Utilizing AI Against Hate Speech: A Guide to Annotation, Classification, and Detection" consists of two main sections: one detailing the development process of the AI tool pari, and another focusing on the labeling guide and model development stages.

In addition to the ethnic, religious, national, and sexual identities examined in the foundation’s media monitoring work since 2009, new groups targeted in agenda tracking on X were identified. The report explores how certain identity groups in Turkey are exposed to hate speech on social media, the process of collecting and labeling this data, and its transformation into an AI tool. Data collected through selected hashtags and keywords reveal the spread of hate speech against nine identity groups and the reinforcement of negative judgments about these identities. The first section describes the labeling process of the data collected for training pari. The following section, "Developing the AI Model," covers the processing of labeled data and model training.

Guidelines were created to address the subjective nature of data labeling and the limitations of the AI model, particularly for challenging elements such as sarcastic content and implicit hate speech. This framework serves both the current project and future research on hate speech.

Please review the complete report to explore the full process of collecting, labeling, and processing 16,254 tweets categorized as hate speech—as well as the hate speech categories and example tweets.

Report title
Utilizing AI Against Hate Speech: A Guide to Annotation, Classification, and Detection
ISBN
9786057183590
Page numbers
72
Width
160 mm
Height
225 mm
Printing
March 2025
Language
English
Publication coordinators
İnanç Arın, Didar Akar, Başak Can, Somaiyeh Dehghan, Elif Erol, Burak Işık, Sıla Kartal, Buket Kapısız, Nayat Karaköse, Yasemin Korkmaz, Arzucan Özgür, Nural Özel, Pelin Önal, Gökçe Uludoğan, Ayşecan Terzioğlu, Murat Tercan, İrem Topçu, Tuğba Özsoy, Berrin Yanıkoğlu, Elif Yararbaş, Umut Şen
Project coordinators
Başak Can, Elif Erol, Buket Kapısız, Yasemin Korkmaz, Pelin Önal, Tuğba Özsoy, Elif Yararbaş
Editors
Tirşe Erbaysal Filibeli, Tunga Güngör 
Translators
Simon Charles Popay, Burcu Becermen
Proofreader
Neil Patrick Doherty
Design and data visualization
Yasemen Cemre Gürbüz
Graphic application
Selin Uluer
Printed by
Sena Ofset Ambalaj Sanayi ve Ticaret. Ltd. Şti.

Utilizing Digital Technology for Social Cohesion, Positive Messaging and Peace by Boosting Collaboration, Exchange and Solidarity project is supported by the European Union and the Friedrich Naumann Foundation.

Hrant Dink Foundation is solely responsible for the content in the publication, which does not reflect the views of supporters.

  • Introduction
  • The “Utilizing Digital Technology for Social Cohesion, Positive Messaging and Peace by Boosting Collaboration, Exchange and Solidarity” project
  • Hate Speech Labeling Guide
    • How do we decide whether discourse is hate speech?
    • Labeling interface
      - Determining the target group
      - Determining the type of speech
      - Determining the category of hate speech
      - Assessing challenging examples
      - Additional labeling headings
  • Developing the AI model 
    • Data collection & annotation
    • Developed AI tool for detecting and measuring hate speech
    • Error analysis
    • Limitations
  • Conclusion 
  • Appendix A: Labeling interface