Skip to the content.

NakbaVirality: Multimodal and Textual Virality Prediction in High-Stakes Discourse

1. Motivation and Introduction

Social media has become the primary battleground for narrative control during geopolitical conflicts. The discourse surrounding the Nakba and the post-October 7th war on Gaza represents a unique intersection of historical trauma, real-time war reporting, and highly polarized sentiment.

Understanding what makes a post “go viral” in this specific context is critical for analyzing information diffusion, propaganda spread, and public sentiment. This shared task proposes a novel challenge: predicting the reach (virality) and engagement (interaction) of posts where the content is emotionally charged, historically deep, and often multimodal (text combined with graphic or symbolic imagery).

By focusing on this specific domain, we aim to push the boundaries of how NLP and Computer Vision models handle context-heavy, sensitive, and polarizing data.

2. Data Description

We present a curated, multi-platform dataset specifically designed for this task.

Note on Ethics: All data will be anonymized to protect user privacy, given the sensitive nature of the topic. IDs and handles will be hashed.

3. Task Definitions

We propose two distinct tasks to evaluate model performance on different modalities and prediction objectives.

Task 1: Multimodal Virality Classification

This task focuses on the interplay between text and imagery. In conflict zones, an image often determines the spread of a post more than the text.

Task 2: Textual Virality and Interaction Prediction (Regression)

This task isolates the textual component to understand how rhetoric, sentiment, and specific keywords drive engagement.

4. Baseline Systems

To assist participants, the organizers will provide the following baselines:

5. Significance and Impact

This shared task contributes to the NLP community by:

6. Tentative Schedule

7. Organizers

Participation Guidelines