Skip to content

A Persian Reimplementation of Prof. Eliot Ash's Framework, Relatio

Notifications You must be signed in to change notification settings

MohammadForouhesh/persian-relatio

Repository files navigation

A Persian Reimplementation of Prof. Eliot Ash's Framework, Relatio

Mohammad H. Forouhesh

Metodata Inc ®

April 25, 2022

A Persian Reimplementation of Prof. Eliot Ash's Framework, Relatio

E. Ash, et al. Text Semantics Capture Political and Economic Narratives

Table of Contents

  1. A Brief Overview
  2. Main Problem
  3. Illustrative Example
  4. I/O
  5. Motivation
  6. Related Works
  7. Contributions of this paper
  8. Proposed Method
  9. Experiments

A Brief Overview

The aim of this project is to quantify latent narrative structures in text documents unsupervised. It maps explicit relations between entity groups and identifies coherent entity groups. In particular, we propose a new method of satisfying this requirement - by identifying who does what to whom and by mapping the relationships and interactions among entities in a corpus. By analysing the Persian Twitter in recent years, our team provides an analysis of political and economic narratives. We demonstrate how narratives are dynamic, sentimental, polarised, and interconnected in the political discourse.

Main Problem:

Is it possible to represent a list of simple narrative statements as a directed multigraph in which the edges are actions and the nodes are entities?
The time interval of analysis in this paper is 1994-2015.

Illustrative Example:

To determine who said what to whom in a paragraph, the proposed method uses semantic role labelling, which identifies the semantic roles in words. In the case of "Millions of Americans lost their unemployment benefits", the narrative graph would look like this:
> بازار کریپتو ---هدر داد---> وقت و پول من را

I/O:

  • Input: Tweets (textual modality)
  • Output: Predicted salient narratives around historical events.

Motivation:

  1. The articulation of partisan values can be discovered by arranging narratives by relative party usage.
  2. Narratives provide a new window on polarisation of language in politics.
  3. Analysing narratives quantitatively is still largely unexplored.
  4. Narrative statements are intuitive and close to the original raw text.
  5. Various narratives can form a broader discourse.

Related (Previous) Works:

According to the dependency parser method, it is divided into 3 categories:

  1. Dictionary methods rely on matching particular words or phrases Baker, Bloom and Davis, 2016; Shiller, 2019; Enke, 2020
  2. Unsupervised learning methods such as topic models and document embeddings break sentences down into words or phrases and ignore grammatical information Hansen, McMahon and Prat, 2017; Larsen and Thorsrud, 2019; Bybee et al., 2020.
  3. syntactic dependency parsers, which identify grammatical relationships between words Ash et al., 2020, will often miss how actors are related in a sentence.

Contributions of this paper:

  1. Multigraph approach that links up entities and their associated actions through a network.
  2. Robustness to word ordering.
  3. Giving qualitative researchers a rich context.
  4. This method can provide insights into narrative discourse and polarisation through node centrality and graph distance measures.

Proposed Method:

Stage I: (Feature Extraction)

Given a sentence, using Semantic Role Labelling, subject (agent or who), action (verb or what) and target (patient or whom) of the sentence is extracted. A0, V, and A1 are the corresponding set of phrases. Define the set of all narratives N ⊂ A0 ⨉ V ⨉ A1 = S. This set can be too high-dimensional.

Stage II: (Dimensionality Reduction)

Construct the set E of latent entities such that: (not for verbs)

  • |E| < |A0 U A1 |

  • N = E ⨉ V ⨉ E

    a. Construction

    If an entity is frequent, then it is explicit, otherwise, it is implicit. Sometimes, a sentence entity has no surface form, this is also categorised under the umbrella term of implicit entity.

    1. Explicit: Apply Named Entity Recognition, then chose top L frequent entities.
    2. Implicit: Embed sentence then apply K-Means, each cluster represents a latent entity.

Stage III: (Pipeline Flowchart)

ash0

Stage IV: (Narrative Multi-directional Graph)

Find the narrative representation of each sentence, create complex narratives by combination. These complex narratives can be analysed using the degree of input and output.

Experiments:

Datasets:

U.S. Congressional Record, 1994-2015. Transcripts of speeches made in the House and Senate with names and party affiliations. Often used as a data source for text analysis in social science applications.

Hyperparams:

L = 1000 K = 1000

Results:

In short narratives capture the following:

  • Historical Events: e.g. Sep 11, war on terror
  • Sentiment Polarity
  • Partisanship
  • Debate Structure

A sample from the output:

IMG-20220404-WA0001

About

A Persian Reimplementation of Prof. Eliot Ash's Framework, Relatio

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published