Beja

Beja Guidelines #

NB: This page is under construction.

Publication #

An overview of the SUD annotation of the Beja corpus is available in the paper: A morph-based and a word-based treebank for Beja.

Annotation at the morph level #

The SUD corpus of Beja is first annotated at the morph level (mSUD_Beja-Autogramm).

In the UD repository, the word-based corpus is released as UD_Beja-Autogramm.

The two other combinations are also available:

  • SUD_Beja-Autogramm: the data following the SUD guidelines, but at the word level
  • mUD_Beja-Autogramm: the data following the UD guidelines, but at the morph level

The table below shows how the conversions are made in order to produce all the corpora described above.

|             | SUD                       | UD                          |
|-------------|---------------------------|-----------------------------|
| morph-based | mSUD_Beja-Autogramm gh gm | mUD_Beja-Autogramm_MB gh gm |
| word-based  | SUD_Beja-Autogramm gh gm  | UD_Beja-Autogramm gh gm     |
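
The four combinations above can be read as a two-by-two grid: one axis is the guideline set (SUD or UD), the other is the annotation level (morph or word). The Python sketch below only illustrates that naming scheme. The corpus names are copied from this page; the `CORPORA` mapping and the `corpus_name` helper are hypothetical and are not part of any released tool.

```python
# Illustrative lookup only: CORPORA and corpus_name are hypothetical names.
# The corpus names themselves are copied from this page.
CORPORA = {
    ("SUD", "morph"): "mSUD_Beja-Autogramm",
    ("SUD", "word"): "SUD_Beja-Autogramm",
    # The table on this page shows this corpus with an "_MB" suffix;
    # the name used here is the one given in the list.
    ("UD", "morph"): "mUD_Beja-Autogramm",
    ("UD", "word"): "UD_Beja-Autogramm",
}


def corpus_name(guidelines: str, level: str) -> str:
    """Return the corpus name for a guideline set ("SUD" or "UD")
    and an annotation level ("morph" or "word")."""
    try:
        return CORPORA[(guidelines, level)]
    except KeyError:
        raise ValueError(
            f"unknown combination: guidelines={guidelines!r}, level={level!r}"
        ) from None


if __name__ == "__main__":
    for (guidelines, level), name in CORPORA.items():
        print(f"{guidelines:>3} / {level:<5} -> {name}")
```

Keying the mapping on a `(guidelines, level)` pair keeps the two axes explicit, so an unsupported combination fails with a clear error instead of yielding a wrong name.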