home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Beja-NSC: POS Tags: PRON

There are 1 PRON lemmas (6%), 60 PRON types (5%) and 395 PRON tokens (7%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: _

The 10 most frequent PRON types: =heːb, =i, =oː, ani, =eː, kna, =hoːk, =oːk, =joː, =ji

The 10 most frequent ambiguous lemmas: _ (PUNCT 1126, VERB 1097, DET 933, NOUN 894, ADP 408, PRON 395, SCONJ 298, PART 167, CCONJ 160, AUX 125, ADV 104, ADJ 77, PROPN 32, INTJ 28, NUM 26, X 18)

The 10 most frequent ambiguous types: =i (ADP 61, PRON 48, AUX 12, SCONJ 10), ani (PRON 27, VERB 9, AUX 2), =eː (PRON 23, ADP 13, SCONJ 12, DET 1), =ji (PRON 9, ADP 7, SCONJ 3, AUX 1), =jeː (SCONJ 13, PRON 6, ADP 3), hoː (PRON 6, NOUN 1), =eːk (SCONJ 10, PRON 2), i= (DET 145, PRON 2, SCONJ 2), jhaː (PRON 2, INTJ 1), nafs (NOUN 2, PRON 2)

Morphology

The form / lemma ratio of PRON is 60.000000 (the average of all parts of speech is 76.500000).

The 1st highest number of forms (60) was observed with the lemma “_”: =aː, =aːk, =b, =eː, =eːk, =heːb, =hi, =hoːk, =hoːn, =i, =iheː, =ihi, =iji, =ijoː, =ijoːk, =iːsi, =iːsiː, =iːsoː, =jaː, =jeː, =ji, =joː, =joːk, =juːk, =oː, =oːk, =oːkna, =oːn, =saj, =siːsi, =t, =uː, =uːn, aneːb, ani, aniː, barijoːk, barjoː, baroːk, baruː, baruːk, beːn, hinin, hoː, i=, imbareː, jhaː, ji=, kina, kna, nafs, naː, naːn, oːn, ti=, umbaruː, umbaruːk, wi=, ʔani, ʔaːw.

PRON occurs with 11 features: Number (360; 91% instances), Person (353; 89% instances), Case (286; 72% instances), Poss (193; 49% instances), Gender (28; 7% instances), Reflex (19; 5% instances), PronType (10; 3% instances), Polite (9; 2% instances), PartType (6; 2% instances), Definite (3; 1% instances), Deixis (2; 1% instances)

PRON occurs with 22 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Definite=Def, Deixis=Prox, Deixis=Remt, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, PartType=Int, Person=1, Person=2, Person=3, Polite=Form, Poss=Yes, PronType=Dem, PronType=Rel, Reflex=Yes

PRON occurs with 51 feature combinations. The most frequent feature combination is Number=Sing|Person=1 (73 tokens). Examples: =heːb, =oː, hoː, =joː, aneːb

Relations

PRON nodes are attached to their parents using 19 different relations: nmod:poss (142; 36% instances), obj (112; 28% instances), obl:mod (36; 9% instances), nsubj (35; 9% instances), nmod (14; 4% instances), dep:comp (13; 3% instances), discourse (9; 2% instances), dislocated:subj (9; 2% instances), iobj (7; 2% instances), obl:arg (4; 1% instances), acl:relcl (2; 1% instances), dislocated (2; 1% instances), dislocated:obj (2; 1% instances), reparandum (2; 1% instances), vocative (2; 1% instances), dep (1; 0% instances), dep:conj (1; 0% instances), parataxis (1; 0% instances), root (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (176; 45% instances), NOUN (150; 38% instances), ADP (32; 8% instances), ADJ (15; 4% instances), PRON (11; 3% instances), PROPN (3; 1% instances), PART (2; 1% instances), ADV (1; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances), (1; 0% instances), X (1; 0% instances)

326 (83%) PRON nodes are leaves.

43 (11%) PRON nodes have one child.

21 (5%) PRON nodes have two children.

5 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.

Children of PRON nodes are attached using 13 different relations: punct (37; 36% instances), det (30; 29% instances), discourse (10; 10% instances), nmod:poss (7; 7% instances), case (5; 5% instances), cc (4; 4% instances), dep (2; 2% instances), dep:comp (2; 2% instances), nmod (2; 2% instances), cop (1; 1% instances), dep:conj (1; 1% instances), dislocated:subj (1; 1% instances), vocative (1; 1% instances)

Children of PRON nodes belong to 11 different parts of speech: PUNCT (37; 36% instances), DET (35; 34% instances), PRON (11; 11% instances), ADP (7; 7% instances), CCONJ (4; 4% instances), PART (3; 3% instances), INTJ (2; 2% instances), AUX (1; 1% instances), NOUN (1; 1% instances), SCONJ (1; 1% instances), X (1; 1% instances)