home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: PROPN

There are 70 PROPN lemmas (0%), 74 PROPN types (0%) and 245 PROPN tokens (0%). Out of 16 observed tags, the rank of PROPN is: 6 in number of lemmas, 9 in number of types and 15 in number of tokens.

The 10 most frequent PROPN lemmas: بِن، عَبداَللّٰه، عَبداَلعَزِيز، طٰهٰ، بُورسَعِيد، أَبُو، عَبداَلمُنعِم، نَرُوج، أَبُوظَبِي، أَبُورُدَينَة

The 10 most frequent PROPN types: بن، عبدالله، عبدالعزيز، طه، بورسعيد، أبو، النروج، عبدالمنعم، يومبلغاز، أبوردينة

The 10 most frequent ambiguous lemmas: بِن (PROPN 104, NOUN 1), أُمّ (NOUN 13, PROPN 2)

The 10 most frequent ambiguous types: بن (PROPN 104, NOUN 1), أبو (X 63, PROPN 6, NOUN 5), النروج (PROPN 5, X 2), أم (CCONJ 12, NOUN 2, PROPN 2), الفليبين (PROPN 2, X 1), بدر (PROPN 2, X 2, NOUN 1), ميلوسيفيتش (PROPN 2, X 2), وليام (PROPN 2, X 2), أبي (NOUN 2, PROPN 1), البدري (PROPN 1, X 1)

Morphology

The form / lemma ratio of PROPN is 1.057143 (the average of all parts of speech is 1.761701).

The 1st highest number of forms (3) was observed with the lemma “أَبُوظَبِي”: أبوظبي, ابوظبى, ابوظبي.

The 2nd highest number of forms (2) was observed with the lemma “أَبُورُدَينَة”: أبوردينة, ابوردينة.

The 3rd highest number of forms (2) was observed with the lemma “عَبداَللّٰه”: عبدالله, عبداللٰه.

PROPN occurs with 4 features: Definite (24; 10% instances), Case (3; 1% instances), Gender (3; 1% instances), Number (3; 1% instances)

PROPN occurs with 5 feature-value pairs: Case=Gen, Definite=Cons, Definite=Def, Gender=Fem, Number=Sing

PROPN occurs with 5 feature combinations. The most frequent feature combination is _ (221 tokens). Examples: بن، عبدالله، عبدالعزيز، طه، بورسعيد، أبو، عبدالمنعم، يومبلغاز، أبوردينة، أم

Relations

PROPN nodes are attached to their parents using 11 different relations: nmod (177; 72% instances), conj (30; 12% instances), nsubj (22; 9% instances), root (4; 2% instances), obl (3; 1% instances), appos (2; 1% instances), dep (2; 1% instances), obj (2; 1% instances), cop (1; 0% instances), obl:arg (1; 0% instances), orphan (1; 0% instances)

Parents of PROPN nodes belong to 8 different parts of speech: NOUN (103; 42% instances), X (92; 38% instances), VERB (19; 8% instances), PROPN (17; 7% instances), ADJ (5; 2% instances), (4; 2% instances), CCONJ (3; 1% instances), NUM (2; 1% instances)

89 (36%) PROPN nodes are leaves.

79 (32%) PROPN nodes have one child.

66 (27%) PROPN nodes have two children.

11 (4%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.

Children of PROPN nodes are attached using 10 different relations: nmod (173; 70% instances), cc (31; 13% instances), case (21; 8% instances), punct (8; 3% instances), acl (4; 2% instances), amod (4; 2% instances), conj (3; 1% instances), dep (2; 1% instances), mark (1; 0% instances), nummod (1; 0% instances)

Children of PROPN nodes belong to 9 different parts of speech: X (138; 56% instances), CCONJ (29; 12% instances), NOUN (23; 9% instances), ADP (22; 9% instances), PROPN (17; 7% instances), PUNCT (8; 3% instances), ADJ (5; 2% instances), VERB (4; 2% instances), NUM (2; 1% instances)