This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ar/pos issue tracker

PROPN: proper noun

This document is a placeholder for the language-specific documentation for PROPN.


Treebank Statistics (UD_Arabic)

There are 27 PROPN lemmas (0%), 31 PROPN types (0%) and 187 PROPN tokens (0%). Out of 16 observed tags, the rank of PROPN is: 8 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent PROPN lemmas: بِن، عَبداَللّٰه، عَبداَلعَزِيز، طٰهٰ، بُورسَعِيد، عَبداَلمُنعِم، أَبُوظَبِي، أَبُورُدَينَة، أُمّ، عَبداَلحَلِيم

The 10 most frequent PROPN types: بن، عبدالله، عبدالعزيز، طه، بورسعيد، عبدالمنعم، أبوردينة، أم، ابوظبى، عبدالحليم

The 10 most frequent ambiguous lemmas: أُمّ (NOUN 12, PROPN 2)

The 10 most frequent ambiguous types: أم (CONJ 12, PROPN 2, NOUN 1)

Morphology

The form / lemma ratio of PROPN is 1.148148 (the average of all parts of speech is 1.685612).

The 1st highest number of forms (3) was observed with the lemma “أَبُوظَبِي”: أبوظبي, ابوظبى, ابوظبي.

The 2nd highest number of forms (2) was observed with the lemma “أَبُورُدَينَة”: أبوردينة, ابوردينة.

The 3rd highest number of forms (2) was observed with the lemma “عَبداَللّٰه”: عبدالله, عبداللٰه.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 8 different relations: nmod (153; 82% instances), nsubj (20; 11% instances), conj (4; 2% instances), root (4; 2% instances), appos (2; 1% instances), dep (2; 1% instances), cop (1; 1% instances), dobj (1; 1% instances)

Parents of PROPN nodes belong to 7 different parts of speech: NOUN (80; 43% instances), X (62; 33% instances), VERB (18; 10% instances), PROPN (17; 9% instances), ADJ (5; 3% instances), ROOT (4; 2% instances), CONJ (1; 1% instances)

71 (38%) PROPN nodes are leaves.

53 (28%) PROPN nodes have one child.

51 (27%) PROPN nodes have two children.

12 (6%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.

Children of PROPN nodes are attached using 10 different relations: nmod (168; 86% instances), case (8; 4% instances), punct (6; 3% instances), acl (3; 2% instances), cc (3; 2% instances), conj (3; 2% instances), amod (1; 1% instances), dep (1; 1% instances), mark (1; 1% instances), nummod (1; 1% instances)

Children of PROPN nodes belong to 8 different parts of speech: X (138; 71% instances), NOUN (19; 10% instances), PROPN (17; 9% instances), ADP (9; 5% instances), PUNCT (6; 3% instances), VERB (3; 2% instances), NUM (2; 1% instances), ADJ (1; 1% instances)


PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]