home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: PROPN

There are 4817 PROPN lemmas (88%), 1 PROPN types (6%) and 58325 PROPN tokens (8%). Out of 16 observed tags, the rank of PROPN is: 1 in number of lemmas, 12 in number of types and 5 in number of tokens.

The 10 most frequent PROPN lemmas: _، None، w، TBupdate، .، bry، bwls، Aljmyl، ,، (

The 10 most frequent PROPN types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 216429, PUNCT 72574, ADJ 66760, ADP 62646, VERB 54473, PROPN 48965, ADV 26129, SCONJ 23987, NUM 15122, AUX 6581, DET 6330, PART 5856, CCONJ 5168, PRON 2460, INTJ 54, X 32), None (NOUN 457, X 344, VERB 264, ADJ 125, PROPN 124, ADV 34, CCONJ 20, PRON 16, SCONJ 16, PART 14, ADP 8, DET 6, AUX 2), w (CCONJ 43321, NOUN 190, PUNCT 136, ADP 120, ADV 117, PROPN 78, VERB 71, SCONJ 69, ADJ 55, PRON 33, PART 10, DET 9, NUM 8, AUX 5, X 3), TBupdate (NOUN 401, ADJ 280, VERB 263, X 174, ADV 74, PROPN 69, ADP 4, SCONJ 2, CCONJ 1, DET 1, PART 1, PRON 1), . (NOUN 107, ADJ 95, PROPN 67, PRON 20, VERB 12, PART 6, ADP 5, X 5, CCONJ 3, ADV 2, AUX 2, DET 2, SCONJ 1), bwls (PROPN 40, ADP 1), , (NOUN 100, CCONJ 96, VERB 34, PROPN 33, ADJ 30, ADP 30, PRON 11, SCONJ 11, PART 10, AUX 5, DET 5, ADV 4), ( (PROPN 26, NOUN 9, CCONJ 6, ADJ 3, PART 1, PRON 1, VERB 1), EAlyh (PROPN 23, ADJ 1), Almr (PROPN 22, ADP 1)

The 10 most frequent ambiguous types: _ (NOUN 218254, ADP 91694, PUNCT 75148, ADJ 67604, PROPN 58325, VERB 55215, CCONJ 50032, PRON 31239, ADV 26527, SCONJ 26034, NUM 15147, PART 8612, AUX 7723, DET 6362, X 917, INTJ 56)

Morphology

The form / lemma ratio of PROPN is 0.000208 (the average of all parts of speech is 0.002933).

The 1st highest number of forms (1) was observed with the lemma “!”: _.

The 2nd highest number of forms (1) was observed with the lemma “””: _.

The 3rd highest number of forms (1) was observed with the lemma “$AbAlykyn”: _.

PROPN occurs with 8 features: Gender (54782; 94% instances), Number (54782; 94% instances), Definite (54088; 93% instances), Case (11495; 20% instances), Person (715; 1% instances), Voice (696; 1% instances), Mood (674; 1% instances), Polarity (8; 0% instances)

PROPN occurs with 20 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Com, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Mood=Jus, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Voice=Act, Voice=Pass

PROPN occurs with 98 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (36091 tokens). Examples: _

Relations

PROPN nodes are attached to their parents using 14 different relations: flat:name (15524; 27% instances), nmod (12137; 21% instances), nmod:poss (10338; 18% instances), appos (8939; 15% instances), nsubj (4557; 8% instances), conj (3101; 5% instances), root (1561; 3% instances), obj (1241; 2% instances), flat (449; 1% instances), parataxis (309; 1% instances), nsubj:pass (119; 0% instances), aux (31; 0% instances), iobj (18; 0% instances), mark (1; 0% instances)

Parents of PROPN nodes belong to 15 different parts of speech: NOUN (22973; 39% instances), PROPN (21342; 37% instances), VERB (8877; 15% instances), ADV (1775; 3% instances), (1561; 3% instances), ADJ (869; 1% instances), PUNCT (595; 1% instances), NUM (110; 0% instances), PRON (66; 0% instances), CCONJ (56; 0% instances), X (48; 0% instances), PART (39; 0% instances), DET (9; 0% instances), SCONJ (4; 0% instances), AUX (1; 0% instances)

29811 (51%) PROPN nodes are leaves.

14047 (24%) PROPN nodes have one child.

7007 (12%) PROPN nodes have two children.

7460 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 36.

Children of PROPN nodes are attached using 24 different relations: flat:name (15464; 27% instances), punct (12001; 21% instances), case (8557; 15% instances), nmod (4758; 8% instances), cc (3949; 7% instances), conj (3523; 6% instances), nummod (2633; 5% instances), amod (1846; 3% instances), parataxis (1023; 2% instances), mark (888; 2% instances), ccomp (701; 1% instances), xcomp (507; 1% instances), nmod:poss (437; 1% instances), appos (315; 1% instances), advmod (241; 0% instances), flat (213; 0% instances), nsubj (105; 0% instances), dep (91; 0% instances), cop (79; 0% instances), obj (34; 0% instances), csubj (15; 0% instances), det (13; 0% instances), acl (6; 0% instances), iobj (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PROPN (21342; 37% instances), PUNCT (12001; 21% instances), ADP (8573; 15% instances), CCONJ (3951; 7% instances), NOUN (2937; 5% instances), NUM (2870; 5% instances), VERB (2149; 4% instances), ADJ (1964; 3% instances), SCONJ (890; 2% instances), ADV (354; 1% instances), PRON (143; 0% instances), AUX (93; 0% instances), PART (75; 0% instances), X (38; 0% instances), DET (19; 0% instances), INTJ (1; 0% instances)