home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: PROPN

There are 197 PROPN lemmas (7%), 280 PROPN types (6%) and 532 PROPN tokens (4%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: Հայաստան, Իրան, Կարապետյան, Սարգսյան, Կարեն, ՀՀ, Ադրբեջան, Հարութ, Եսայի, Սերժ

The 10 most frequent PROPN types: Իրանի, Հայաստանի, Կարեն, ՀՀ, Կարապետյանը, Հարութը, Ադրբեջանի, Հայաստանում, Սարգսյանի, Սերժ

The 10 most frequent ambiguous lemmas: 24 (NUM 3, NOUN 2, PROPN 1)

The 10 most frequent ambiguous types: 24 (NUM 3, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.421320 (the average of all parts of speech is 1.523825).

The 1st highest number of forms (7) was observed with the lemma “Հայաստան”: Հայաստան, Հայաստանը, Հայաստանի, Հայաստանին, Հայաստանից, Հայաստանն, Հայաստանում.

The 2nd highest number of forms (6) was observed with the lemma “Ադրբեջան”: Ադրբեջան, Ադրբեջանը, Ադրբեջանի, Ադրբեջանին, Ադրբեջանից, Ադրբեջանում.

The 3rd highest number of forms (4) was observed with the lemma “Իրան”: Իրան, Իրանը, Իրանի, Իրանն.

PROPN occurs with 11 features: Case (532; 100% instances), Definite (532; 100% instances), NameType (532; 100% instances), Number (532; 100% instances), Animacy (531; 100% instances), Abbr (47; 9% instances), Number[psor] (4; 1% instances), Poss (4; 1% instances), Style (4; 1% instances), NumForm (1; 0% instances), Typo (1; 0% instances)

PROPN occurs with 26 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Inan, Animacy=Nhum, Case=Abl, Case=Dat, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, NameType=Com, NameType=Geo, NameType=Giv, NameType=Oth, NameType=Pro, NameType=Prs, NameType=Sur, NumForm=Digit, Number=Coll, Number=Plur, Number=Sing, Number[psor]=Sing, Poss=Yes, Style=Coll, Style=Vrnc, Typo=Yes

PROPN occurs with 45 feature combinations. The most frequent feature combination is Animacy=Hum|Case=Nom|Definite=Ind|NameType=Giv|Number=Sing (101 tokens). Examples: Կարեն, Սերժ, Վիգեն, Զաբել, Արթուր, Արմեն, Հասան, Շուշան, Աբկայ, Անն

Relations

PROPN nodes are attached to their parents using 19 different relations: nmod:poss (104; 20% instances), nsubj (103; 19% instances), flat (101; 19% instances), conj (62; 12% instances), nmod (53; 10% instances), obl (46; 9% instances), nmod:npmod (19; 4% instances), obj (11; 2% instances), root (10; 2% instances), appos (8; 2% instances), iobj (3; 1% instances), xcomp (3; 1% instances), nsubj:pass (2; 0% instances), parataxis (2; 0% instances), acl:relcl (1; 0% instances), aux (1; 0% instances), nsubj:caus (1; 0% instances), obl:agent (1; 0% instances), vocative (1; 0% instances)

Parents of PROPN nodes belong to 6 different parts of speech: NOUN (195; 37% instances), VERB (164; 31% instances), PROPN (153; 29% instances), (10; 2% instances), ADJ (9; 2% instances), PRON (1; 0% instances)

316 (59%) PROPN nodes are leaves.

105 (20%) PROPN nodes have one child.

52 (10%) PROPN nodes have two children.

59 (11%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 21 different relations: flat (101; 23% instances), punct (93; 21% instances), conj (61; 14% instances), nmod (36; 8% instances), cc (33; 8% instances), case (22; 5% instances), amod (18; 4% instances), appos (11; 3% instances), orphan (10; 2% instances), advmod:emph (9; 2% instances), acl (7; 2% instances), acl:relcl (7; 2% instances), cop (7; 2% instances), nsubj (6; 1% instances), nmod:npmod (4; 1% instances), goeswith (3; 1% instances), list (3; 1% instances), det (2; 0% instances), dep (1; 0% instances), nmod:poss (1; 0% instances), obl (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: PROPN (153; 35% instances), PUNCT (93; 21% instances), NOUN (78; 18% instances), CCONJ (36; 8% instances), ADP (22; 5% instances), ADJ (19; 4% instances), VERB (15; 3% instances), AUX (7; 2% instances), ADV (3; 1% instances), PART (3; 1% instances), DET (2; 0% instances), NUM (2; 0% instances), PRON (2; 0% instances), X (1; 0% instances)