home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: PROPN

There are 363 PROPN lemmas (8%), 511 PROPN types (7%) and 978 PROPN tokens (4%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 8 in number of tokens.

The 10 most frequent PROPN lemmas: Հայաստան, Իրան, Սիմեոն, Սարգսյան, Մարտին, Ադրբեջան, ՀՀ, Կարապետյան, Կարեն, Հարութ

The 10 most frequent PROPN types: Հայաստանի, Իրանի, ՀՀ, Հայաստանում, Սիմեոնը, Ադրբեջանի, Կարեն, Հայաստան, Սարգսյանի, Կարապետյանը

The 10 most frequent ambiguous lemmas: 24 (NUM 3, NOUN 2, PROPN 1)

The 10 most frequent ambiguous types: 24 (NUM 3, PROPN 1), Գևորգյան (ADJ 1, PROPN 1), Ս (ADJ 9, PROPN 1), Տեր (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.407713 (the average of all parts of speech is 1.635667).

The 1st highest number of forms (7) was observed with the lemma “Հայաստան”: Հայաստան, Հայաստանը, Հայաստանի, Հայաստանին, Հայաստանից, Հայաստանն, Հայաստանում.

The 2nd highest number of forms (6) was observed with the lemma “Ադրբեջան”: Ադրբեջան, Ադրբեջանը, Ադրբեջանի, Ադրբեջանին, Ադրբեջանից, Ադրբեջանում.

The 3rd highest number of forms (6) was observed with the lemma “Ռուսաստան”: Ռուսաստան, Ռուսաստանը, Ռուսաստանի, Ռուսաստանին, Ռուսաստանն, Ռուսաստանում.

PROPN occurs with 11 features: Case (978; 100% instances), Definite (978; 100% instances), NameType (978; 100% instances), Animacy (977; 100% instances), Number (977; 100% instances), Abbr (74; 8% instances), Style (5; 1% instances), Number[psor] (4; 0% instances), Poss (4; 0% instances), NumForm (1; 0% instances), Typo (1; 0% instances)

PROPN occurs with 26 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Nhum, Case=Abl, Case=Dat, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, NameType=Com, NameType=Geo, NameType=Giv, NameType=Oth, NameType=Pro, NameType=Prs, NameType=Sur, NumForm=Digit, Number=Coll, Number=Plur, Number=Sing, Number[psor]=Sing, Poss=Yes, Style=Coll, Style=Rare, Style=Vrnc, Typo=Yes

PROPN occurs with 53 feature combinations. The most frequent feature combination is Animacy=Hum|Case=Nom|Definite=Ind|NameType=Giv|Number=Sing (183 tokens). Examples: Կարեն, Սերժ, Վիգեն, Արթուր, Գարեգին, Զաբել, Մարտին, Գրիգոր, Արմեն, Դավիթ

Relations

PROPN nodes are attached to their parents using 21 different relations: nsubj (203; 21% instances), nmod:poss (202; 21% instances), flat (196; 20% instances), obl (93; 10% instances), conj (89; 9% instances), nmod (73; 7% instances), obj (35; 4% instances), nmod:npmod (27; 3% instances), appos (23; 2% instances), root (11; 1% instances), iobj (8; 1% instances), nsubj:pass (6; 1% instances), acl:relcl (2; 0% instances), nsubj:caus (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), csubj:pass (1; 0% instances), fixed (1; 0% instances), obl:agent (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Parents of PROPN nodes belong to 8 different parts of speech: NOUN (353; 36% instances), VERB (329; 34% instances), PROPN (264; 27% instances), ADJ (12; 1% instances), (11; 1% instances), PRON (6; 1% instances), X (2; 0% instances), DET (1; 0% instances)

565 (58%) PROPN nodes are leaves.

209 (21%) PROPN nodes have one child.

106 (11%) PROPN nodes have two children.

98 (10%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 8.

Children of PROPN nodes are attached using 25 different relations: flat (197; 25% instances), punct (151; 19% instances), conj (90; 11% instances), nmod (85; 11% instances), cc (49; 6% instances), amod (47; 6% instances), case (42; 5% instances), acl (19; 2% instances), acl:relcl (17; 2% instances), appos (16; 2% instances), advmod:emph (15; 2% instances), orphan (12; 2% instances), cop (10; 1% instances), nsubj (8; 1% instances), goeswith (6; 1% instances), nmod:poss (6; 1% instances), det:poss (4; 1% instances), det (3; 0% instances), list (3; 0% instances), nmod:npmod (3; 0% instances), obl (2; 0% instances), discourse (1; 0% instances), expl (1; 0% instances), fixed (1; 0% instances), parataxis (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: PROPN (264; 33% instances), NOUN (155; 20% instances), PUNCT (151; 19% instances), ADJ (57; 7% instances), CCONJ (52; 7% instances), ADP (42; 5% instances), VERB (32; 4% instances), ADV (10; 1% instances), AUX (10; 1% instances), DET (7; 1% instances), PRON (4; 1% instances), NUM (2; 0% instances), PART (2; 0% instances), X (1; 0% instances)