home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-BSUT: POS Tags: PROPN

There are 596 PROPN lemmas (9%), 841 PROPN types (7%) and 1807 PROPN tokens (4%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: Հայաստան, ՀՀ, Արցախ, Սարյան, Ռուբեն, Ռուսաստան, Նվարդ, Ջիվանի, Փաշինյան, Երևան

The 10 most frequent PROPN types: Հայաստանի, ՀՀ, Հայաստանում, Արցախի, Ջիվանին, Խոսրովի, Ռուսաստանի, Կարինե, Ադրբեջանի, Իրանի

The 10 most frequent ambiguous lemmas: Սարյան (PROPN 31, NOUN 18), Լոս (PROPN 2, X 1), Ղազախստան (NOUN 6, PROPN 2), Այգուտ (NOUN 3, PROPN 1), Գոչունյան (NOUN 1, PROPN 1)

The 10 most frequent ambiguous types: Լ (ADJ 6, PROPN 4), Ա (PROPN 3, ADJ 2), Ազատ (PROPN 3, ADJ 1), Կենտրոն (PROPN 3, NOUN 1), Գ (ADJ 1, PROPN 1), Երկիրը (NOUN 2, PROPN 1), Ի (ADP 7, PROPN 1), Պատմությունը (NOUN 1, PROPN 1), երկիր (NOUN 2, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.411074 (the average of all parts of speech is 1.712534).

The 1st highest number of forms (7) was observed with the lemma “Հայաստան”: ՀԱՅԱՍՏԱՆԻ, Հայաստան, Հայաստանը, Հայաստանի, Հայաստանին, Հայաստանն, Հայաստանում.

The 2nd highest number of forms (7) was observed with the lemma “Պետրոսյան”: ՊԵՏՐՈՍՅԱՆ, Պետրոսյան, Պետրոսյանը, Պետրոսյանի, Պետրոսյանին, Պետրոսյանից, Պետրոսյանն.

The 3rd highest number of forms (7) was observed with the lemma “Սարյան”: Սարյան, Սարյանի, Սարյանին, Սարյանն, Սարյանների, Սարյաններին, Սարյանով.

PROPN occurs with 8 features: Animacy (1807; 100% instances), Case (1807; 100% instances), Definite (1807; 100% instances), NameType (1807; 100% instances), Number (1807; 100% instances), Abbr (112; 6% instances), Style (22; 1% instances), Foreign (7; 0% instances)

PROPN occurs with 25 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Nhum, Case=Abl, Case=Dat, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Foreign=Yes, NameType=Com, NameType=Geo, NameType=Giv, NameType=Oth, NameType=Pro, NameType=Prs, NameType=Sur, Number=Assoc, Number=Coll, Number=Plur, Number=Sing, Style=Coll, Style=Expr, Style=Rare

PROPN occurs with 62 feature combinations. The most frequent feature combination is Animacy=Nhum|Case=Dat|Definite=Ind|NameType=Geo|Number=Sing (349 tokens). Examples: Հայաստանի, Արցախի, Ռուսաստանի, Ադրբեջանի, Իրանի, Թուրքիայի, Ղարաբաղի, Երևանի, Արարատի, ՀԱՅԱՍՏԱՆԻ

Relations

PROPN nodes are attached to their parents using 26 different relations: nmod:poss (480; 27% instances), nsubj (275; 15% instances), flat:name (262; 14% instances), obl (181; 10% instances), conj (157; 9% instances), nmod (152; 8% instances), obj (64; 4% instances), nmod:npmod (57; 3% instances), appos (52; 3% instances), root (41; 2% instances), iobj (18; 1% instances), nsubj:pass (13; 1% instances), parataxis (12; 1% instances), flat (9; 0% instances), vocative (9; 0% instances), orphan (6; 0% instances), acl (3; 0% instances), dislocated (3; 0% instances), xcomp (3; 0% instances), amod (2; 0% instances), ccomp (2; 0% instances), flat:range (2; 0% instances), advcl (1; 0% instances), compound (1; 0% instances), csubj (1; 0% instances), obl:agent (1; 0% instances)

Parents of PROPN nodes belong to 11 different parts of speech: NOUN (805; 45% instances), VERB (519; 29% instances), PROPN (389; 22% instances), (41; 2% instances), ADJ (27; 1% instances), PRON (9; 0% instances), ADV (6; 0% instances), X (5; 0% instances), NUM (4; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances)

1031 (57%) PROPN nodes are leaves.

371 (21%) PROPN nodes have one child.

244 (14%) PROPN nodes have two children.

161 (9%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 9.

Children of PROPN nodes are attached using 32 different relations: punct (397; 27% instances), flat:name (292; 20% instances), conj (170; 11% instances), nmod (120; 8% instances), case (102; 7% instances), cc (94; 6% instances), amod (40; 3% instances), dep (33; 2% instances), acl (30; 2% instances), acl:relcl (23; 2% instances), advmod:emph (22; 1% instances), list (22; 1% instances), cop (21; 1% instances), nsubj (17; 1% instances), parataxis (16; 1% instances), nmod:npmod (13; 1% instances), nmod:poss (11; 1% instances), appos (9; 1% instances), orphan (9; 1% instances), flat (8; 1% instances), discourse (6; 0% instances), mark (6; 0% instances), nummod (5; 0% instances), flat:range (4; 0% instances), advcl (3; 0% instances), det (3; 0% instances), obl (3; 0% instances), aux (2; 0% instances), advmod (1; 0% instances), case:loc (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PUNCT (397; 27% instances), PROPN (389; 26% instances), NOUN (306; 21% instances), ADP (102; 7% instances), CCONJ (93; 6% instances), ADJ (50; 3% instances), VERB (50; 3% instances), AUX (23; 2% instances), ADV (22; 1% instances), NUM (13; 1% instances), X (11; 1% instances), PART (10; 1% instances), DET (6; 0% instances), PRON (6; 0% instances), SCONJ (6; 0% instances), INTJ (1; 0% instances)