home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: NOUN

There are 5087 NOUN lemmas (41%), 11717 NOUN types (51%) and 28806 NOUN tokens (28%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: ը, աշխատանք, տարի, հանրապետություն, օր, մարդ, ժամանակ, գործ, երկիր, աշխարհ

The 10 most frequent NOUN types: հանրապետության, ի, անգամ, ժամանակ, աշխատանքի, ին, թ, տարի, տարվա, բան

The 10 most frequent ambiguous lemmas: անգամ (NOUN 110, ADV 30), կոլեկտիվ (NOUN 64, ADJ 4), վերջ (NOUN 61, INTJ 5), այսօր (NOUN 59, ADV 26), հայ (ADJ 67, NOUN 53), հավաքական (NOUN 45, ADJ 2), կենտրոն (NOUN 40, ADJ 1), ղեկավար (NOUN 38, ADJ 3), ներկա (NOUN 35, ADJ 11), աշխատավոր (NOUN 33, ADJ 2)

The 10 most frequent ambiguous types: ի (NOUN 185, ADP 60), անգամ (NOUN 106, ADV 29), ին (NOUN 90, ADJ 24), ը (NOUN 57, X 1), այսօր (NOUN 34, ADV 15), գործում (NOUN 34, VERB 11), թվում (NOUN 25, VERB 19), տան (NOUN 19, VERB 3), ժամանակին (NOUN 16, ADV 7), տարեկան (NOUN 17, ADJ 13, ADV 7)

Morphology

The form / lemma ratio of NOUN is 2.303322 (the average of all parts of speech is 1.883575).

The 1st highest number of forms (20) was observed with the lemma “օր”: օր, օրդ, օրեր, օրերը, օրերի, օրերին, օրերից, օրերն, օրերով, օրերս, օրը, օրի, օրից, օրն, օրով, օրում, օրս, օրվա, օրվան, օրվանից.

The 2nd highest number of forms (19) was observed with the lemma “աչք”: Աչքերիդ, աչք, աչքեր, աչքերդ, աչքերը, աչքերի, աչքերին, աչքերն, աչքերով, աչքերում, աչքերս, աչքը, աչքի, աչքին, աչքիս, աչքն, աչքով, աչքում, աչքս.

The 3rd highest number of forms (19) was observed with the lemma “ձեռք”: ձեռք, ձեռքդ, ձեռքեր, ձեռքերը, ձեռքերի, ձեռքերով, ձեռքերում, ձեռքը, ձեռքի, ձեռքիդ, ձեռքին, ձեռքիս, ձեռքից, ձեռքն, ձեռքները, ձեռքներիցս, ձեռքներն, ձեռքով, ձեռքս.

NOUN occurs with 17 features: Animacy (28806; 100% instances), Case (28806; 100% instances), Number (28802; 100% instances), Definite (28486; 99% instances), Abbr (582; 2% instances), ExtPos (484; 2% instances), Hyph (437; 2% instances), Style (363; 1% instances), Person[psor] (239; 1% instances), Number[psor] (224; 1% instances), Deixis[psor] (76; 0% instances), NameType (40; 0% instances), NumForm (35; 0% instances), Poss (29; 0% instances), Typo (12; 0% instances), Echo (2; 0% instances), Foreign (1; 0% instances)

NOUN occurs with 44 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Hum,Nhum, Animacy=Nhum, Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Deixis[psor]=Prox, Echo=Ech, ExtPos=ADP, ExtPos=ADV, ExtPos=PROPN, Foreign=Yes, Hyph=Yes, NameType=Geo, NameType=Sur, NumForm=Combi, NumForm=Digit, NumForm=Word, Number=Assoc, Number=Coll, Number=Plur, Number=Ptan, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, Poss=Yes, Style=Arch, Style=Coll, Style=Expr, Style=Rare, Style=Slng, Style=Vrnc, Style=Vulg, Typo=Yes

NOUN occurs with 282 feature combinations. The most frequent feature combination is Animacy=Nhum|Case=Dat|Definite=Ind|Number=Sing (4749 tokens). Examples: հանրապետության, աշխատանքի, տարվա, աշխարհի, թվականի, երկրի, կառավարության, շրջանի, ծրագրի, անվան

Relations

NOUN nodes are attached to their parents using 41 different relations: nmod:poss (6610; 23% instances), obl (5417; 19% instances), nsubj (3436; 12% instances), obj (3259; 11% instances), conj (2654; 9% instances), nmod (1027; 4% instances), root (856; 3% instances), nmod:npmod (748; 3% instances), compound:lvc (742; 3% instances), iobj (629; 2% instances), nsubj:pass (533; 2% instances), dep (435; 2% instances), case (400; 1% instances), parataxis (380; 1% instances), appos (307; 1% instances), xcomp (289; 1% instances), compound (171; 1% instances), fixed (132; 0% instances), list (107; 0% instances), orphan (85; 0% instances), flat (73; 0% instances), ccomp (71; 0% instances), advmod (67; 0% instances), acl:relcl (55; 0% instances), advcl (44; 0% instances), obl:agent (42; 0% instances), vocative (41; 0% instances), compound:redup (37; 0% instances), flat:name (33; 0% instances), acl (29; 0% instances), dislocated (23; 0% instances), amod (17; 0% instances), flat:range (16; 0% instances), csubj (9; 0% instances), discourse (7; 0% instances), iobj:agent (7; 0% instances), nsubj:caus (7; 0% instances), advcl:relcl (4; 0% instances), advmod:emph (4; 0% instances), obj:agent (2; 0% instances), csubj:pass (1; 0% instances)

Parents of NOUN nodes belong to 16 different parts of speech: VERB (13764; 48% instances), NOUN (11372; 39% instances), PROPN (880; 3% instances), (856; 3% instances), ADJ (759; 3% instances), NUM (551; 2% instances), PRON (279; 1% instances), ADV (133; 0% instances), DET (68; 0% instances), ADP (59; 0% instances), X (42; 0% instances), SYM (32; 0% instances), INTJ (4; 0% instances), AUX (3; 0% instances), PART (3; 0% instances), CCONJ (1; 0% instances)

7087 (25%) NOUN nodes are leaves.

11975 (42%) NOUN nodes have one child.

5846 (20%) NOUN nodes have two children.

3898 (14%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 17.

Children of NOUN nodes are attached using 47 different relations: nmod:poss (8004; 21% instances), amod (7511; 20% instances), punct (5128; 13% instances), conj (2639; 7% instances), case (2106; 5% instances), det (1935; 5% instances), cc (1558; 4% instances), acl (1405; 4% instances), nmod (1167; 3% instances), nummod (1142; 3% instances), det:poss (1041; 3% instances), nmod:npmod (842; 2% instances), cop (658; 2% instances), acl:relcl (542; 1% instances), nsubj (485; 1% instances), advmod:emph (453; 1% instances), appos (378; 1% instances), parataxis (292; 1% instances), compound (182; 0% instances), obl (126; 0% instances), mark (115; 0% instances), orphan (110; 0% instances), list (108; 0% instances), discourse (89; 0% instances), advmod (84; 0% instances), csubj (51; 0% instances), advcl (42; 0% instances), case:loc (39; 0% instances), compound:redup (39; 0% instances), dep (37; 0% instances), flat (31; 0% instances), aux (29; 0% instances), flat:range (16; 0% instances), flat:name (14; 0% instances), fixed (13; 0% instances), obj (13; 0% instances), advcl:relcl (8; 0% instances), goeswith (7; 0% instances), vocative (6; 0% instances), dislocated (5; 0% instances), xcomp (5; 0% instances), iobj (4; 0% instances), nsubj:pass (4; 0% instances), ccomp (3; 0% instances), compound:lvc (3; 0% instances), expl (3; 0% instances), csubj:outer (2; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (11372; 30% instances), ADJ (7504; 20% instances), PUNCT (5128; 13% instances), DET (2980; 8% instances), VERB (2330; 6% instances), PROPN (2172; 6% instances), ADP (1844; 5% instances), NUM (1512; 4% instances), CCONJ (1483; 4% instances), AUX (687; 2% instances), ADV (584; 2% instances), PRON (482; 1% instances), PART (154; 0% instances), SCONJ (102; 0% instances), SYM (63; 0% instances), X (61; 0% instances), INTJ (16; 0% instances)