
This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: NOUN

There are 3252 NOUN lemmas (43%), 6800 NOUN types (50%) and 13312 NOUN tokens (25%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
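The three counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that a "type" is a distinct word form and that multiword-token ranges and empty nodes are skipped. The `row` helper and the `SAMPLE` rows are hypothetical toy data, not sentences from the treebank.

```python
from collections import defaultdict

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {p: (len(lemmas[p]), len(types[p]), tokens[p]) for p in tokens}

def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for the toy data)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

# Toy rows for illustration only.
SAMPLE = "\n".join([
    "# toy sentence, not taken from the treebank",
    row(1, "տունը", "տուն", "NOUN", 3, "nsubj"),
    row(2, "տան", "տուն", "NOUN", 3, "obl"),
    row(3, "բան", "բան", "NOUN", 0, "root"),
    row(4, "տան", "տուն", "NOUN", 3, "obl"),
    row(5, "մեծ", "մեծ", "ADJ", 3, "amod"),
])

counts = pos_counts(SAMPLE)
noun_share = counts["NOUN"][2] / sum(c[2] for c in counts.values())
```

On the toy data, NOUN has 2 lemmas, 3 types and 4 tokens, and `noun_share` is the fraction of all tokens that are tagged NOUN.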

The 10 most frequent NOUN lemmas: հանրապետություն, օր, մարդ, տարի, թվական, կյանք, տուն, աշխարհ, բան, կին

The 10 most frequent NOUN types: հանրապետության, բան, անգամ, թվականի, ժամանակ, կառավարության, տարի, նախագահի, օրը, թ

The 10 most frequent ambiguous lemmas: ժամանակ (NOUN 71, ADP 6), անգամ (NOUN 57, ADV 22, PART 1), ը (NOUN 40, X 1), երեխա (NOUN 38, ADJ 1), հայ (ADJ 43, NOUN 37), վերջ (NOUN 28, INTJ 4), ներկա (NOUN 20, ADJ 8), կենտրոն (NOUN 17, ADJ 1), այսօր (NOUN 16, ADV 14), ներս (NOUN 16, ADV 5)

The 10 most frequent ambiguous types: անգամ (NOUN 56, ADV 22, PART 1), ժամանակ (NOUN 44, ADP 6), դեպքում (NOUN 31, ADP 2), հայոց (ADJ 3, NOUN 2), տան (NOUN 15, VERB 2), ի (ADP 30, NOUN 13), որոշում (NOUN 13, VERB 1), ը (NOUN 12, X 1), կողմից (ADP 35, NOUN 9), շարժում (NOUN 9, VERB 1)

Morphology

The form / lemma ratio of NOUN is 2.091021 (the average of all parts of speech is 1.814455).
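The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of this page. A short sketch of that arithmetic, assuming this is how the figure is derived; `form_lemma_ratio` and the toy pairs are illustrative names and data:

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) tuples for one part of speech."""
    pairs = list(pairs)  # allow generators, since we iterate twice
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)

# Toy data: 3 distinct forms over 2 lemmas.
toy = [("տուն", "տուն"), ("տան", "տուն"), ("օրը", "օր"), ("տան", "տուն")]
ratio = form_lemma_ratio(toy)

# Figures reported on this page: 6800 NOUN types, 3252 NOUN lemmas.
page_ratio = round(6800 / 3252, 6)
```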

The highest number of forms (20) was observed with the lemma “ձեռք”: ձեռդ, ձեռի, ձեռն, ձեռս, ձեռք, ձեռքդ, ձեռքերը, ձեռքերով, ձեռքը, ձեռքի, ձեռքիդ, ձեռքին, ձեռքիս, ձեռքից, ձեռքն, ձեռքները, ձեռքներիցս, ձեռքներն, ձեռքով, ձեռքս.

The second-highest number of forms (18) was observed with the lemma “աչք”: աչք, աչքեր, աչքերդ, աչքերը, աչքերի, աչքերին, աչքերն, աչքերով, աչքերում, աչքերս, աչքը, աչքի, աչքին, աչքիս, աչքն, աչքով, աչքում, աչքս.

The third-highest number of forms (16) was observed with the lemma “տուն”: տան, տանդ, տանը, տանից, տանն, տներ, տների, տներից, տներն, տներով, տներում, տնից, տնով, տուն, տունը, տունն.
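A ranking like the one above can be obtained by grouping distinct forms under their lemma and sorting by group size. This is a sketch under the assumption that forms are compared as exact strings; `forms_per_lemma` is a hypothetical name, and the toy pairs reuse only a few of the forms listed above.

```python
from collections import defaultdict

def forms_per_lemma(pairs, top=3):
    """Return the `top` lemmas with the most distinct forms, with those forms."""
    by_lemma = defaultdict(set)
    for form, lemma in pairs:
        by_lemma[lemma].add(form)
    # Sort by descending number of forms, then by lemma to break ties.
    ranked = sorted(by_lemma.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    return [(lemma, sorted(forms)) for lemma, forms in ranked[:top]]

# Toy subset of the forms listed above.
toy = [
    ("ձեռք", "ձեռք"), ("ձեռքը", "ձեռք"), ("ձեռքի", "ձեռք"),
    ("աչք", "աչք"), ("աչքը", "աչք"),
    ("տուն", "տուն"), ("տուն", "տուն"),
]
ranking = forms_per_lemma(toy)
```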

NOUN occurs with 13 features: Case (13312; 100% instances), Animacy (13311; 100% instances), Number (13309; 100% instances), Definite (13075; 98% instances), Style (284; 2% instances), Person[psor] (229; 2% instances), Number[psor] (209; 2% instances), NumForm (158; 1% instances), Abbr (114; 1% instances), Typo (18; 0% instances), Poss (6; 0% instances), NameType (5; 0% instances), Echo (2; 0% instances)

NOUN occurs with 34 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Nhum, Case=Abl, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Echo=Ech, NameType=Geo, NameType=Sur, NumForm=Digit, NumForm=Word, Number=Assoc, Number=Coll, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, Poss=Yes, Style=Arch, Style=Coll, Style=Expr, Style=Rare, Style=Slng, Style=Vrnc, Style=Vulg, Typo=Yes

NOUN occurs with 212 feature combinations. The most frequent feature combination is Animacy=Nhum|Case=Dat|Definite=Ind|Number=Sing (2584 tokens). Examples: հանրապետության, թվականի, կառավարության, ծրագրի, հոդվածի, օրենքի, կյանքի, որոշման, տարվա, աշխարհի
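The feature, feature-value and feature-combination counts above all derive from the FEATS column of the CoNLL-U file, where pairs are joined with `|`. A minimal sketch of the tallying, assuming each `Name=Value` string is counted as one pair (multi-valued features such as `Case=Dat,Gen` are not split here); `feature_stats` and the toy values are illustrative:

```python
from collections import Counter

def feature_stats(feats_column):
    """feats_column: iterable of FEATS strings, one per token ('_' if empty)."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1          # the whole string is one combination
        if feats == "_":
            continue                # token has no features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1     # tokens carrying this feature
            pairs[pair] += 1        # tokens carrying this feature-value pair
    return features, pairs, combos

toy = [
    "Animacy=Nhum|Case=Dat|Definite=Ind|Number=Sing",
    "Animacy=Nhum|Case=Dat|Definite=Ind|Number=Sing",
    "Animacy=Hum|Case=Nom|Definite=Def|Number=Plur",
    "_",
]
features, pairs, combos = feature_stats(toy)
top_combo, top_count = combos.most_common(1)[0]
```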

Relations

NOUN nodes are attached to their parents using 35 different relations: obl (2963; 22% instances), nmod:poss (2742; 21% instances), obj (1893; 14% instances), nsubj (1713; 13% instances), conj (1306; 10% instances), compound:lvc (356; 3% instances), nmod (353; 3% instances), nmod:npmod (345; 3% instances), root (335; 3% instances), parataxis (175; 1% instances), appos (158; 1% instances), xcomp (158; 1% instances), nsubj:pass (144; 1% instances), iobj (85; 1% instances), flat (76; 1% instances), compound (72; 1% instances), orphan (59; 0% instances), fixed (49; 0% instances), ccomp (48; 0% instances), dep (42; 0% instances), obl:agent (34; 0% instances), compound:redup (33; 0% instances), acl (30; 0% instances), advcl (28; 0% instances), vocative (27; 0% instances), acl:relcl (25; 0% instances), dislocated (23; 0% instances), case (7; 0% instances), iobj:agent (7; 0% instances), csubj (6; 0% instances), list (6; 0% instances), nsubj:caus (6; 0% instances), amod (4; 0% instances), discourse (3; 0% instances), csubj:pass (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech, counting the artificial root (which has no tag) as one: VERB (6921; 52% instances), NOUN (4970; 37% instances), ADJ (458; 3% instances), root (335; 3% instances), PROPN (257; 2% instances), PRON (153; 1% instances), ADV (73; 1% instances), DET (36; 0% instances), X (36; 0% instances), NUM (30; 0% instances), ADP (19; 0% instances), SYM (15; 0% instances), AUX (3; 0% instances), INTJ (3; 0% instances), PART (3; 0% instances)
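The parent distribution can be tallied by looking up the tag of each NOUN node's HEAD. Nodes whose HEAD is 0 have no parent token; the sketch below gives them the label `ROOT`, which is a hypothetical placeholder chosen here. The 335 parentless nodes in the list match the 335 `root` relations reported above. The input format (lists of `(id, upos, head)` tuples) is also an assumption made for brevity.

```python
from collections import Counter

def parent_pos_counts(sentences, upos="NOUN"):
    """sentences: list of sentences, each a list of (id, upos, head) tuples."""
    counts = Counter()
    for sent in sentences:
        tag_of = {tok_id: tag for tok_id, tag, _ in sent}
        for _, tag, head in sent:
            if tag == upos:
                # HEAD 0 is not a token, so it falls back to the ROOT label.
                counts[tag_of.get(head, "ROOT")] += 1
    return counts

# Toy sentence: token 1 depends on token 2, token 2 is the root.
toy = [[(1, "NOUN", 2), (2, "NOUN", 0), (3, "VERB", 2)]]
parents = parent_pos_counts(toy)

# Percentage for VERB parents, from the figures on this page.
verb_pct = round(100 * 6921 / 13312)
```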

3413 (26%) NOUN nodes are leaves.

5474 (41%) NOUN nodes have one child.

2539 (19%) NOUN nodes have two children.

1886 (14%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 15.
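The child-degree figures above partition all NOUN tokens by how many dependents they have. A sketch of the computation, assuming sentences are given as lists of `(id, upos, head)` tuples (a simplified, hypothetical input format); the last line checks that the four reported counts add up to the 13312 NOUN tokens.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="NOUN"):
    """Map number of children -> number of `upos` nodes with that many children."""
    hist = Counter()
    for sent in sentences:
        # Count how many tokens point at each head within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1  # Counter yields 0 for leaves
    return hist

# Toy sentence: token 1 is a leaf NOUN, token 4 is a NOUN with one child.
toy = [[(1, "NOUN", 2), (2, "VERB", 0), (3, "ADJ", 4), (4, "NOUN", 2), (5, "PUNCT", 2)]]
hist = child_degree_histogram(toy)

# Leaves, one child, two children, three or more children, as reported above.
reported_total = 3413 + 5474 + 2539 + 1886
```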

Children of NOUN nodes are attached using 41 different relations: nmod:poss (3265; 18% instances), amod (3056; 17% instances), punct (2352; 13% instances), conj (1301; 7% instances), case (1113; 6% instances), det (1107; 6% instances), acl (825; 5% instances), cc (758; 4% instances), det:poss (658; 4% instances), nmod (476; 3% instances), nummod (429; 2% instances), cop (417; 2% instances), nmod:npmod (388; 2% instances), advmod:emph (308; 2% instances), nsubj (297; 2% instances), acl:relcl (248; 1% instances), appos (162; 1% instances), parataxis (136; 1% instances), obl (89; 0% instances), compound (76; 0% instances), mark (76; 0% instances), orphan (67; 0% instances), advmod (56; 0% instances), discourse (54; 0% instances), advcl (33; 0% instances), compound:redup (32; 0% instances), case:loc (28; 0% instances), flat (28; 0% instances), csubj (24; 0% instances), aux (21; 0% instances), obj (19; 0% instances), dep (10; 0% instances), fixed (9; 0% instances), compound:lvc (8; 0% instances), dislocated (8; 0% instances), list (6; 0% instances), xcomp (6; 0% instances), vocative (3; 0% instances), expl (2; 0% instances), iobj (2; 0% instances), nsubj:pass (2; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (4970; 28% instances), ADJ (3165; 18% instances), PUNCT (2352; 13% instances), DET (1761; 10% instances), VERB (1287; 7% instances), ADP (1137; 6% instances), PROPN (796; 4% instances), CCONJ (788; 4% instances), NUM (462; 3% instances), AUX (438; 2% instances), ADV (319; 2% instances), PRON (223; 1% instances), PART (99; 1% instances), SCONJ (88; 0% instances), X (31; 0% instances), SYM (25; 0% instances), INTJ (14; 0% instances)