home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: NOUN

There are 1991 NOUN lemmas (44%), 3590 NOUN types (48%) and 5773 NOUN tokens (25%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: օր, կին, տարի, թվական, կյանք, տուն, ժամանակ, բան, երկիր, մարդ

The 10 most frequent NOUN types: անգամ, տարի, բան, օրը, ընթացքում, ժամանակ, դեպքում, կնոջ, Հայոց, թվականի

The 10 most frequent ambiguous lemmas: անգամ (NOUN 27, ADV 14), հայ (ADJ 26, NOUN 20), ներկա (NOUN 13, ADJ 6), հավաքական (NOUN 11, ADJ 1), այսօր (NOUN 10, ADV 5), ի (ADP 16, NOUN 10), վերջ (NOUN 10, INTJ 1), մեկ (NUM 15, NOUN 8, ADV 6, DET 3, ADJ 1), ներս (NOUN 6, ADV 1), 22 (NOUN 4, NUM 1)

The 10 most frequent ambiguous types: անգամ (NOUN 27, ADV 14), Հայոց (NOUN 16, ADJ 3), ի (ADP 15, NOUN 10), կողմից (ADP 12, NOUN 6), այսօր (ADV 5, NOUN 4), գործում (NOUN 4, VERB 1), թվում (VERB 6, NOUN 4), ավելին (NOUN 3, PART 1), դեմ (ADP 16, NOUN 1), զույգ (ADJ 3, NOUN 3)

Morphology

The form / lemma ratio of NOUN is 1.803114 (the average of all parts of speech is 1.635667).

The 1st highest number of forms (12) was observed with the lemma “աչք”: աչք, աչքեր, աչքերը, աչքերի, աչքերին, աչքերով, աչքերում, աչքը, աչքի, աչքին, աչքն, աչքում.

The 2nd highest number of forms (12) was observed with the lemma “օր”: օր, օրերը, օրերի, օրերին, օրերից, օրերս, օրը, օրից, օրն, օրում, օրվա, օրվանից.

The 3rd highest number of forms (11) was observed with the lemma “խնդիր”: խնդիր, խնդիրը, խնդիրն, խնդիրներ, խնդիրները, խնդիրների, խնդիրներին, խնդիրներից, խնդրի, խնդրով, խնդրում.

NOUN occurs with 13 features: Case (5773; 100% instances), Animacy (5772; 100% instances), Number (5772; 100% instances), Definite (5727; 99% instances), NumForm (83; 1% instances), Style (82; 1% instances), Abbr (56; 1% instances), Number[psor] (46; 1% instances), Person[psor] (35; 1% instances), Typo (16; 0% instances), Poss (3; 0% instances), Echo (1; 0% instances), NameType (1; 0% instances)

NOUN occurs with 32 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Nhum, Case=Abl, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Echo=Ech, NameType=Geo, NumForm=Digit, NumForm=Word, Number=Coll, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, Poss=Yes, Style=Arch, Style=Coll, Style=Expr, Style=Rare, Style=Slng, Style=Vrnc, Style=Vulg, Typo=Yes

NOUN occurs with 128 feature combinations. The most frequent feature combination is Animacy=Nhum|Case=Dat|Definite=Ind|Number=Sing (1046 tokens). Examples: թվականի, երկրի, կյանքի, մարտի, ի, դոլարի, համալսարանի, ջրի, փետրվարի, փողոցի

Relations

NOUN nodes are attached to their parents using 36 different relations: obl (1376; 24% instances), nmod:poss (1099; 19% instances), obj (861; 15% instances), nsubj (779; 13% instances), conj (539; 9% instances), nmod (177; 3% instances), nmod:npmod (140; 2% instances), root (129; 2% instances), xcomp (85; 1% instances), appos (83; 1% instances), compound:lvc (78; 1% instances), nsubj:pass (77; 1% instances), parataxis (64; 1% instances), orphan (35; 1% instances), flat (34; 1% instances), iobj (31; 1% instances), goeswith (23; 0% instances), ccomp (19; 0% instances), compound (19; 0% instances), compound:redup (19; 0% instances), acl (15; 0% instances), acl:relcl (15; 0% instances), fixed (14; 0% instances), obl:agent (14; 0% instances), advcl (12; 0% instances), advmod (11; 0% instances), vocative (8; 0% instances), csubj (4; 0% instances), discourse (3; 0% instances), amod (2; 0% instances), case (2; 0% instances), nsubj:caus (2; 0% instances), cc (1; 0% instances), csubj:pass (1; 0% instances), expl (1; 0% instances), iobj:agent (1; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (3109; 54% instances), NOUN (2029; 35% instances), ADJ (210; 4% instances), PROPN (155; 3% instances), (129; 2% instances), PRON (55; 1% instances), ADV (36; 1% instances), X (22; 0% instances), NUM (12; 0% instances), DET (10; 0% instances), ADP (5; 0% instances), PART (1; 0% instances)

1409 (24%) NOUN nodes are leaves.

2448 (42%) NOUN nodes have one child.

1137 (20%) NOUN nodes have two children.

779 (13%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 10.

Children of NOUN nodes are attached using 38 different relations: amod (1415; 18% instances), nmod:poss (1299; 17% instances), punct (965; 12% instances), det (566; 7% instances), conj (533; 7% instances), case (452; 6% instances), acl (380; 5% instances), cc (343; 4% instances), det:poss (293; 4% instances), nummod (186; 2% instances), nmod (178; 2% instances), cop (170; 2% instances), nmod:npmod (166; 2% instances), advmod:emph (137; 2% instances), acl:relcl (125; 2% instances), nsubj (124; 2% instances), appos (87; 1% instances), obl (50; 1% instances), advmod (36; 0% instances), parataxis (34; 0% instances), mark (31; 0% instances), orphan (31; 0% instances), flat (19; 0% instances), compound (18; 0% instances), compound:redup (18; 0% instances), case:loc (16; 0% instances), discourse (16; 0% instances), advcl (12; 0% instances), csubj (12; 0% instances), aux (7; 0% instances), goeswith (6; 0% instances), obj (6; 0% instances), xcomp (4; 0% instances), fixed (3; 0% instances), dep (2; 0% instances), compound:lvc (1; 0% instances), expl (1; 0% instances), nsubj:pass (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (2029; 26% instances), ADJ (1463; 19% instances), PUNCT (965; 12% instances), DET (856; 11% instances), VERB (550; 7% instances), ADP (461; 6% instances), CCONJ (360; 5% instances), PROPN (353; 5% instances), NUM (203; 3% instances), AUX (178; 2% instances), ADV (137; 2% instances), PRON (74; 1% instances), PART (46; 1% instances), SCONJ (40; 1% instances), X (24; 0% instances), SYM (3; 0% instances), INTJ (1; 0% instances)