home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Armenian-CAVaL: POS Tags: NOUN

There are 1559 NOUN lemmas (37%), 3118 NOUN types (33%) and 14213 NOUN tokens (14%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: որդի, աստուած, հայր, տէր, բան, պատասխանի, աշակերտ, ժողովուրդ, այր, աւր

The 10 most frequent NOUN types: պատասխանի, որդի, Աստուածոյ, հայր, տէր, անուն, այր, բան, Աստուած, մարդոյ

The 10 most frequent ambiguous lemmas: ձեռն (NOUN 144, ADP 3), մանուկ (NOUN 61, ADJ 1), հանդերձ (NOUN 51, ADP 50), մէջ (ADV 47, NOUN 39, ADP 5), պարտ (NOUN 38, ADJ 4), այս (PRON 260, DET 205, NOUN 30), յոյն (NOUN 23, ADJ 3), ինչ (PRON 261, DET 63, ADV 31, NOUN 22), անապատ (NOUN 20, ADJ 14), յաւիտեան (NOUN 20, ADV 4)

The 10 most frequent ambiguous types: ձեռն (NOUN 71, ADP 3), մանուկ (NOUN 42, ADJ 1), տան (NOUN 42, VERB 3), պարտ (NOUN 35, ADJ 3), մէջ (ADV 47, NOUN 23, ADP 5), հանդերձ (ADP 50, NOUN 17), հետէ (NOUN 17, ADP 4), այս (PRON 172, DET 90, NOUN 15), յաւիտեան (NOUN 12, ADV 3), վաղիւ (NOUN 11, ADV 4)

Morphology

The form / lemma ratio of NOUN is 2.000000 (the average of all parts of speech is 2.285234).

The 1st highest number of forms (11) was observed with the lemma “ժողովուրդ”: ժոլովուրդս, ժողով, ժողովըրդեան, ժողովըրդենէ, ժողովուրդ, ժողովուրդս, ժողովուրդք, ժողովրդեան, ժողովրդենէ, ժողովրդով, ժողովրդոց.

The 2nd highest number of forms (10) was observed with the lemma “ակն”: ակամբ, ական, ականէ, ակն, ակունս, աչաց, աչաւք, աչկունք, աչս, աչք.

The 3rd highest number of forms (10) was observed with the lemma “փարիսեցի”: փարեսեցի, փարեսեցւոյ, փարիսացի, փարիսացիք, փարիսացւոց, փարիսեցի, փարիսեցիս, փարիսեցիք, փարիսեցւոյ, փարիսեցւոց.

NOUN occurs with 2 features: Case (14210; 100% instances), Number (14210; 100% instances)

NOUN occurs with 9 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing

NOUN occurs with 15 feature combinations. The most frequent feature combination is Case=Acc|Number=Sing (3565 tokens). Examples: պատասխանի, անուն, տուն, անձն, երկիր, հայր, բան, ձեռն, որդի, Աստուած

Relations

NOUN nodes are attached to their parents using 26 different relations: obl (3662; 26% instances), obj (2791; 20% instances), nmod (2257; 16% instances), nsubj (1983; 14% instances), conj (1206; 8% instances), iobj (342; 2% instances), ccomp (281; 2% instances), root (254; 2% instances), vocative (229; 2% instances), appos (214; 2% instances), advcl (178; 1% instances), xcomp (165; 1% instances), acl (129; 1% instances), obl:arg (128; 1% instances), orphan (127; 1% instances), nsubj:pass (65; 0% instances), nsubj:caus (43; 0% instances), obl:agent (39; 0% instances), fixed (38; 0% instances), amod (24; 0% instances), csubj (20; 0% instances), compound:redup (12; 0% instances), flat (10; 0% instances), dislocated (9; 0% instances), parataxis (6; 0% instances), compound (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech: VERB (9148; 64% instances), NOUN (3069; 22% instances), ADJ (584; 4% instances), PROPN (400; 3% instances), PRON (339; 2% instances), (254; 2% instances), ADV (155; 1% instances), AUX (98; 1% instances), ADP (58; 0% instances), NUM (58; 0% instances), DET (23; 0% instances), INTJ (21; 0% instances), PART (3; 0% instances), CCONJ (2; 0% instances), X (1; 0% instances)

2660 (19%) NOUN nodes are leaves.

4100 (29%) NOUN nodes have one child.

4276 (30%) NOUN nodes have two children.

3177 (22%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 12.

Children of NOUN nodes are attached using 30 different relations: case (6197; 25% instances), det (5372; 22% instances), nmod (3820; 15% instances), punct (1968; 8% instances), cc (1313; 5% instances), conj (1151; 5% instances), amod (1060; 4% instances), acl (741; 3% instances), cop (733; 3% instances), nsubj (453; 2% instances), orphan (429; 2% instances), nummod (309; 1% instances), advmod (291; 1% instances), mark (280; 1% instances), obl (217; 1% instances), advcl (150; 1% instances), iobj (86; 0% instances), xcomp (75; 0% instances), discourse (72; 0% instances), appos (57; 0% instances), ccomp (57; 0% instances), obj (29; 0% instances), csubj (21; 0% instances), compound:redup (12; 0% instances), parataxis (12; 0% instances), vocative (11; 0% instances), flat (3; 0% instances), obl:arg (3; 0% instances), compound (2; 0% instances), dislocated (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: ADP (6284; 25% instances), DET (5429; 22% instances), NOUN (3069; 12% instances), PRON (2088; 8% instances), PUNCT (1968; 8% instances), CCONJ (1400; 6% instances), ADJ (1225; 5% instances), VERB (1021; 4% instances), AUX (752; 3% instances), PROPN (595; 2% instances), NUM (340; 1% instances), SCONJ (283; 1% instances), ADV (237; 1% instances), PART (201; 1% instances), INTJ (33; 0% instances)