home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Armenian-CAVaL: POS Tags: NOUN

There are 1030 NOUN lemmas (39%), 2084 NOUN types (31%) and 10916 NOUN tokens (13%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: որդի, հայր, Տէր, աշակերտ, պատասխանի, ժողովուրդ, բան, աւր, այր, երկին

The 10 most frequent NOUN types: պատասխանի, հայր, տէր, որդի, այր, մարդոյ, աշակերտք, անուն, բան, երկնից

The 10 most frequent ambiguous lemmas: ձեռն (NOUN 101, ADP 16), հանդերձ (NOUN 50, ADP 40), այս (DET 314, NOUN 29, PRON 10), մէջ (ADP 32, NOUN 25), յաւիտեան (NOUN 23, ADV 1), անապատ (NOUN 19, ADJ 14), ինչ (PRON 199, DET 28, ADV 25, NOUN 19), արդար (NOUN 17, ADJ 15, ADV 1), բարեկամ (NOUN 14, ADJ 7), վաղիւ (NOUN 12, ADV 4)

The 10 most frequent ambiguous types: ձեռն (NOUN 38, ADP 16), տան (NOUN 38, VERB 3), հանդերձ (ADP 40, NOUN 16), այս (DET 195, NOUN 14, PRON 7), մէջ (ADP 32, NOUN 14), վաղիւ (NOUN 11, ADV 4), անապատի (NOUN 9, ADJ 7), այրի (NOUN 8, VERB 2), անապատ (NOUN 8, ADJ 7), բարեկամ (NOUN 8, ADJ 1)

Morphology

The form / lemma ratio of NOUN is 2.023301 (the average of all parts of speech is 2.533817).

The 1st highest number of forms (11) was observed with the lemma “ժողովուրդ”: ժոլովուրդս, ժողով, ժողովըրդեան, ժողովըրդենէ, ժողովուրդ, ժողովուրդս, ժողովուրդք, ժողովրդեան, ժողովրդենէ, ժողովրդով, ժողովրդոց.

The 2nd highest number of forms (10) was observed with the lemma “փարիսեցի”: փարեսեցի, փարեսեցւոյ, փարիսացի, փարիսացիք, փարիսացւոց, փարիսեցի, փարիսեցիս, փարիսեցիք, փարիսեցւոյ, փարիսեցւոց.

The 3rd highest number of forms (9) was observed with the lemma “հրեշտակ”: հըրեշտեկաց, հրեշտակ, հրեշտակաց, հրեշտակէ, հրեշտակի, հրեշտակս, հրեշտակք, հրեշտեկաց, հրեշտեկաւք.

NOUN occurs with 2 features: Case (10881; 100% instances), Number (10881; 100% instances)

NOUN occurs with 9 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing

NOUN occurs with 15 feature combinations. The most frequent feature combination is Case=Acc|Number=Sing (2875 tokens). Examples: պատասխանի, անձն, հայր, տուն, երկիր, անուն, բան, որդի, հաց, խաչ

Relations

NOUN nodes are attached to their parents using 28 different relations: obl (2878; 26% instances), obj (2352; 22% instances), nsubj (1780; 16% instances), nmod (1128; 10% instances), conj (843; 8% instances), iobj (288; 3% instances), ccomp (252; 2% instances), vocative (237; 2% instances), appos (235; 2% instances), root (215; 2% instances), advcl (130; 1% instances), obl:arg (122; 1% instances), xcomp (119; 1% instances), orphan (97; 1% instances), acl (68; 1% instances), nsubj:pass (49; 0% instances), nsubj:caus (38; 0% instances), obl:agent (30; 0% instances), csubj (16; 0% instances), compound:redup (9; 0% instances), amod (8; 0% instances), fixed (7; 0% instances), parataxis (5; 0% instances), dislocated (4; 0% instances), flat:name (3; 0% instances), compound (1; 0% instances), csubj:outer (1; 0% instances), discourse (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech: VERB (7532; 69% instances), NOUN (1906; 17% instances), ADJ (409; 4% instances), PROPN (250; 2% instances), PRON (248; 2% instances), (215; 2% instances), ADV (141; 1% instances), AUX (74; 1% instances), DET (55; 1% instances), NUM (51; 0% instances), ADP (18; 0% instances), INTJ (12; 0% instances), CCONJ (2; 0% instances), PART (2; 0% instances), X (1; 0% instances)

2007 (18%) NOUN nodes are leaves.

3202 (29%) NOUN nodes have one child.

3395 (31%) NOUN nodes have two children.

2312 (21%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 12.

Children of NOUN nodes are attached using 29 different relations: case (4907; 26% instances), det (4082; 22% instances), nmod (2766; 15% instances), punct (1484; 8% instances), cc (902; 5% instances), conj (806; 4% instances), cop (632; 3% instances), amod (631; 3% instances), acl (582; 3% instances), orphan (353; 2% instances), nsubj (340; 2% instances), nummod (247; 1% instances), mark (232; 1% instances), advmod (174; 1% instances), advcl (132; 1% instances), obl (132; 1% instances), appos (130; 1% instances), iobj (94; 0% instances), discourse (59; 0% instances), xcomp (56; 0% instances), ccomp (40; 0% instances), csubj (20; 0% instances), vocative (13; 0% instances), compound:redup (9; 0% instances), obj (9; 0% instances), flat:name (3; 0% instances), obl:arg (3; 0% instances), parataxis (3; 0% instances), compound (2; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: ADP (4942; 26% instances), DET (4213; 22% instances), NOUN (1906; 10% instances), PRON (1797; 10% instances), PUNCT (1484; 8% instances), CCONJ (974; 5% instances), VERB (819; 4% instances), ADJ (723; 4% instances), AUX (645; 3% instances), PROPN (490; 3% instances), NUM (271; 1% instances), SCONJ (260; 1% instances), PART (148; 1% instances), ADV (140; 1% instances), INTJ (31; 0% instances)