home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: ADJ

There are 2286 ADJ lemmas (19%), 2366 ADJ types (10%) and 9473 ADJ tokens (9%). Out of 17 observed tags, the rank of ADJ is: 2 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent ADJ lemmas: նոր, մեծ, առաջին, կարող, րդ, պետական, սովետական, շատ, վերջին, բարձր

The 10 most frequent ADJ types: նոր, մեծ, առաջին, կարող, րդ, պետական, սովետական, վերջին, շատ, հայ

The 10 most frequent ambiguous lemmas: նոր (ADJ 187, ADV 19, NOUN 3), մեծ (ADJ 185, NOUN 6, ADV 1), առաջին (ADJ 156, NOUN 12), րդ (ADJ 109, NOUN 1), շատ (ADV 96, ADJ 74, NOUN 19), վերջին (ADJ 74, NOUN 13), բարձր (ADJ 70, ADV 13), լավ (ADJ 70, ADV 20, NOUN 4, INTJ 3), հայ (ADJ 67, NOUN 53), գլխավոր (ADJ 63, NOUN 3)

The 10 most frequent ambiguous types: նոր (ADJ 163, ADV 17), մեծ (ADJ 156, NOUN 2, ADV 1), կարող (ADJ 115, VERB 2), վերջին (ADJ 51, NOUN 4), շատ (ADV 89, ADJ 65, NOUN 3), հայ (ADJ 47, NOUN 8, ADV 1), պետք (AUX 65, ADJ 56, NOUN 2), բարձր (ADJ 53, ADV 11), նման (ADJ 35, ADP 21, ADV 1), լավ (ADJ 40, ADV 15, INTJ 1)

Morphology

The form / lemma ratio of ADJ is 1.034996 (the average of all parts of speech is 1.883575).

The 1st highest number of forms (5) was observed with the lemma “թեթև”: ԹԵԹԵՎ, Թեթևագույն, ամենաթեթև, թեթեեեև, թեթև.

The 2nd highest number of forms (4) was observed with the lemma “ծանր”: ամենածանր, գերծանր, ծանր, ծանրագույն.

The 3rd highest number of forms (4) was observed with the lemma “մեծ”: ամենամեծ, մեեեեծ, մեծ, մեծագույն.

ADJ occurs with 11 features: Degree (3864; 41% instances), NumForm (471; 5% instances), NumType (450; 5% instances), NameType (373; 4% instances), Hyph (189; 2% instances), Style (92; 1% instances), Abbr (67; 1% instances), ExtPos (17; 0% instances), Poss (10; 0% instances), Echo (7; 0% instances), Typo (3; 0% instances)

ADJ occurs with 22 feature-value pairs: Abbr=Yes, Degree=Abs, Degree=Pos, Degree=Sup, Echo=Ech, ExtPos=ADV, ExtPos=PART, Hyph=Yes, NameType=Geo, NumForm=Armenian, NumForm=Combi, NumForm=Roman, NumForm=Word, NumType=Ord, Poss=Yes, Style=Arch, Style=Coll, Style=Expr, Style=Rare, Style=Slng, Style=Vrnc, Typo=Yes

ADJ occurs with 38 feature combinations. The most frequent feature combination is _ (4777 tokens). Examples: պետական, հայ, ազգային, տնտեսական, պետք, քաղաքական, կարելի, հայկական, սոցիալիստական, նման

Relations

ADJ nodes are attached to their parents using 31 different relations: amod (7524; 79% instances), conj (574; 6% instances), root (416; 4% instances), xcomp (173; 2% instances), dep (151; 2% instances), compound (99; 1% instances), advcl (82; 1% instances), ccomp (79; 1% instances), parataxis (78; 1% instances), acl:relcl (64; 1% instances), compound:lvc (47; 0% instances), acl (28; 0% instances), advmod (28; 0% instances), flat (18; 0% instances), fixed (16; 0% instances), appos (14; 0% instances), discourse (13; 0% instances), orphan (13; 0% instances), compound:redup (11; 0% instances), csubj (11; 0% instances), advcl:relcl (6; 0% instances), nmod (6; 0% instances), dislocated (5; 0% instances), flat:name (5; 0% instances), nmod:poss (5; 0% instances), nsubj (2; 0% instances), advmod:emph (1; 0% instances), csubj:pass (1; 0% instances), iobj (1; 0% instances), nmod:npmod (1; 0% instances), obj (1; 0% instances)

Parents of ADJ nodes belong to 13 different parts of speech: NOUN (7504; 79% instances), ADJ (587; 6% instances), VERB (552; 6% instances), (416; 4% instances), PROPN (197; 2% instances), NUM (171; 2% instances), PRON (18; 0% instances), ADP (15; 0% instances), ADV (8; 0% instances), AUX (2; 0% instances), DET (1; 0% instances), PART (1; 0% instances), X (1; 0% instances)

7047 (74%) ADJ nodes are leaves.

1351 (14%) ADJ nodes have one child.

225 (2%) ADJ nodes have two children.

850 (9%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 10.

Children of ADJ nodes are attached using 34 different relations: punct (1522; 26% instances), cop (775; 13% instances), conj (638; 11% instances), advmod (488; 8% instances), cc (454; 8% instances), nsubj (447; 8% instances), obl (439; 8% instances), csubj (205; 4% instances), xcomp (144; 2% instances), mark (125; 2% instances), compound (116; 2% instances), advcl (86; 1% instances), parataxis (83; 1% instances), discourse (67; 1% instances), aux (50; 1% instances), iobj (37; 1% instances), obj (34; 1% instances), appos (23; 0% instances), fixed (16; 0% instances), compound:redup (11; 0% instances), nmod:poss (6; 0% instances), nummod (6; 0% instances), amod (5; 0% instances), vocative (4; 0% instances), advcl:relcl (3; 0% instances), ccomp (3; 0% instances), case (2; 0% instances), det:poss (2; 0% instances), dislocated (2; 0% instances), csubj:outer (1; 0% instances), dep (1; 0% instances), nmod (1; 0% instances), nsubj:caus (1; 0% instances), orphan (1; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: PUNCT (1522; 26% instances), AUX (825; 14% instances), NOUN (759; 13% instances), VERB (626; 11% instances), ADJ (587; 10% instances), ADV (473; 8% instances), CCONJ (446; 8% instances), PRON (241; 4% instances), SCONJ (115; 2% instances), PART (82; 1% instances), PROPN (38; 1% instances), NUM (36; 1% instances), DET (24; 0% instances), ADP (10; 0% instances), INTJ (10; 0% instances), X (4; 0% instances)