
This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Armenian-CAVaL: POS Tags: DET

There are 29 DET lemmas (1%), 120 DET types (2%) and 5876 DET tokens (7%). Out of 16 observed tags, the rank of DET is: 9 in number of lemmas, 7 in number of types and 7 in number of tokens.

The 10 most frequent DET lemmas: ն, ամենայն, այս, այն, բազում, ս, դ, մի, իւր, իմ

The 10 most frequent DET types: ն, ամենայն, այս, ս, դ, մի, այն, բազում, որ, զ

The 10 most frequent ambiguous lemmas: ամենայն (DET 397, PRON 15), այս (DET 314, NOUN 29, PRON 10), այն (DET 298, PRON 35), բազում (DET 188, ADV 1, NOUN 1, PRON 1), մի (PART 311, NUM 193, DET 164, INTJ 1), իւր (PRON 356, DET 145), որ (PRON 1267, DET 88), այդ (DET 73, PRON 6), զ (ADP 3934, DET 63), ոմն (PRON 72, DET 51)
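The per-tag counts above can be turned into a share of DET readings for each ambiguous lemma. The following is a minimal sketch using three of the listed lemmas; the dictionary layout and the helper name `tag_share` are illustrative assumptions, not part of the treebank tooling.

```python
# Counts copied from the list of ambiguous lemmas above.
lemma_counts = {
    "ամենայն": {"DET": 397, "PRON": 15},
    "մի": {"PART": 311, "NUM": 193, "DET": 164, "INTJ": 1},
    "զ": {"ADP": 3934, "DET": 63},
}


def tag_share(counts, tag="DET"):
    """Return the fraction of a lemma's occurrences that carry `tag`."""
    total = sum(counts.values())
    return counts.get(tag, 0) / total


for lemma, counts in lemma_counts.items():
    print(lemma, f"{tag_share(counts):.1%}")
```

On these figures, ամենայն is tagged DET in roughly 96% of its occurrences, whereas զ is overwhelmingly an adposition and մի is a determiner in only about a quarter of cases.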

The 10 most frequent ambiguous types: ամենայն (DET 374, PRON 9), այս (DET 195, NOUN 14, PRON 7), ս (DET 179, PRON 1), մի (PART 293, DET 162, NUM 158, INTJ 1), այն (DET 131, PRON 13), բազում (DET 87, NOUN 1), որ (PRON 1012, DET 69), զ (ADP 3879, DET 63), այդ (DET 57, PRON 5), իւրում (DET 50, PRON 2)

Morphology

The form / lemma ratio of DET is 4.137931 (the average of all parts of speech is 2.533817).
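The reported ratio is consistent with dividing the number of DET types by the number of DET lemmas given at the top of the page. A short check, assuming that "form" here corresponds to "type":

```python
# Counts taken from the summary above: 120 DET types and 29 DET lemmas.
det_types = 120
det_lemmas = 29

ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # 4.137931
```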

The highest number of forms (13) was observed with the lemma “այն”: այն, այնմ, այնմանէ, այնմիկ, այնոսիկ, այնորիկ, այնոցիկ, այնու, այնուիկ, այնոքիկ, այնց, այնցանէ, այս.

The 2nd highest number of forms (13) was observed with the lemma “այս”: այս, այսըւ, այսմ, այսմանէ, այսմիկ, այսոսիկ, այսորիկ, այսոցիկ, այսոքիկ, այսր, այսց, այսցանէ, այսք.

The 3rd highest number of forms (11) was observed with the lemma “իւր”: իրում, իւր, իւրեանց, իւրմէ, իւրոյ, իւրով, իւրովք, իւրոց, իւրում, իւրս, իւրք.

DET occurs with 9 features: PronType (5610; 95% instances), Deixis (4396; 75% instances), Definite (3975; 68% instances), Case (2018; 34% instances), Number (2018; 34% instances), Person (387; 7% instances), Poss (373; 6% instances), Reflex (145; 2% instances), Animacy (89; 2% instances)

DET occurs with 28 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Definite=Spec, Deixis=Med, Deixis=Prox, Deixis=Remt, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes

DET occurs with 130 feature combinations. The most frequent feature combination is Definite=Def|Deixis=Remt|PronType=Dem (3341 tokens). Examples: ն, դ

Relations

DET nodes are attached to their parents using 21 different relations: det (5141; 87% instances), obj (208; 4% instances), nsubj (176; 3% instances), orphan (107; 2% instances), obl (57; 1% instances), advcl (56; 1% instances), conj (33; 1% instances), iobj (24; 0% instances), fixed (17; 0% instances), root (16; 0% instances), nmod (11; 0% instances), ccomp (9; 0% instances), acl (6; 0% instances), obl:arg (5; 0% instances), nsubj:pass (3; 0% instances), nsubj:caus (2; 0% instances), advmod (1; 0% instances), appos (1; 0% instances), csubj (1; 0% instances), dislocated (1; 0% instances), obl:agent (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (4213; 72% instances), VERB (734; 12% instances), ADJ (314; 5% instances), PRON (311; 5% instances), PROPN (93; 2% instances), NUM (80; 1% instances), DET (56; 1% instances), ADV (33; 1% instances), no part of speech, i.e. the artificial root as parent (16; 0% instances), SCONJ (16; 0% instances), AUX (7; 0% instances), INTJ (2; 0% instances), PART (1; 0% instances)

5226 (89%) DET nodes are leaves.

462 (8%) DET nodes have one child.

121 (2%) DET nodes have two children.

67 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 8.
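The child-degree figures above could be recomputed from the treebank's CoNLL-U files along the following lines. This is a rough sketch, not the script that generated this page: the function name `child_degree_histogram` is hypothetical, and the sample sentences are invented English toy data rather than treebank content.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos="DET"):
    """Bucket nodes with the given UPOS tag by their number of children.

    Returns (histogram, max_degree); histogram key 3 means "three or more".
    """
    histogram = Counter()
    max_degree = 0
    sentence = []  # (id, upos, head) triples of the current sentence

    def flush():
        nonlocal max_degree
        # Count how many nodes name each ID as their head.
        children = Counter(head for _, _, head in sentence)
        for node_id, tag, _ in sentence:
            if tag == upos:
                degree = children[node_id]
                max_degree = max(max_degree, degree)
                histogram[min(degree, 3)] += 1
        sentence.clear()

    for line in conllu_text.splitlines():
        if not line.strip():
            flush()  # a blank line ends the sentence
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges and empty nodes
        sentence.append((cols[0], cols[3], cols[6]))
    flush()  # handle a final sentence without a trailing blank line
    return histogram, max_degree


# Invented toy input: two sentences, each containing one DET node.
rows = [
    ["1", "that", "that", "DET", "_", "_", "0", "root", "_", "_"],
    ["2", "which", "which", "PRON", "_", "_", "3", "nsubj", "_", "_"],
    ["3", "barks", "bark", "VERB", "_", "_", "1", "acl", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"],
    [],
    ["1", "the", "the", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "dog", "dog", "NOUN", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in rows)

histogram, max_degree = child_degree_histogram(SAMPLE)
print(dict(histogram), max_degree)
```

In the toy input, the first DET node has two children (the `acl` and `punct` dependents) and the second is a leaf, so the histogram has one entry each in buckets 2 and 0.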

Children of DET nodes are attached using 20 different relations: case (473; 49% instances), acl (131; 14% instances), punct (92; 10% instances), det (52; 5% instances), cop (39; 4% instances), cc (28; 3% instances), nmod (27; 3% instances), conj (23; 2% instances), nsubj (21; 2% instances), advmod (19; 2% instances), advcl (12; 1% instances), obl (11; 1% instances), mark (10; 1% instances), orphan (7; 1% instances), ccomp (5; 1% instances), csubj (5; 1% instances), appos (3; 0% instances), discourse (2; 0% instances), amod (1; 0% instances), iobj (1; 0% instances)

Children of DET nodes belong to 13 different parts of speech: ADP (475; 49% instances), VERB (141; 15% instances), PUNCT (92; 10% instances), DET (56; 6% instances), NOUN (55; 6% instances), AUX (41; 4% instances), CCONJ (31; 3% instances), PRON (26; 3% instances), PART (13; 1% instances), SCONJ (11; 1% instances), ADJ (9; 1% instances), ADV (9; 1% instances), PROPN (3; 0% instances)