
This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Armenian-CAVaL: POS Tags: NUM

There are 32 NUM lemmas (1%), 63 NUM types (1%) and 556 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 7 in number of lemmas, 9 in number of types and 14 in number of tokens.
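The three counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `count_upos`, the helper `row`, and the sample sentence are made up for illustration, and it assumes that types are distinct word forms compared case-sensitively.

```python
def count_upos(conllu_text, upos):
    """Return (lemmas, types, tokens) counts for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            tokens += 1
            types.add(form)
            lemmas.add(lemma)
    return len(lemmas), len(types), tokens


def row(tid, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for the demo)."""
    return "\t".join([str(tid), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Made-up sample sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# sent_id = demo",
    row(1, "երկուս", "երկու", "NUM", 2, "nummod"),
    row(2, "w1", "w1", "NOUN", 0, "root"),
    row(3, "մի", "մի", "PART", 2, "advmod"),
    row(4, "երկու", "երկու", "NUM", 2, "conj"),
    row(5, "մի", "մի", "NUM", 2, "nummod"),
])

print(count_upos(SAMPLE, "NUM"))  # (2, 3, 3)
```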

The 10 most frequent NUM lemmas: մի, երկու, երկոտասան, հինգ, երեք, եւթն, հազար, հարիւր, տասն, վեց

The 10 most frequent NUM types: մի, երկուս, հինգ, երկու, եւթն, երիս, երկոտասան, երկոտասանից, երկուց, հարիւր

The 10 most frequent ambiguous lemmas: մի (PART 311, NUM 193, DET 164, INTJ 1)

The 10 most frequent ambiguous types: մի (PART 293, DET 162, NUM 158, INTJ 1), միոյ (NUM 10, DET 1), միոջ (NUM 4, DET 1)

Morphology

The form / lemma ratio of NUM is 1.968750 (the average of all parts of speech is 2.533817).
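The reported ratio is consistent with dividing the number of NUM types (63) by the number of NUM lemmas (32). A minimal sketch of that arithmetic, with `form_lemma_ratio` as an illustrative name:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts for NUM as reported on this page: 63 types, 32 lemmas.
print(f"{form_lemma_ratio(63, 32):.6f}")  # 1.968750
```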

The highest number of forms (8) was observed with the lemma “մի”: մի, միո, միոյ, միոյն, միոջ, միոջէ, միով, միում.

The 2nd highest number of forms (5) was observed with the lemma “երկոտասան”: երկոտասան, երկոտասանից, երկոտասանիւք, երկոտասանս, երկոտասանք.

The 3rd highest number of forms (4) was observed with the lemma “երեք”: երեք, երիս, երից, երիւք.

NUM occurs with 3 features: NumType (548; 99% instances), Case (523; 94% instances), Number (523; 94% instances)

NUM occurs with 11 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, NumType=Card, NumType=Sets, Number=Plur, Number=Sing

NUM occurs with 21 feature combinations. The most frequent feature combination is Case=Acc|Number=Sing|NumType=Card (166 tokens). Examples: մի, հինգ, եւթն, երկոտասան, հարիւր, երեսուն, տասն, երկոտասանս, ինն, յիսուն
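Features, feature-value pairs and feature combinations are three ways of counting the same FEATS column. The sketch below shows one way to tally them, assuming the standard CoNLL-U encoding (`Name=Value` pairs joined by `|`, `_` for no features). The function names and the sample values are illustrative only.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; "_" means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_stats(feats_column):
    """Count features, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos


# Made-up FEATS values for four NUM tokens.
DEMO_FEATS = [
    "Case=Acc|Number=Sing|NumType=Card",
    "Case=Acc|Number=Sing|NumType=Card",
    "Case=Nom|Number=Plur|NumType=Card",
    "_",
]

features, pairs, combos = feature_stats(DEMO_FEATS)
print(features["Case"])       # 3
print(pairs["Case=Acc"])      # 2
print(combos.most_common(1))  # [('Case=Acc|Number=Sing|NumType=Card', 2)]
```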

Relations

NUM nodes are attached to their parents using 19 different relations: nummod (284; 51% instances), nsubj (61; 11% instances), conj (50; 9% instances), obj (37; 7% instances), orphan (28; 5% instances), obl (16; 3% instances), compound (14; 3% instances), nmod (14; 3% instances), advcl (10; 2% instances), appos (10; 2% instances), ccomp (8; 1% instances), iobj (6; 1% instances), nsubj:pass (5; 1% instances), compound:redup (4; 1% instances), root (4; 1% instances), obl:arg (2; 0% instances), obl:agent (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (271; 49% instances), VERB (163; 29% instances), NUM (61; 11% instances), PRON (19; 3% instances), ADJ (17; 3% instances), PROPN (11; 2% instances), AUX (6; 1% instances), ADV (4; 1% instances), no part of speech, which matches the 4 NUM nodes attached as root (4; 1% instances)

280 (50%) NUM nodes are leaves.

126 (23%) NUM nodes have one child.

94 (17%) NUM nodes have two children.

56 (10%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.
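The child degree of a node is the number of tokens whose HEAD points to it. The sketch below computes it for one sentence given as `(id, head, upos)` tuples, then checks that the four buckets reported above add up to the 556 NUM tokens. The function name and the sample sentence are hypothetical.

```python
from collections import Counter


def child_degrees(sentence, upos):
    """Return the number of children of each node tagged `upos`.

    `sentence` is a list of (id, head, upos) tuples; head 0 is the root.
    """
    n_children = Counter(head for _, head, _ in sentence)
    return [n_children[tid] for tid, _, tag in sentence if tag == upos]


# Made-up sentence: token 4 (NUM) governs tokens 3 and 5, token 1 (NUM) is a leaf.
DEMO = [(1, 2, "NUM"), (2, 0, "NOUN"), (3, 4, "ADP"), (4, 2, "NUM"), (5, 4, "DET")]
print(child_degrees(DEMO, "NUM"))  # [0, 2]

# Buckets as reported on this page: leaves, one child, two children, three or more.
REPORTED = {"0": 280, "1": 126, "2": 94, "3+": 56}
total = sum(REPORTED.values())
shares = {k: round(100 * v / total) for k, v in REPORTED.items()}
print(total)   # 556
print(shares)  # {'0': 50, '1': 23, '2': 17, '3+': 10}
```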

Children of NUM nodes are attached using 22 different relations: case (99; 19% instances), nmod (84; 16% instances), det (77; 15% instances), punct (51; 10% instances), orphan (40; 8% instances), cc (36; 7% instances), conj (31; 6% instances), cop (20; 4% instances), compound (13; 2% instances), advmod (12; 2% instances), acl (11; 2% instances), nsubj (10; 2% instances), mark (9; 2% instances), amod (6; 1% instances), nummod (6; 1% instances), advcl (5; 1% instances), appos (4; 1% instances), compound:redup (4; 1% instances), obl (4; 1% instances), csubj (2; 0% instances), ccomp (1; 0% instances), xcomp (1; 0% instances)

Children of NUM nodes belong to 14 different parts of speech: ADP (97; 18% instances), DET (80; 15% instances), NUM (61; 12% instances), NOUN (51; 10% instances), PUNCT (51; 10% instances), CCONJ (48; 9% instances), PRON (36; 7% instances), AUX (20; 4% instances), ADJ (18; 3% instances), VERB (18; 3% instances), PROPN (16; 3% instances), ADV (14; 3% instances), SCONJ (11; 2% instances), PART (5; 1% instances)