home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-STAF: POS Tags: NUM

There are 13 NUM lemmas (1%), 13 NUM types (1%) and 18 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 11 in number of lemmas, 12 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: dy, ‘99, dhjetë, disa, gjashtë, katër, njëzet, pesë, pesëdhjetë, shtatë

The 10 most frequent NUM types: dy, ‘99, dhjetëra, disa, gjashtë, katër, njëzet, pesave, pesëdhjetë, shtatë

The 10 most frequent ambiguous lemmas: dy (NUM 6, ADJ 1), disa (NUM 1, PRON 1)

The 10 most frequent ambiguous types: disa (NUM 1, PRON 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.223770).

The 1st highest number of forms (1) was observed with the lemma “’99”: ‘99.

The 2nd highest number of forms (1) was observed with the lemma “dhjetë”: dhjetëra.

The 3rd highest number of forms (1) was observed with the lemma “disa”: disa.

NUM occurs with 1 features: NumType (12; 67% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (12 tokens). Examples: dy, gjashtë, katër, pesëdhjetë, shtatë, tetë, tre

Relations

NUM nodes are attached to their parents using 4 different relations: nummod (14; 78% instances), conj (2; 11% instances), compound (1; 6% instances), nmod:poss (1; 6% instances)

Parents of NUM nodes belong to 2 different parts of speech: NOUN (16; 89% instances), NUM (2; 11% instances)

14 (78%) NUM nodes are leaves.

1 (6%) NUM nodes have one child.

0 (0%) NUM nodes have two children.

3 (17%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 8 different relations: det (2; 20% instances), punct (2; 20% instances), amod (1; 10% instances), case (1; 10% instances), cc (1; 10% instances), compound (1; 10% instances), conj (1; 10% instances), nmod (1; 10% instances)

Children of NUM nodes belong to 7 different parts of speech: DET (2; 20% instances), NUM (2; 20% instances), PUNCT (2; 20% instances), ADJ (1; 10% instances), ADP (1; 10% instances), CCONJ (1; 10% instances), NOUN (1; 10% instances)