home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Yiddish-YiTB: POS Tags: NUM

There are 1 NUM lemmas (0%), 51 NUM types (1%) and 110 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 15 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: צוויי, איין, צװײ, דרײַ, אײן, דרײַסיק, אַכט, זיבעציק, צען, איינס

The 10 most frequent ambiguous lemmas: _ (DET 221, NUM 110, X 60, INTJ 26, VERB 7, NOUN 6, PRON 6, ADJ 2, ADV 2, AUX 1, PROPN 1, PUNCT 1)

The 10 most frequent ambiguous types: מיליאָן (NOUN 2, NUM 2), פֿיר (NUM 2, VERB 2), הונדערט (NOUN 1, NUM 1), פֿינף (NOUN 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 51.000000 (the average of all parts of speech is 1.264753).

The 1st highest number of forms (51) was observed with the lemma “_”: 1.5, 10, 11, 12, 18,000, 1806, 1859, 1916, 1917, 1929, 1939, 1984, 1988, 2002, 2019, 3, 3,142,560, 32, 4, 5, 53, 60, 75, 800, אַכט, אַכצן, אַנטיסעמיטין, אַרבע, איין, איינס, אײן, דריי, דרײַ, דרײַסיק, הונדערט, זיבן, זיבעציק, זעכציק, טויזנט, מיליאָן, נול, נײַנציק, נײַנצן, פֿופֿציק, פֿינף, פֿיר, צוואַנציק, צוויי, צען, צװאַנציק, צװײ.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (79; 72% instances), obl (7; 6% instances), compound (6; 5% instances), conj (6; 5% instances), obj (5; 5% instances), nmod (2; 2% instances), root (2; 2% instances), fixed (1; 1% instances), flat (1; 1% instances), nsubj (1; 1% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (82; 75% instances), VERB (13; 12% instances), NUM (7; 6% instances), PROPN (3; 3% instances), (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances), DET (1; 1% instances)

85 (77%) NUM nodes are leaves.

21 (19%) NUM nodes have one child.

0 (0%) NUM nodes have two children.

4 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 11 different relations: case (8; 21% instances), cc (8; 21% instances), compound (4; 10% instances), det (4; 10% instances), advmod (3; 8% instances), conj (3; 8% instances), punct (3; 8% instances), cop (2; 5% instances), nsubj (2; 5% instances), amod (1; 3% instances), discourse (1; 3% instances)

Children of NUM nodes belong to 12 different parts of speech: CCONJ (8; 21% instances), ADP (7; 18% instances), NUM (7; 18% instances), DET (5; 13% instances), PUNCT (3; 8% instances), ADJ (2; 5% instances), AUX (2; 5% instances), ADV (1; 3% instances), INTJ (1; 3% instances), NOUN (1; 3% instances), PART (1; 3% instances), SCONJ (1; 3% instances)