
This page pertains to UD version 2.

Treebank Statistics: UD_Yiddish-YiTB: POS Tags: NUM

There are 45 NUM lemmas (1%), 51 NUM types (1%) and 110 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: צװײ, אײן, דרײַען, דרײַסיק, אַכט, גנבֿה, זיבעציק, געװינס, זיבן, ליד

The 10 most frequent NUM types: צוויי, איין, צװײ, דרײַ, אײן, דרײַסיק, אַכט, זיבעציק, צען, איינס

The 10 most frequent ambiguous lemmas: ליד (NOUN 5, NUM 2), מיליאָן (NOUN 2, NUM 2), פֿיר (VERB 5, NUM 2), אַנטיסעמיט (NOUN 3, NUM 1), הונדערן (NOUN 1, NUM 1), הימל (NOUN 14, NUM 1), טױזנט (ADJ 1, NUM 1), נעבעך (ADV 2, ADJ 1, NUM 1), רעליגיע (NOUN 17, NUM 1)

The 10 most frequent ambiguous types: מיליאָן (NOUN 2, NUM 2), פֿיר (NUM 2, VERB 2), הונדערט (NOUN 1, NUM 1), פֿינף (NOUN 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.133333 (the average of all parts of speech is 1.222136).
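The NUM ratio can be reproduced from the counts given at the top of this page, assuming it is the number of distinct NUM types divided by the number of distinct NUM lemmas. A minimal check under that assumption:

```python
# Counts taken from the summary at the top of this page.
num_types = 51   # distinct NUM word forms (types)
num_lemmas = 45  # distinct NUM lemmas

# Assumption: the form / lemma ratio is types divided by lemmas.
form_lemma_ratio = num_types / num_lemmas
print(f"{form_lemma_ratio:.6f}")
```

The printed value agrees with the ratio reported above to six decimal places.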

The 1st highest number of forms (3) was observed with the lemma “אײן”: איין, איינס, אײן.

The 2nd highest number of forms (3) was observed with the lemma “גנבֿה”: 1.5, 11, 1917.

The 3rd highest number of forms (2) was observed with the lemma “געװינס”: 18,000, 1984.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (79; 72% instances), obl (7; 6% instances), compound (6; 5% instances), conj (6; 5% instances), obj (5; 5% instances), nmod (2; 2% instances), root (2; 2% instances), fixed (1; 1% instances), flat (1; 1% instances), nsubj (1; 1% instances)
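The percentages above appear to be each relation's count divided by the 110 NUM tokens, rounded to the nearest integer. The sketch below recomputes them from the listed counts; the rounding rule and the helper name `share` are assumptions, not something the page states.

```python
from collections import Counter

# Relation counts copied from the list above.
relation_counts = Counter({
    "nummod": 79, "obl": 7, "compound": 6, "conj": 6, "obj": 5,
    "nmod": 2, "root": 2, "fixed": 1, "flat": 1, "nsubj": 1,
})

# The counts should add up to the number of NUM tokens.
total = sum(relation_counts.values())

def share(count: int, total: int) -> int:
    """Percentage rounded to the nearest integer (assumed rounding rule)."""
    return round(100 * count / total)

for relation, count in relation_counts.most_common():
    print(f"{relation}: {count} ({share(count, total)}%)")
```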

Parents of NUM nodes belong to 8 different parts of speech: NOUN (82; 75% instances), VERB (13; 12% instances), NUM (7; 6% instances), PROPN (3; 3% instances), no tag, i.e. the artificial root (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances), DET (1; 1% instances)

85 (77%) NUM nodes are leaves.

21 (19%) NUM nodes have one child.

0 (0%) NUM nodes have two children.

4 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
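The child-degree figures above can be derived by counting, for each NUM node, how many tokens name it as their head. The following is a minimal sketch on a hypothetical three-column token list (ID, HEAD, UPOS, as in CoNLL-U); the toy sentence and the function name `num_child_degrees` are illustrative and not taken from the treebank.

```python
from collections import Counter

# Hypothetical toy sentence as (id, head, upos) triples; head 0 is the root.
tokens = [
    (1, 2, "NUM"),   # leaf: no token points to it
    (2, 0, "NOUN"),
    (3, 5, "ADP"),
    (4, 5, "DET"),
    (5, 2, "NUM"),   # two children: tokens 3 and 4
]

def num_child_degrees(tokens):
    """Return the number of children of every NUM node, in sentence order."""
    children = Counter(head for _, head, _ in tokens)
    return [children[tid] for tid, _, upos in tokens if upos == "NUM"]

degrees = num_child_degrees(tokens)

# Bucket the degrees as on this page: 0, 1, 2, and "three or more" (key 3).
buckets = Counter(min(d, 3) for d in degrees)
print(degrees, dict(buckets), max(degrees))
```

Applied to the whole treebank, the four buckets would have to sum to the 110 NUM tokens, as the reported counts do (85 + 21 + 0 + 4).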

Children of NUM nodes are attached using 11 different relations: case (8; 21% instances), cc (8; 21% instances), compound (4; 10% instances), det (4; 10% instances), advmod (3; 8% instances), conj (3; 8% instances), punct (3; 8% instances), cop (2; 5% instances), nsubj (2; 5% instances), amod (1; 3% instances), discourse (1; 3% instances)

Children of NUM nodes belong to 12 different parts of speech: CCONJ (8; 21% instances), ADP (7; 18% instances), NUM (7; 18% instances), DET (5; 13% instances), PUNCT (3; 8% instances), ADJ (2; 5% instances), AUX (2; 5% instances), ADV (1; 3% instances), INTJ (1; 3% instances), NOUN (1; 3% instances), PART (1; 3% instances), SCONJ (1; 3% instances)