home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ancient_Hebrew-PTNK: POS Tags: NUM

There are 25 NUM lemmas (1%), 558 NUM types (2%) and 2475 NUM tokens (2%). Out of 15 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: אחד, שׁנים, שׁבע, חמשׁ, מאה, שׁלשׁ, ארבע, אלף, שׁשׁ, עשׂר

The 10 most frequent NUM types: אֶחָ֖ד, אֶ֖לֶף, שְׁתֵּ֣י, מֵאֹ֖ות, שְׁנֵי, אֶחָד֙, מֵאֹֽות, אֶחָ֑ד, שִׁבְעַ֥ת, אֶחָֽד

The 10 most frequent ambiguous lemmas: שׁבע (NUM 230, VERB 70, PROPN 11), חמשׁ (NUM 211, VERB 2, NOUN 1), שׁלשׁ (NUM 183, VERB 4), ארבע (NUM 159, PROPN 2), אלף (NUM 117, NOUN 10), שׁשׁ (NUM 98, NOUN 34), עשׂר (NUM 75, VERB 5), שׁני (NUM 63, NOUN 35), שׁשׁי (NUM 9, PROPN 1)

The 10 most frequent ambiguous types: שְׁנֵ֣י (NUM 26, NOUN 7), שְׁנֵ֥י (NUM 23, NOUN 3), שֵׁ֥שׁ (NUM 15, NOUN 10), שְׁנֵ֖י (NUM 14, NOUN 1), שְׁנֵֽי (NUM 14, NOUN 2), שְׁנֵ֤י (NUM 11, NOUN 2), שְׁנֵי֙ (NUM 11, NOUN 5), שֵׁ֣שׁ (NOUN 13, NUM 6), שִׁבְעָ֑ה (NUM 2, PROPN 1), אַלְפֵ֣י (NOUN 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 22.320000 (the average of all parts of speech is 7.796973).

The 1st highest number of forms (48) was observed with the lemma “שׁנים”: שְּׁנֵ֖י, שְּׁנֵ֣י, שְּׁנֵ֥י, שְׁ֚תַּיִם, שְׁנֵ֖י, שְׁנֵ֛י, שְׁנֵ֣י, שְׁנֵ֣ים, שְׁנֵ֤י, שְׁנֵ֥י, שְׁנֵ֥ים, שְׁנֵ֧י, שְׁנֵ֧ים, שְׁנֵ֨י, שְׁנֵֽי, שְׁנֵי, שְׁנֵי֙, שְׁנֵים, שְׁנַ֔יִם, שְׁנַ֖יִם, שְׁנַ֜יִם, שְׁנַ֣יִם, שְׁנַ֤יִם, שְׁנַ֥יִם, שְׁנַ֧יִם, שְׁנַ֨יִם, שְׁנַ֨יִם֙, שְׁנַיִם֒, שְׁנָ֑יִם, שְׁנָֽיִם, שְׁתֵּ֖י, שְׁתֵּ֛י, שְׁתֵּ֣י, שְׁתֵּ֣ים, שְׁתֵּ֤י, שְׁתֵּ֤ים, שְׁתֵּ֥י, שְׁתֵּ֥ים, שְׁתֵּ֧י, שְׁתֵּֽי, שְׁתֵּי, שְׁתֵּי֙, שְׁתֵּי֩, שְׁתֵּים, שְׁתַּ֖יִם, שְׁתַּ֤יִם, שְׁתַּ֥יִם, שְׁתַּ֧יִם.

The 2nd highest number of forms (44) was observed with the lemma “מאה”: מְאַ֖ת, מְאַ֣ת, מְאַ֥ת, מְאַ֧ת, מְאַ֨ת, מְאַת֙, מֵ֣אָה, מֵּאֹ֑ות, מֵּאֹ֔ות, מֵּאֹֽות, מֵּאֹות֙, מֵאָ֔ה, מֵאָ֖ה, מֵאָ֗ה, מֵאָ֣ה, מֵאָ֤ה, מֵאָ֥ה, מֵאָ֧ה, מֵאָ֨ה, מֵאָֽה, מֵאָה֩, מֵאָה֮, מֵאֹ֑ות, מֵאֹ֔ות, מֵאֹ֖ות, מֵאֹ֖ת, מֵאֹ֗ות, מֵאֹ֛ות, מֵאֹ֜ות, מֵאֹ֣ות, מֵאֹ֤ות, מֵאֹ֥ות, מֵאֹ֧ות, מֵאֹ֧ת, מֵאֹ֨ות, מֵאֹֽות, מֵאֹות֙, מָּאתָ֑יִם, מָאתַ֔יִם, מָאתַ֖יִם, מָאתַ֣יִם, מָאתַ֨יִם֙, מָאתָ֑יִם, מָאתָֽיִם.

The 3rd highest number of forms (43) was observed with the lemma “חמשׁ”: חְמֵ֥שׁ, חֲמִשִּֽׁים, חֲמִשִּׁ֑ים, חֲמִשִּׁ֔ים, חֲמִשִּׁ֖ים, חֲמִשִּׁ֗ים, חֲמִשִּׁ֛ים, חֲמִשִּׁ֣ים, חֲמִשִּׁ֤ים, חֲמִשִּׁ֥ים, חֲמִשִּׁים֙, חֲמִשָּֽׁה, חֲמִשָּׁ֑ה, חֲמִשָּׁ֔ה, חֲמִשָּׁ֕ה, חֲמִשָּׁ֖ה, חֲמִשָּׁ֣ה, חֲמִשָּׁ֤ה, חֲמִשָּׁ֥ה, חֲמִשָּׁ֧ה, חֲמִשָּׁ֨ה, חֲמִשָּׁה֙, חֲמִשָּׁה֩, חֲמֵ֖שֶׁת, חֲמֵ֖שׁ, חֲמֵ֛שֶׁת, חֲמֵ֣שׁ, חֲמֵ֤שׁ, חֲמֵ֥שֶׁת, חֲמֵ֥שׁ, חֲמֵ֧שֶׁת, חֲמֵ֨שׁ, חֲמֵֽשֶׁת, חֲמֵשׁ, חָֽמֵשׁ, חָמֵ֖שׁ, חָמֵ֜שׁ, חָמֵ֣שׁ, חָמֵ֤שׁ, חָמֵ֥שׁ, חָמֵ֧שׁ, חָמֵ֨שׁ, חָמֵשׁ֩.

NUM occurs with 3 features: NumType (2475; 100% instances), Number (2475; 100% instances), Gender (1451; 59% instances)

NUM occurs with 7 feature-value pairs: Gender=Fem, Gender=Masc, NumType=Card, NumType=Ord, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 11 feature combinations. The most frequent feature combination is Number=Sing|NumType=Card (784 tokens). Examples: אֶחָ֖ד, אֶ֖לֶף, אֶחָד֙, אֶחָ֑ד, אֶחָֽד, אֶחָ֥ד, עָשָׂ֖ר, שֶׁ֣בַע, אֶחָ֔ד, אַרְבַּ֥ע

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (1606; 65% instances), flat (293; 12% instances), conj (282; 11% instances), obl (75; 3% instances), root (56; 2% instances), compound:smixut (39; 2% instances), parataxis (37; 1% instances), nsubj (29; 1% instances), obj (27; 1% instances), nmod (9; 0% instances), orphan (7; 0% instances), xcomp (6; 0% instances), appos (3; 0% instances), ccomp (3; 0% instances), dislocated (2; 0% instances), advcl (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (1665; 67% instances), NUM (538; 22% instances), VERB (147; 6% instances), (56; 2% instances), PRON (47; 2% instances), ADJ (9; 0% instances), AUX (6; 0% instances), PROPN (6; 0% instances), ADV (1; 0% instances)

1416 (57%) NUM nodes are leaves.

768 (31%) NUM nodes have one child.

170 (7%) NUM nodes have two children.

121 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 10.

Children of NUM nodes are attached using 20 different relations: det (342; 20% instances), flat (294; 17% instances), cc (288; 17% instances), conj (252; 14% instances), punct (186; 11% instances), nsubj (118; 7% instances), case (85; 5% instances), nmod (60; 3% instances), obl (42; 2% instances), cop (21; 1% instances), orphan (15; 1% instances), parataxis (12; 1% instances), dislocated (9; 1% instances), advcl (3; 0% instances), advmod (3; 0% instances), xcomp (3; 0% instances), acl:relcl (2; 0% instances), csubj (2; 0% instances), appos (1; 0% instances), discourse (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NUM (538; 31% instances), DET (338; 19% instances), CCONJ (288; 17% instances), NOUN (241; 14% instances), PUNCT (186; 11% instances), ADP (87; 5% instances), AUX (22; 1% instances), PRON (19; 1% instances), VERB (16; 1% instances), ADV (3; 0% instances), INTJ (1; 0% instances)