Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: NUM
There are 107 NUM lemmas (2%), 526 NUM types (4%) and 1285 NUM tokens (5%).
Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.
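The lemma, type and token counts above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes that a "type" is a distinct word form (case-sensitive) and that multiword-token ranges and empty nodes are skipped. The helper names `read_tokens` and `pos_counts` are hypothetical, and the three-token sample is invented for illustration.

```python
def read_tokens(conllu_text):
    """Yield the 10 CoNLL-U columns of every regular token line."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank line (sentence break) or comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) == 10 and cols[0].isdigit():
            yield cols


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for cols in read_tokens(conllu_text):
        if cols[3] == upos:        # column 4 is UPOS
            types.add(cols[1])     # column 2 is FORM
            lemmas.add(cols[2])    # column 3 is LEMMA
            tokens += 1
    return len(lemmas), len(types), tokens


# Invented toy sample: three NUM tokens sharing two lemmas.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "поло", "полъ", "NUM", "_", "NumType=Card", "0", "root", "_", "_"],
    ["2", "полъ", "полъ", "NUM", "_", "NumType=Card", "1", "conj", "_", "_"],
    ["3", "три", "триѥ", "NUM", "_", "NumType=Card", "1", "conj", "_", "_"],
])

print(pos_counts(SAMPLE, "NUM"))  # (2, 3, 3)
```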
The 10 most frequent NUM lemmas: ·в҃·, полъ, ·г҃·, два, ·е҃·, десѧть, ·д҃·, ·ѕ҃·, ·ӏ҃·, триѥ
The 10 most frequent NUM types: поло, ·в҃·, полъ, три, ·г҃·, :в҃:, :в:, ·г·, ·ӏ҃·, в҃
The 10 most frequent ambiguous lemmas: ·в҃· (NUM 156, ADJ 2), полъ (NUM 143, NOUN 3), ·г҃· (NUM 109, ADV 3, ADJ 1), ·е҃· (NUM 70, ADJ 1), десѧть (NUM 68, NOUN 1, X 1), ·д҃· (NUM 56, ADJ 2), ·ѕ҃· (NUM 48, ADJ 3), ·з҃· (NUM 30, ADJ 1), ·и҃· (NUM 16, ADJ 1), двои (NUM 9, ADJ 2)
The 10 most frequent ambiguous types: поло (NUM 48, NOUN 1), ·г҃· (NUM 30, ADV 1), в҃ (NUM 18, ADJ 1), :г҃: (NUM 15, ADV 2), пѧть (NUM 13, ADJ 2), десѧте (NUM 8, ADJ 1), дова (NUM 8, X 1), осмь (NUM 6, ADJ 1), шесть (NUM 6, ADJ 1), ·ӏ· (NUM 5, CCONJ 1)
Morphology
The form / lemma ratio of NUM is 4.915888 (the average of all parts of speech is 2.421872).
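The reported ratio agrees with dividing the number of NUM types (526) by the number of NUM lemmas (107) given earlier on this page. The short check below assumes that this is how the figure is derived; the function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_types / n_lemmas


ratio = form_lemma_ratio(526, 107)
print(f"{ratio:.6f}")  # 4.915888
```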
The 1st highest number of forms (38) was observed with the lemma “десѧть”: (д)есѧти, (де)
The 2nd highest number of forms (31) was observed with the lemma “полъ”: (п)[ол]о, (п)[оло, (п)олъ, (полъ, пл, п[ло], п[ол]ъ, п[оло], пло, по, по:ло, по:лꙑ, по
The 3rd highest number of forms (30) was observed with the lemma “два”: (д)[в]е, Вуо, д)
NUM occurs with 7 features: NumType (1117; 87% instances), NumForm (719; 56% instances), Case (557; 43% instances), Gender (294; 23% instances), Number (203; 16% instances), Degree (1; 0% instances), Typo (1; 0% instances)
NUM occurs with 21 feature-value pairs: Case=Acc, Case=Acc,Nom, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Combi, NumForm=Cyril, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Sets, Number=Dual, Number=Plur, Number=Sing, Typo=Yes
NUM occurs with 74 feature combinations.
The most frequent feature combination is NumForm=Digit|NumType=Card (679 tokens).
Examples: ·в҃·, ·г҃·, :в҃:, :в:, ·г·, ·ӏ҃·, в҃, г҃, :г҃:, ·в·
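Feature and feature-combination counts of this kind can be tallied from the FEATS column of a CoNLL-U file. The sketch below is an illustration under assumptions, not the generator of this page: `feature_stats` is a hypothetical helper, a token with several features contributes once to each feature name, and the sample rows are invented.

```python
from collections import Counter


def feature_stats(conllu_text, upos):
    """Count feature names and full FEATS combinations for one UPOS tag."""
    features, combos = Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular token lines carrying the requested tag.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        feats = cols[5]                      # column 6 is FEATS
        combos[feats] += 1
        if feats != "_":                     # "_" means no features
            for pair in feats.split("|"):
                name, _, _value = pair.partition("=")
                features[name] += 1
    return features, combos


# Invented toy sample; forms and lemmas are placeholders.
FEATS_SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "a", "a", "NUM", "_", "NumForm=Digit|NumType=Card", "0", "root", "_", "_"],
    ["2", "b", "b", "NUM", "_", "NumForm=Digit|NumType=Card", "1", "conj", "_", "_"],
    ["3", "c", "c", "NUM", "_", "Case=Nom|NumForm=Word|NumType=Card", "1", "conj", "_", "_"],
])

features, combos = feature_stats(FEATS_SAMPLE, "NUM")
print(combos.most_common(1))  # [('NumForm=Digit|NumType=Card', 2)]
```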
Relations
NUM nodes are attached to their parents using 18 different relations: nummod:gov (895; 70% instances), nsubj (82; 6% instances), conj (80; 6% instances), nummod (75; 6% instances), flat (43; 3% instances), nmod (37; 3% instances), root (29; 2% instances), obj (17; 1% instances), dep (7; 1% instances), obl (6; 0% instances), orphan (4; 0% instances), advcl (2; 0% instances), list (2; 0% instances), parataxis (2; 0% instances), amod (1; 0% instances), appos (1; 0% instances), dislocated (1; 0% instances), nsubj:pass (1; 0% instances)
Parents of NUM nodes belong to 10 different parts of speech: NOUN (969; 75% instances), NUM (134; 10% instances), PROPN (68; 5% instances), VERB (37; 3% instances), no parent, i.e. the NUM node is the root (29; 2% instances), X (23; 2% instances), ADJ (19; 1% instances), ADP (2; 0% instances), DET (2; 0% instances), PRON (2; 0% instances)
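The relation and parent part-of-speech distributions can be computed by resolving each token's HEAD within its sentence. The following is a rough sketch with hypothetical helper names (`sentences`, `attachment_stats`) and an invented two-sentence sample; the label "ROOT" for tokens whose HEAD is 0 is a choice made here for the example.

```python
from collections import Counter


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column regular token rows."""
    rows = []
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():            # a blank line ends a sentence
            if rows:
                yield rows
            rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if len(cols) == 10 and cols[0].isdigit():
                rows.append(cols)


def attachment_stats(conllu_text, upos):
    """Count DEPREL values and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for rows in sentences(conllu_text):
        pos_by_id = {cols[0]: cols[3] for cols in rows}
        for cols in rows:
            if cols[3] == upos:
                relations[cols[7]] += 1                  # column 8 is DEPREL
                # HEAD 0 has no token, so it falls back to "ROOT".
                parent_pos[pos_by_id.get(cols[6], "ROOT")] += 1
    return relations, parent_pos


def _sentence(rows):
    return "\n".join("\t".join(r) for r in rows)


# Invented toy sample with two sentences; forms are placeholders.
TREE_SAMPLE = "\n\n".join([
    _sentence([
        ["1", "a", "a", "NUM", "_", "_", "2", "nummod:gov", "_", "_"],
        ["2", "b", "b", "NOUN", "_", "_", "0", "root", "_", "_"],
    ]),
    _sentence([
        ["1", "c", "c", "NUM", "_", "_", "0", "root", "_", "_"],
        ["2", "d", "d", "NUM", "_", "_", "1", "conj", "_", "_"],
    ]),
])

relations, parent_pos = attachment_stats(TREE_SAMPLE, "NUM")
print(dict(relations))   # {'nummod:gov': 1, 'root': 1, 'conj': 1}
print(dict(parent_pos))  # {'NOUN': 1, 'ROOT': 1, 'NUM': 1}
```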
978 (76%) NUM nodes are leaves.
210 (16%) NUM nodes have one child.
58 (5%) NUM nodes have two children.
39 (3%) NUM nodes have three or more children.
The highest child degree of a NUM node is 37.
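The leaf and child-degree figures amount to a histogram of how many tokens name each NUM node as their HEAD. A minimal sketch follows, assuming regular token lines only; `child_degrees` is a hypothetical helper and the four-token sentence is invented.

```python
from collections import Counter


def child_degrees(conllu_text, upos):
    """Map child count -> number of `upos` nodes having that many children."""
    degrees = Counter()
    # Sentences are separated by blank lines in CoNLL-U.
    for block in conllu_text.split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        rows = [c for c in rows if len(c) == 10 and c[0].isdigit()]
        n_children = Counter(cols[6] for cols in rows)   # column 7 is HEAD
        for cols in rows:
            if cols[3] == upos:
                degrees[n_children[cols[0]]] += 1
    return degrees


# Invented toy sentence: NUM node 1 governs three tokens, NUM node 2 is a leaf.
DEGREE_SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "a", "a", "NUM", "_", "_", "0", "root", "_", "_"],
    ["2", "b", "b", "NUM", "_", "_", "1", "conj", "_", "_"],
    ["3", "c", "c", "NOUN", "_", "_", "1", "nmod", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"],
])

degrees = child_degrees(DEGREE_SAMPLE, "NUM")
print(dict(degrees))  # {3: 1, 0: 1}
print(max(degrees))   # 3
```

Here `degrees[0]` is the number of leaves, and `max(degrees)` is the highest child degree.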
Children of NUM nodes are attached using 25 different relations: nmod (117; 23% instances), punct (104; 20% instances), conj (85; 17% instances), flat (60; 12% instances), case (40; 8% instances), cc (31; 6% instances), dep (17; 3% instances), nsubj (14; 3% instances), advmod (9; 2% instances), mark (4; 1% instances), advcl (3; 1% instances), cop (3; 1% instances), nummod:gov (3; 1% instances), orphan (3; 1% instances), acl:relcl (2; 0% instances), det (2; 0% instances), nummod (2; 0% instances), obl (2; 0% instances), acl (1; 0% instances), amod (1; 0% instances), appos (1; 0% instances), iobj (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)
Children of NUM nodes belong to 15 different parts of speech: NUM (134; 26% instances), PUNCT (104; 20% instances), ADJ (68; 13% instances), ADP (58; 11% instances), NOUN (44; 9% instances), CCONJ (32; 6% instances), X (23; 5% instances), PROPN (14; 3% instances), PART (8; 2% instances), VERB (7; 1% instances), DET (5; 1% instances), SCONJ (4; 1% instances), AUX (3; 1% instances), PRON (3; 1% instances), ADV (1; 0% instances)