home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Coptic-Bohairic: POS Tags: NUM

There are 30 NUM lemmas (1%), 30 NUM types (1%) and 162 NUM tokens (0%). Out of 15 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: ⲟⲩⲁⲓ, ⲃ, ⲓⲃ, ⲅ, ϣⲟ, ⲍ, ⲣ, ⲉ, ⲋ, ⲇ

The 10 most frequent NUM types: ⲟⲩⲁⲓ, ⲃ, ⲓⲃ, ⲅ, ⲟⲩⲓ, ϣⲟ, ⲍ, ⲣ, ⲉ, ⲋ

The 10 most frequent ambiguous lemmas: ⲟⲩⲁⲓ (NUM 80, NOUN 4), ⲣ (NUM 5, VERB 1), ⲉ (ADP 622, PART 188, NUM 4, SCONJ 1), ⲓ (VERB 185, NUM 3, NOUN 1), ⲛ (ADP 1990, ADV 44, PART 22, NUM 2), ⲙⲏϯ (NOUN 8, NUM 1), ⲛⲟⲩϯ (NOUN 159, PRON 3, NUM 1), ⲥ (PRON 2, NUM 1), ⲱ (PART 14, CCONJ 1, NUM 1)

The 10 most frequent ambiguous types: ⲟⲩⲁⲓ (NUM 76, NOUN 4), ⲅ (NUM 6, PRON 1), ϣⲟ (NUM 5, NOUN 1), ⲣ (NUM 5, VERB 5), ⲉ (SCONJ 471, ADP 452, PART 206, PRON 5, NUM 4, AUX 1), ⲓ (PRON 243, VERB 152, NUM 3, NOUN 1), ⲙ (ADP 582, NUM 2, PART 1), ⲛ (ADP 1078, PRON 91, ADV 44, AUX 42, PART 21, DET 13, NUM 2), ⲕ (PRON 228, NUM 1), ⲙⲏϯ (NOUN 8, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.149363).

The 1st highest number of forms (2) was observed with the lemma “ⲟⲩⲁⲓ”: ⲟⲩⲁⲓ, ⲟⲩⲓ.

The 2nd highest number of forms (1) was observed with the lemma “ϣⲟ”: ϣⲟ.

The 3rd highest number of forms (1) was observed with the lemma “ⲃ”: ⲃ.

NUM occurs with 2 features: NumType (161; 99% instances), Foreign (1; 1% instances)

NUM occurs with 2 feature-value pairs: Foreign=Yes, NumType=Card

NUM occurs with 3 feature combinations. The most frequent feature combination is NumType=Card (160 tokens). Examples: ⲟⲩⲁⲓ, ⲃ, ⲓⲃ, ⲅ, ⲟⲩⲓ, ϣⲟ, ⲍ, ⲣ, ⲉ, ⲋ

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (46; 28% instances), nsubj (21; 13% instances), obl (19; 12% instances), nmod:unmarked (17; 10% instances), obj (12; 7% instances), dislocated (11; 7% instances), nmod (10; 6% instances), conj (7; 4% instances), orphan (6; 4% instances), ccomp (4; 2% instances), root (3; 2% instances), advcl (2; 1% instances), acl:relcl (1; 1% instances), appos (1; 1% instances), obl:unmarked (1; 1% instances), parataxis (1; 1% instances)

Parents of NUM nodes belong to 7 different parts of speech: VERB (65; 40% instances), NOUN (61; 38% instances), NUM (27; 17% instances), DET (3; 2% instances), (3; 2% instances), PRON (2; 1% instances), PROPN (1; 1% instances)

60 (37%) NUM nodes are leaves.

41 (25%) NUM nodes have one child.

37 (23%) NUM nodes have two children.

24 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 18 different relations: det (58; 28% instances), case (45; 22% instances), nmod (21; 10% instances), nmod:unmarked (19; 9% instances), acl:relcl (13; 6% instances), mark (8; 4% instances), advmod (6; 3% instances), cc (5; 2% instances), conj (5; 2% instances), cop (4; 2% instances), nummod (4; 2% instances), parataxis (4; 2% instances), appos (3; 1% instances), nsubj (3; 1% instances), orphan (3; 1% instances), punct (3; 1% instances), advcl (1; 0% instances), csubj (1; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: DET (61; 30% instances), ADP (42; 20% instances), NUM (27; 13% instances), NOUN (24; 12% instances), VERB (15; 7% instances), PRON (9; 4% instances), SCONJ (8; 4% instances), ADV (6; 3% instances), PART (5; 2% instances), CCONJ (3; 1% instances), PROPN (3; 1% instances), PUNCT (3; 1% instances)