Treebank Statistics: UD_Coptic-Scriptorium: POS Tags: DET
There are 29 DET lemmas (1%), 56 DET types (2%) and 7604 DET tokens (13%).
Out of 15 observed tags, the rank of DET is: 10 in number of lemmas, 7 in number of types and 4 in number of tokens.
The 10 most frequent DET lemmas: ⲡ, ⲟⲩ, ⲡⲉϥ, ⲡⲁ, ⲡⲁⲓ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲩ, ⲕⲉ
The 10 most frequent DET types: ⲡ, ⲧ, ⲟⲩ, ⲛ, ⲡⲉ, ϩⲉⲛ, ⲡⲁ, ⲡⲉϥ, ⲧⲉ, ⲡⲁⲓ
The 10 most frequent ambiguous lemmas: ⲡ (DET 4472, VERB 1), ⲟⲩ (DET 985, PRON 109, ADV 8), ⲡⲉⲛ (DET 147, NOUN 2), ⲕⲉ (DET 97, NOUN 29), ϭⲉ (PART 55, DET 7, ADV 3), ϯ (VERB 190, DET 4, NOUN 2), ⲛⲁ (ADP 598, AUX 428, NOUN 15, VERB 12, DET 2, ADV 1, PART 1), ϩⲛ (ADP 1002, DET 1, PART 1), ⲛ (ADP 3926, ADV 134, PART 7, DET 1, NUM 1)
The 10 most frequent ambiguous types: ⲡ (DET 2350, PRON 67), ⲧ (DET 739, PRON 56), ⲟⲩ (PRON 708, DET 677, ADV 8), ⲛ (ADP 2312, DET 672, PRON 371, AUX 345, ADV 123, VERB 6, PART 4, NUM 1, X 1), ⲡⲉ (DET 326, PRON 286, NOUN 19, PART 2, VERB 1), ϩⲉⲛ (DET 211, ADP 1), ⲧⲉ (DET 164, PRON 110, VERB 1), ⲛⲉ (DET 154, AUX 149, PRON 59, PART 1), ⲩ (PRON 982, DET 97), ⲕⲉ (DET 94, NOUN 2)
- ⲡ
- ⲧ
- ⲟⲩ
- ⲛ
- ADP 2312: ⲉ ⲩ ⲧⲛⲧⲱⲛ ⲉ ϩⲉⲛ ϩⲏⲃⲥ ⲉ ⲁ ⲩ ϫⲉⲣⲱ ⲟⲩ ϩⲛ ϩⲉⲛ ⲙⲁ ⲛ ⲕⲁⲕⲉ ·
- DET 672: ⲁⲩⲱ ⲛ ⲁⲧϩⲏⲧ ⲉⲧ ⲣϩⲟⲩⲟ ⲉⲙⲁⲧⲉ ϩⲛ ϩⲉⲛ ⲙⲛⲧⲥⲟϭ · ⲉ ⲩ ⲧⲛⲧⲱⲛ ⲉ ϩⲉⲛ ⲛⲩⲕⲧⲉⲣⲓⲥ ·
- PRON 371: ⲁⲗⲗⲁ ⲉϥⲉ ϣⲱⲡⲉ ⲛⲁ ⲛ ⲛ ⲟⲩ ϫⲁϫⲉ ⲉⲧⲃⲉ ⲡ ⲛⲟⲩⲧⲉ ⲉ ⲡⲉⲛ ⲥⲟⲛ ⲡⲉ
- AUX 345: ⲁⲗⲗⲁ ⲉⲣⲉ ⲛⲉⲩ ⲥⲁⲛⲇⲁⲗⲓⲟⲛ ⲟⲧϩ ⲉⲣⲁⲧ ⲟⲩ . ⲟⲩⲇⲉ ⲛ ⲥⲉ ⲧⲙ ϯ ϣⲧⲏⲛ ⲥⲛⲧⲉ ϩⲓⲱ ⲟⲩ
- ADV 123: ⲛ ⲧⲉⲧⲛ ϩⲁⲣⲉϩ ⲅⲁⲣ ⲁⲛ .
- VERB 6: ⲉⲣϣⲁⲛ ⲡ ⲕⲁⲣⲡⲟⲥ ⲇⲉ ⲡⲱϩ ⲛ ⲧⲉ ⲩⲛⲟⲩ ϣⲁ ϥ ⲛ ⲡ ⲟϩⲥ ϫⲉ ⲁ ⲡ ⲧⲏ ⲙ ⲡ ⲱϩⲥ ϣⲱⲡⲉ
- PART 4: ϩⲱⲥⲧⲉ · ϣⲉ ϫⲟⲩⲱⲧ ⲛ ⲕⲉⲛⲧⲏⲛⲁⲣⲓⲟⲛ ⲛ ⲛⲟⲩⲃ ⲛ ⲥⲉ ⲧⲁⲁ ⲩ ⲛ ⲭⲁⲣⲓⲥⲙⲁ ⲙ ⲡ ⲁⲡⲟⲗⲗⲱⲛ ·
- NUM 1: ⲁⲩⲱ ⲧⲉ ⲥϩⲓⲙⲉ ⲉ ⲧⲉⲧⲛ ⲛⲁⲩ ⲉⲣⲟ ⲥ ⲧⲁϣ ⲛ ⲟⲩⲁ ⲧⲉ ·
- X 1: ⲁⲩⲱ ⲁ ⲥ ϣⲱⲡⲉ ⲉⲣⲉ ⲡ ⲣⲏ ⲛⲁ ϣⲁ ⲡ ⲛⲟⲩⲧⲉ ⲁ ϥ ⲟⲩⲉϩⲥⲁϩⲛⲉ ⲛ ⲟⲩ ⲧⲏⲩ ⲉ ϥ ⲣⲟⲕϩ ⲁⲩⲱ ⲛ .. ⲣⲟⲟⲃ ⲁⲩⲱ ⲁ ⲡ ⲣⲏ ϩⲓⲟⲩⲉ ⲉ ⲧ ⲁⲡⲉ ⲛ ⲓⲱⲛⲁ ⲁ ϥ ϣⲱⲥⲙ ⲛ ϩⲏⲧ ⲁⲩⲱ ⲁ ϥ ⲕⲁ ⲧⲟⲟⲧ ϥ ⲉⲃⲟⲗ ⲡⲉϫⲁ ϥ ϫⲉ ⲛⲁⲛⲟⲩ ⲥ ⲛⲁ ⲓ ⲉ ⲙⲟⲩ ⲉϩⲟⲩⲉ ⲱⲛϩ :
- ⲡⲉ
- ϩⲉⲛ
- ⲧⲉ
- ⲛⲉ
- ⲩ
- ⲕⲉ
Morphology
The form / lemma ratio of DET is 1.931034 (the average of all parts of speech is 1.141945).
The 1st highest number of forms (8) was observed with the lemma “ⲡ”: ⲑ, ⲙ, ⲛ, ⲛⲉ, ⲡ, ⲡⲉ, ⲧ, ⲧⲉ.
The 2nd highest number of forms (4) was observed with the lemma “ⲡⲁ”: ⲛⲁ, ⲛⲁⲓ, ⲡⲁ, ⲧⲁ.
The 3rd highest number of forms (4) was observed with the lemma “ⲡⲉⲓ”: ⲛⲉⲓ, ⲡⲉⲓ, ⲡⲓ, ⲧⲉⲓ.
DET occurs with 9 features: PronType (7596; 100% instances), Definite (7492; 99% instances), Number (7492; 99% instances), Gender (5030; 66% instances), Poss (1447; 19% instances), Number[psor] (1396; 18% instances), Person (1396; 18% instances), Gender[psor] (740; 10% instances), Foreign (1; 0% instances)
DET occurs with 18 feature-value pairs: Definite=Def, Definite=Ind, Foreign=Yes, Gender=Fem, Gender=Masc, Gender[psor]=Fem, Gender[psor]=Masc, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Prs
DET occurs with 37 feature combinations.
The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (2675 tokens).
Examples: ⲡ, ⲡⲉ
Relations
DET nodes are attached to their parents using 23 different relations: det (5295; 70% instances), nmod:poss (1384; 18% instances), obl (203; 3% instances), dislocated (156; 2% instances), root (97; 1% instances), obj (94; 1% instances), nsubj (77; 1% instances), appos (76; 1% instances), nmod (74; 1% instances), conj (57; 1% instances), acl:relcl (28; 0% instances), parataxis (20; 0% instances), ccomp (18; 0% instances), advcl (9; 0% instances), vocative (5; 0% instances), csubj (3; 0% instances), xcomp (2; 0% instances), advmod (1; 0% instances), compound (1; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances), iobj (1; 0% instances), reparandum (1; 0% instances)
Parents of DET nodes belong to 11 different parts of speech: NOUN (6609; 87% instances), VERB (576; 8% instances), PROPN (117; 2% instances), (97; 1% instances), NUM (96; 1% instances), DET (57; 1% instances), PRON (40; 1% instances), X (5; 0% instances), ADV (3; 0% instances), PART (3; 0% instances), ADP (1; 0% instances)
6756 (89%) DET nodes are leaves.
297 (4%) DET nodes have one child.
344 (5%) DET nodes have two children.
207 (3%) DET nodes have three or more children.
The highest child degree of a DET node is 10.
Children of DET nodes are attached using 26 different relations: acl:relcl (615; 35% instances), case (370; 21% instances), punct (182; 10% instances), cop (96; 5% instances), nsubj (78; 4% instances), cc (67; 4% instances), nmod (67; 4% instances), mark (63; 4% instances), advmod (58; 3% instances), conj (36; 2% instances), parataxis (29; 2% instances), obl:unmarked (22; 1% instances), advcl (18; 1% instances), csubj (17; 1% instances), appos (14; 1% instances), discourse (8; 0% instances), dislocated (7; 0% instances), obl (7; 0% instances), vocative (6; 0% instances), det (4; 0% instances), orphan (3; 0% instances), ccomp (2; 0% instances), amod (1; 0% instances), aux (1; 0% instances), nmod:poss (1; 0% instances), xcomp (1; 0% instances)
Children of DET nodes belong to 14 different parts of speech: VERB (604; 34% instances), ADP (364; 21% instances), NOUN (184; 10% instances), PUNCT (182; 10% instances), PRON (138; 8% instances), ADV (70; 4% instances), SCONJ (64; 4% instances), DET (57; 3% instances), PART (47; 3% instances), CCONJ (45; 3% instances), PROPN (15; 1% instances), ADJ (1; 0% instances), AUX (1; 0% instances), NUM (1; 0% instances)