Treebank Statistics: UD_Coptic-Scriptorium: POS Tags: DET
There are 29 DET
lemmas (1%), 56 DET
types (2%) and 7376 DET
tokens (13%).
Out of 15 observed tags, the rank of DET
is: 10 in number of lemmas, 7 in number of types and 4 in number of tokens.
The 10 most frequent DET
lemmas: ⲡ, ⲟⲩ, ⲡⲉϥ, ⲡⲁ, ⲡⲁⲓ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲩ, ⲕⲉ
The 10 most frequent DET
types: ⲡ, ⲧ, ⲛ, ⲟⲩ, ⲡⲉ, ϩⲉⲛ, ⲡⲁ, ⲡⲉϥ, ⲧⲉ, ⲡⲁⲓ
The 10 most frequent ambiguous lemmas: ⲟⲩ (DET 948, PRON 102, ADV 7, X 1), ⲡⲉⲛ (DET 147, NOUN 2), ⲕⲉ (DET 97, NOUN 29), ϭⲉ (PART 53, DET 7, ADV 3), ϯ (VERB 186, DET 4, NOUN 2), ⲛ (ADP 3796, ADV 133, PART 8, DET 4, NUM 1), ⲛⲁ (ADP 580, AUX 415, NOUN 13, VERB 12, DET 2, ADV 1, PART 1), ϩⲛ (ADP 976, DET 1, PART 1)
The 10 most frequent ambiguous types: ⲡ (DET 2264, PRON 65), ⲧ (DET 720, PRON 51), ⲛ (ADP 2227, DET 651, PRON 361, AUX 335, ADV 122, VERB 6, PART 5, NUM 1), ⲟⲩ (PRON 685, DET 649, ADV 7, X 1), ⲡⲉ (DET 321, PRON 285, NOUN 18, PART 2, VERB 1), ϩⲉⲛ (DET 206, ADP 1), ⲧⲉ (DET 158, PRON 108, VERB 1), ⲛⲉ (DET 153, AUX 145, PRON 58, PART 1), ⲕⲉ (DET 94, NOUN 2), ⲩ (PRON 943, DET 93)
- ⲡ
- ⲧ
- ⲛ
- ADP 2227: ⲉ ⲩ ⲧⲛⲧⲱⲛ ⲉ ϩⲉⲛ ϩⲏⲃⲥ ⲉ ⲁ ⲩ ϫⲉⲣⲱ ⲟⲩ ϩⲛ ϩⲉⲛ ⲙⲁ ⲛ ⲕⲁⲕⲉ ·
- DET 651: ⲁⲩⲱ ⲛ ⲁⲧϩⲏⲧ ⲉⲧ ⲣϩⲟⲩⲟ ⲉⲙⲁⲧⲉ ϩⲛ ϩⲉⲛ ⲙⲛⲧⲥⲟϭ · ⲉ ⲩ ⲧⲛⲧⲱⲛ ⲉ ϩⲉⲛ ⲛⲩⲕⲧⲉⲣⲓⲥ ·
- PRON 361: ⲁⲗⲗⲁ ⲉϥⲉ ϣⲱⲡⲉ ⲛⲁ ⲛ ⲛ ⲟⲩ ϫⲁϫⲉ ⲉⲧⲃⲉ ⲡ ⲛⲟⲩⲧⲉ ⲉ ⲡⲉⲛ ⲥⲟⲛ ⲡⲉ
- AUX 335: ⲁⲗⲗⲁ ⲉⲣⲉ ⲛⲉⲩ ⲥⲁⲛⲇⲁⲗⲓⲟⲛ ⲟⲧϩ ⲉⲣⲁⲧ ⲟⲩ . ⲟⲩⲇⲉ ⲛ ⲥⲉ ⲧⲙ ϯ ϣⲧⲏⲛ ⲥⲛⲧⲉ ϩⲓⲱ ⲟⲩ
- ADV 122: ⲛ ⲧⲉⲧⲛ ϩⲁⲣⲉϩ ⲅⲁⲣ ⲁⲛ .
- VERB 6: ⲉⲣϣⲁⲛ ⲡ ⲕⲁⲣⲡⲟⲥ ⲇⲉ ⲡⲱϩ ⲛ ⲧⲉ ⲩⲛⲟⲩ ϣⲁ ϥ ⲛ ⲡ ⲟϩⲥ ϫⲉ ⲁ ⲡ ⲧⲏ ⲙ ⲡ ⲱϩⲥ ϣⲱⲡⲉ
- PART 5: ϩⲱⲥⲧⲉ · ϣⲉ ϫⲟⲩⲱⲧ ⲛ ⲕⲉⲛⲧⲏⲛⲁⲣⲓⲟⲛ ⲛ ⲛⲟⲩⲃ ⲛ ⲥⲉ ⲧⲁⲁ ⲩ ⲛ ⲭⲁⲣⲓⲥⲙⲁ ⲙ ⲡ ⲁⲡⲟⲗⲗⲱⲛ ·
- NUM 1: ⲁⲩⲱ ⲧⲉ ⲥϩⲓⲙⲉ ⲉ ⲧⲉⲧⲛ ⲛⲁⲩ ⲉⲣⲟ ⲥ ⲧⲁϣ ⲛ ⲟⲩⲁ ⲧⲉ ·
- ⲟⲩ
- ⲡⲉ
- ϩⲉⲛ
- ⲧⲉ
- ⲛⲉ
- ⲕⲉ
- ⲩ
Morphology
The form / lemma ratio of DET
is 1.931034 (the average of all parts of speech is 1.137647).
The 1st highest number of forms (8) was observed with the lemma “ⲡ”: ⲑ, ⲙ, ⲛ, ⲛⲉ, ⲡ, ⲡⲉ, ⲧ, ⲧⲉ.
The 2nd highest number of forms (4) was observed with the lemma “ⲡⲁ”: ⲛⲁ, ⲛⲁⲓ, ⲡⲁ, ⲧⲁ.
The 3rd highest number of forms (4) was observed with the lemma “ⲡⲉⲓ”: ⲛⲉⲓ, ⲡⲉⲓ, ⲡⲓ, ⲧⲉⲓ.
DET
occurs with 9 features: PronType (7346; 100% instances), Definite (7242; 98% instances), Number (7242; 98% instances), Gender (4854; 66% instances), Poss (1406; 19% instances), Number[psor] (1356; 18% instances), Person (1356; 18% instances), Gender[psor] (719; 10% instances), Foreign (1; 0% instances)
DET
occurs with 18 feature-value pairs: Definite=Def
, Definite=Ind
, Foreign=Yes
, Gender=Fem
, Gender=Masc
, Gender[psor]=Fem
, Gender[psor]=Masc
, Number=Plur
, Number=Sing
, Number[psor]=Plur
, Number[psor]=Sing
, Person=1
, Person=2
, Person=3
, Poss=Yes
, PronType=Art
, PronType=Dem
, PronType=Prs
DET
occurs with 37 feature combinations.
The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art
(2584 tokens).
Examples: ⲡ, ⲡⲉ
Relations
DET
nodes are attached to their parents using 23 different relations: det (5120; 69% instances), nmod:poss (1343; 18% instances), obl (201; 3% instances), dislocated (153; 2% instances), root (97; 1% instances), obj (92; 1% instances), nsubj (77; 1% instances), appos (74; 1% instances), nmod (73; 1% instances), conj (56; 1% instances), acl:relcl (28; 0% instances), parataxis (20; 0% instances), ccomp (17; 0% instances), advcl (9; 0% instances), vocative (5; 0% instances), csubj (3; 0% instances), xcomp (2; 0% instances), advmod (1; 0% instances), compound (1; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances), iobj (1; 0% instances), reparandum (1; 0% instances)
Parents of DET
nodes belong to 11 different parts of speech: NOUN (6400; 87% instances), VERB (566; 8% instances), PROPN (115; 2% instances), (97; 1% instances), NUM (89; 1% instances), DET (57; 1% instances), PRON (40; 1% instances), X (5; 0% instances), PART (4; 0% instances), ADV (2; 0% instances), ADP (1; 0% instances)
6539 (89%) DET
nodes are leaves.
292 (4%) DET
nodes have one child.
341 (5%) DET
nodes have two children.
204 (3%) DET
nodes have three or more children.
The highest child degree of a DET
node is 10.
Children of DET
nodes are attached using 26 different relations: acl:relcl (607; 35% instances), case (366; 21% instances), punct (180; 10% instances), cop (95; 5% instances), nsubj (77; 4% instances), cc (66; 4% instances), nmod (66; 4% instances), mark (62; 4% instances), advmod (54; 3% instances), conj (36; 2% instances), parataxis (28; 2% instances), obl:unmarked (22; 1% instances), advcl (18; 1% instances), csubj (17; 1% instances), appos (15; 1% instances), discourse (8; 0% instances), dislocated (7; 0% instances), obl (7; 0% instances), vocative (5; 0% instances), det (4; 0% instances), orphan (3; 0% instances), ccomp (2; 0% instances), amod (1; 0% instances), aux (1; 0% instances), nmod:poss (1; 0% instances), xcomp (1; 0% instances)
Children of DET
nodes belong to 14 different parts of speech: VERB (596; 34% instances), ADP (360; 21% instances), NOUN (181; 10% instances), PUNCT (180; 10% instances), PRON (136; 8% instances), ADV (67; 4% instances), SCONJ (63; 4% instances), DET (57; 3% instances), PART (47; 3% instances), CCONJ (44; 3% instances), PROPN (15; 1% instances), ADJ (1; 0% instances), AUX (1; 0% instances), NUM (1; 0% instances)