home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Coptic-Scriptorium: POS Tags: PART

There are 37 PART lemmas (1%), 42 PART types (1%) and 2075 PART tokens (4%). Out of 15 observed tags, the rank of PART is: 8 in number of lemmas, 9 in number of types and 9 in number of tokens.

The 10 most frequent PART lemmas: ⲇⲉ, ⲉ, ⲉⲣⲉ, ⲛϭⲓ, ⲅⲁⲣ, ϭⲉ, ⲱ, ⲉⲓⲥ, ⲙⲉⲛ, ϩⲁⲙⲏⲛ

The 10 most frequent PART types: ⲉ, ⲇⲉ, ⲛϭⲓ, ⲅⲁⲣ, ⲛⲧ, ϭⲉ, ⲱ, ⲉⲓⲥ, ⲉⲣⲉ, ⲙⲉⲛ

The 10 most frequent ambiguous lemmas: ⲉ (ADP 1194, PART 408, SCONJ 4, X 1), ⲉⲣⲉ (SCONJ 888, PART 338, AUX 11, ADP 2), ⲅⲁⲣ (PART 207, SCONJ 1), ϭⲉ (PART 55, DET 7, ADV 3), ⲙⲉⲛ (PART 30, CCONJ 2), ⲟⲩⲛ (VERB 33, AUX 15, PART 14), ⲙⲛⲛⲥⲁ (ADP 29, PART 13), ϩⲏⲏⲧⲉ (PART 12, NOUN 2), ⲟⲩⲟⲓ (NOUN 10, PART 10), ⲛ (ADP 3926, ADV 134, PART 7, DET 1, NUM 1)

The 10 most frequent ambiguous types: ⲉ (SCONJ 861, ADP 824, PART 653, PRON 50, AUX 7, NOUN 1, X 1), ⲅⲁⲣ (PART 207, SCONJ 1), ⲛⲧ (SCONJ 105, PART 58, VERB 9), ϭⲉ (PART 55, DET 10, ADV 3), ⲉⲣⲉ (SCONJ 55, PART 35, AUX 11, PRON 6), ⲙⲉⲛ (PART 30, CCONJ 2), ⲟⲩⲛ (VERB 24, PART 14, AUX 11), ⲙⲛⲛⲥⲁ (ADP 15, PART 13), ϩⲏⲏⲧⲉ (PART 12, NOUN 2), ⲟⲩⲟⲓ (NOUN 10, PART 8)

Morphology

The form / lemma ratio of PART is 1.135135 (the average of all parts of speech is 1.141945).

The 1st highest number of forms (4) was observed with the lemma “ⲉⲣⲉ”: ⲉ, ⲉⲛⲧ, ⲉⲣⲉ, ⲛⲧ.

The 2nd highest number of forms (3) was observed with the lemma “ⲛ”: ⲙ, ⲙⲙⲟ, ⲛ.

The 3rd highest number of forms (2) was observed with the lemma “ⲉⲛⲉ”: ⲉⲛⲉ, ⲛⲉ.

PART occurs with 4 features: Foreign (920; 44% instances), Emph (339; 16% instances), ExtPos (15; 1% instances), Polarity (10; 0% instances)

PART occurs with 4 feature-value pairs: Emph=Yes, ExtPos=ADV, Foreign=Yes, Polarity=Neg

PART occurs with 7 feature combinations. The most frequent feature combination is Foreign=Yes (918 tokens). Examples: ⲇⲉ, ⲅⲁⲣ, ⲱ, ⲙⲉⲛ, ϩⲁⲙⲏⲛ, ⲁⲣⲁ, ⲟⲩⲛ, ⲭⲁⲓⲣⲉ, ⲭⲱⲣⲓⲥ, ⲁⲛⲧⲓ

Relations

PART nodes are attached to their parents using 12 different relations: advmod (962; 46% instances), mark (779; 38% instances), case (224; 11% instances), discourse (68; 3% instances), fixed (14; 1% instances), ccomp (10; 0% instances), root (8; 0% instances), advcl (6; 0% instances), cc (1; 0% instances), conj (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)

Parents of PART nodes belong to 10 different parts of speech: VERB (1574; 76% instances), NOUN (323; 16% instances), PROPN (53; 3% instances), DET (47; 2% instances), PRON (38; 2% instances), PART (14; 1% instances), NUM (9; 0% instances), (8; 0% instances), ADV (7; 0% instances), CCONJ (2; 0% instances)

2031 (98%) PART nodes are leaves.

27 (1%) PART nodes have one child.

4 (0%) PART nodes have two children.

13 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 4.

Children of PART nodes are attached using 13 different relations: fixed (15; 19% instances), mark (14; 18% instances), vocative (14; 18% instances), punct (12; 15% instances), obl (11; 14% instances), parataxis (3; 4% instances), advcl (2; 3% instances), ccomp (2; 3% instances), advmod (1; 1% instances), cc (1; 1% instances), dislocated (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances)

Children of PART nodes belong to 9 different parts of speech: NOUN (16; 21% instances), PART (14; 18% instances), SCONJ (14; 18% instances), PUNCT (12; 15% instances), PRON (11; 14% instances), VERB (5; 6% instances), DET (3; 4% instances), PROPN (2; 3% instances), CCONJ (1; 1% instances)