
This page pertains to UD version 2.

Treebank Statistics: UD_Coptic-Scriptorium: POS Tags: PART

There are 42 PART lemmas (2%), 44 PART types (2%) and 1360 PART tokens (3%). Out of 15 observed tags, the rank of PART is: 8 in number of lemmas, 9 in number of types and 11 in number of tokens.
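As a rough illustration of how such lemma, type and token counts could be derived, the sketch below scans CoNLL-U text for one UPOS tag. It assumes that types are distinct word forms compared exactly, with no case folding, which may not match how this page was generated. The function name `count_upos` and the four toy rows are hypothetical and not taken from the treebank; `x` is a placeholder form.

```python
def count_upos(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Toy rows invented for illustration only (not a real treebank sentence).
ROWS = [
    ("1", "ⲉ", "ⲉⲣⲉ", "PART", "_", "_", "2", "mark", "_", "_"),
    ("2", "x", "x", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "ⲇⲉ", "ⲇⲉ", "PART", "_", "_", "2", "advmod", "_", "_"),
    ("4", "ⲉⲣⲉ", "ⲉⲣⲉ", "PART", "_", "_", "2", "mark", "_", "_"),
]
SAMPLE = "# sent_id = toy-1\n" + "\n".join("\t".join(r) for r in ROWS) + "\n"

print(count_upos(SAMPLE, "PART"))  # (2, 3, 3)
```

In the toy sample, the lemma ⲉⲣⲉ appears with two forms, so the type count exceeds the lemma count.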

The 10 most frequent PART lemmas: ⲇⲉ, ⲉ, ⲉⲣⲉ, ⲅⲁⲣ, ⲛϭⲓ, ϭⲉ, ⲉⲓⲥ, ⲱ, ⲙⲉⲛ, ϩⲁⲙⲏⲛ

The 10 most frequent PART types: ⲉ, ⲇⲉ, ⲅⲁⲣ, ⲛϭⲓ, ϭⲉ, ⲛⲧ, ⲉⲓⲥ, ⲉⲣⲉ, ⲱ, ⲙⲉⲛ

The 10 most frequent ambiguous lemmas: ⲉ (ADP 807, PART 250, SCONJ 12, CCONJ 1, X 1), ⲉⲣⲉ (SCONJ 638, PART 240, ADP 4, AUX 2, PRON 1), ⲅⲁⲣ (PART 138, CCONJ 1), ϭⲉ (PART 39, DET 9, ADV 2), ⲙⲉⲛ (PART 15, CCONJ 2), ⲉⲧⲉⲣⲉ (SCONJ 879, PART 8, AUX 1), ⲛ (ADP 2624, ADV 88, DET 14, PART 8, AUX 5, NOUN 1, PRON 1), ⲙⲛⲛⲥⲁ (ADP 18, PART 7, CCONJ 1), ⲁⲣⲁ (PART 6, CCONJ 1), ⲉⲧⲃⲉ (ADP 110, PART 5)

The 10 most frequent ambiguous types: ⲉ (SCONJ 636, ADP 554, PART 433, PRON 21, CCONJ 1, X 1), ⲅⲁⲣ (PART 138, CCONJ 1), ϭⲉ (PART 39, DET 9, ADV 2), ⲛⲧ (SCONJ 78, PART 39, VERB 7), ⲉⲣⲉ (SCONJ 45, PART 25, PRON 3, AUX 2), ⲙⲉⲛ (PART 15, CCONJ 2), ⲙⲛⲛⲥⲁ (ADP 9, PART 7, CCONJ 1), ⲁⲣⲁ (PART 6, CCONJ 1), ⲉⲧⲃⲉ (ADP 96, PART 5), ⲛ (ADP 1575, DET 488, PRON 246, AUX 218, ADV 93, PART 5, VERB 4, NOUN 1)
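An ambiguous type is a word form that occurs with more than one tag. A minimal sketch of how such a list might be built from (form, tag) pairs follows; the function name `ambiguous_forms` and the toy pairs are hypothetical, and the sketch does not reproduce the counts above.

```python
from collections import Counter, defaultdict


def ambiguous_forms(tagged):
    """Map each form seen with more than one tag to its (tag, count) list."""
    by_form = defaultdict(Counter)
    for form, upos in tagged:
        by_form[form][upos] += 1
    # Keep only forms with at least two distinct tags, most frequent tag first.
    return {f: c.most_common() for f, c in by_form.items() if len(c) > 1}


# Toy pairs invented for illustration only.
TOY = [("ⲉ", "SCONJ"), ("ⲉ", "SCONJ"), ("ⲉ", "PART"), ("ⲇⲉ", "PART")]
print(ambiguous_forms(TOY))  # {'ⲉ': [('SCONJ', 2), ('PART', 1)]}
```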

Morphology

The form / lemma ratio of PART is 1.047619 (the average of all parts of speech is 1.112443).
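The reported ratio is consistent with dividing the 44 PART types by the 42 PART lemmas given at the top of the page. That this is how the figure was produced is an inference from the numbers, not something the page states. A short check:

```python
# Counts taken from the summary at the top of this page.
part_types = 44
part_lemmas = 42

ratio = part_types / part_lemmas
print(f"{ratio:.6f}")  # 1.047619
```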

The top three lemmas by number of forms each have 3 forms: “ⲉⲣⲉ” (ⲉ, ⲉⲣⲉ, ⲛⲧ), “ⲉⲧⲉⲣⲉ” (ⲉⲛⲧ, ⲉⲧ, ⲛⲧ) and “ⲛ” (ⲙ, ⲙⲙⲟ, ⲛ).

PART occurs with 1 feature: Polarity (4; 0% instances)

PART occurs with 1 feature-value pair: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _, that is, no features (1356 tokens). Examples: ⲉ, ⲇⲉ, ⲅⲁⲣ, ⲛϭⲓ, ϭⲉ, ⲛⲧ, ⲉⲓⲥ, ⲉⲣⲉ, ⲱ, ⲙⲉⲛ

Relations

PART nodes are attached to their parents using 12 different relations: advmod (640; 47% instances), mark (529; 39% instances), case (138; 10% instances), discourse (32; 2% instances), ccomp (8; 1% instances), fixed (4; 0% instances), advcl (3; 0% instances), root (2; 0% instances), amod (1; 0% instances), cc (1; 0% instances), conj (1; 0% instances), orphan (1; 0% instances)
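The percentages in the relation list can be re-derived from the raw counts. The sketch below assumes each percentage is the count divided by the total number of PART tokens, rounded to the nearest integer; the rounding rule is an assumption, though it reproduces every figure listed.

```python
# Counts copied from the relation list above.
relation_counts = {
    "advmod": 640, "mark": 529, "case": 138, "discourse": 32,
    "ccomp": 8, "fixed": 4, "advcl": 3, "root": 2,
    "amod": 1, "cc": 1, "conj": 1, "orphan": 1,
}

total = sum(relation_counts.values())  # expected to equal the PART token count
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)  # 1360
for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```

The total of 1360 matches the PART token count reported at the top of the page, so every PART token is accounted for by exactly one incoming relation.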

Parents of PART nodes belong to 11 different parts of speech: VERB (1054; 78% instances), NOUN (210; 15% instances), DET (31; 2% instances), PRON (30; 2% instances), PROPN (20; 1% instances), NUM (5; 0% instances), PART (5; 0% instances), no POS, i.e. the artificial root node (2; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)

1331 (98%) PART nodes are leaves.

18 (1%) PART nodes have one child.

4 (0%) PART nodes have two children.

7 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 4.
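The leaf and child-degree figures above can be thought of as a histogram over the number of dependents per node. The following sketch shows one way to compute such a histogram from a head map; the function name `child_degree_histogram` and the toy tree are hypothetical. It also checks that the four buckets reported above add up to the 1360 PART tokens.

```python
from collections import Counter


def child_degree_histogram(heads, nodes):
    """Bucket the given nodes by child count: '0', '1', '2' or '3+'."""
    # heads maps each node id to its head id (0 is the artificial root).
    child_counts = Counter(heads.values())
    hist = Counter()
    for node in nodes:
        n = child_counts[node]  # Counter yields 0 for nodes with no children
        hist["3+" if n >= 3 else str(n)] += 1
    return dict(hist)


# Toy tree invented for illustration: node 2 is the root, node 5 depends on 4.
toy_heads = {1: 2, 2: 0, 3: 2, 4: 2, 5: 4, 6: 2}
toy_part_nodes = [1, 3, 4]  # pretend these three nodes are tagged PART
print(child_degree_histogram(toy_heads, toy_part_nodes))  # {'0': 2, '1': 1}

# Consistency check on the figures reported above.
reported = {"0": 1331, "1": 18, "2": 4, "3+": 7}
print(sum(reported.values()))  # 1360
```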

Children of PART nodes are attached using 13 different relations: fixed (10; 20% instances), mark (10; 20% instances), punct (9; 18% instances), vocative (7; 14% instances), obl (3; 6% instances), parataxis (3; 6% instances), advcl (2; 4% instances), advmod (1; 2% instances), cc (1; 2% instances), ccomp (1; 2% instances), dislocated (1; 2% instances), nmod (1; 2% instances), nsubj (1; 2% instances)

Children of PART nodes belong to 8 different parts of speech: CCONJ (15; 30% instances), NOUN (12; 24% instances), PUNCT (9; 18% instances), PART (5; 10% instances), VERB (4; 8% instances), PRON (3; 6% instances), DET (1; 2% instances), PROPN (1; 2% instances)