This page pertains to UD version 2.

Treebank Statistics: UD_Pesh-ChibErgIS: POS Tags: PART

There are 65 PART lemmas (9%), 80 PART types (7%) and 557 PART tokens (13%). Out of 15 observed tags, the rank of PART is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent PART lemmas: _, =ma, =kan, =na, =pe, =hã, ãma, ĩka, =ras, =sa

The 10 most frequent PART types: =ma, =kan, =hã, =na, =pe, ãma, =ras, =ken, =sa, ĩka

The 10 most frequent ambiguous lemmas: _ (PUNCT 681, PART 188, ADP 88, AUX 46, SCONJ 37, DET 6, PRON 4, X 3), =ma (PART 57, SCONJ 26, ADP 14), =kan (PART 47, ADP 9, SCONJ 4), =na (PART 23, DET 1), ãma (PART 19, INTJ 1), ĩka (PART 15, CCONJ 2), =ras (PART 13, SCONJ 13), =mã (SCONJ 14, PART 12, ADP 3), =ra (ADP 12, PART 6, SCONJ 3), =ri (ADP 7, PART 4, CCONJ 1)

The 10 most frequent ambiguous types: =ma (PART 115, SCONJ 37, ADP 14), =kan (PART 66, ADP 10, SCONJ 5), =na (PART 26, DET 2), ãma (PART 19, INTJ 1), =ras (SCONJ 32, PART 18), =ken (PART 17, ADP 12, SCONJ 3), ĩka (PART 15, CCONJ 2), =mã (SCONJ 14, PART 13, ADP 3), =ra (ADP 18, PART 11, SCONJ 3), =ro (ADP 12, PART 6, SCONJ 3)
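An ambiguous lemma (or type) here is one that occurs with PART and with at least one other tag; both lists above appear to be ordered by their PART count. The following is a minimal sketch of how such a list could be derived, not the script that generated this page. The function name `ambiguous_lemmas` and its input format, a sequence of `(lemma, upos)` pairs, are assumptions made for illustration.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(tokens, upos="PART"):
    """Return (lemma, tag_counts) pairs for lemmas seen with `upos`
    and at least one other tag, sorted by their `upos` count (descending).

    `tokens` is a hypothetical input: an iterable of (lemma, upos) pairs.
    """
    by_lemma = defaultdict(Counter)
    for lemma, tag in tokens:
        by_lemma[lemma][tag] += 1
    hits = [
        (lemma, tags)
        for lemma, tags in by_lemma.items()
        if len(tags) > 1 and tags[upos] > 0
    ]
    hits.sort(key=lambda item: -item[1][upos])
    return hits
```

Passing `(form, upos)` pairs instead of `(lemma, upos)` pairs would give the corresponding list of ambiguous types.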

Morphology

The form / lemma ratio of PART is 1.230769 (the average of all parts of speech is 1.743590).
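The ratio is consistent with dividing the number of distinct PART types by the number of distinct PART lemmas reported above (80 / 65). As a hedged sketch, the same figure could be computed from CoNLL-U text as shown below; the function name `form_lemma_ratio` is hypothetical, and the actual statistics script may treat details such as case normalization differently.

```python
def form_lemma_ratio(conllu_text, upos="PART"):
    """Distinct forms divided by distinct lemmas for one UPOS tag.

    Assumes standard 10-column CoNLL-U lines; multiword-token ranges
    (e.g. "1-2") and empty nodes (e.g. "1.1") are skipped.
    """
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])
            lemmas.add(cols[2])
    return len(forms) / len(lemmas) if lemmas else 0.0

# Check against the counts reported on this page: 80 types, 65 lemmas.
print(round(80 / 65, 6))  # 1.230769
```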

The highest number of forms (35) was observed with the lemma “_”: =eri, =ha, =hã, =hãʔ, =hĩ, =kan, =kanke, =ken, =ma, =mã, =na, =ni, =pa, =pe, =pera, =pero, =pes, =pra, =ra, =ras, =rike, =rina, =riʃka, =ro, =sa, =sah, =tVh, =wi, =wĩ, =wĩʔ, =ʃona, =ʃonwa, =ʔi, =ʔã, =ʔĩ.

The 2nd highest number of forms (2) was observed with the lemma “=kan”: =kan, =kanka.

The 3rd highest number of forms (2) was observed with the lemma “nĩhã”: nĩhã, ũtanĩhã.

PART occurs with 2 features: Clusivity (1; 0% instances), PronType (1; 0% instances)

PART occurs with 2 feature-value pairs: Clusivity=Ex, PronType=Int

PART occurs with 3 feature combinations. The most frequent feature combination is _ (555 tokens). Examples: =ma, =kan, =hã, =na, =pe, ãma, =ras, =ken, =sa, ĩka

Relations

PART nodes are attached to their parents using 17 different relations: case (221; 40% instances), advmod (155; 28% instances), discourse (98; 18% instances), mark (43; 8% instances), cc (7; 1% instances), compound (7; 1% instances), reparandum (7; 1% instances), dep (3; 1% instances), dislocated (3; 1% instances), root (3; 1% instances), ccomp (2; 0% instances), obj (2; 0% instances), obl:arg (2; 0% instances), appos (1; 0% instances), obl (1; 0% instances), obl:lmod (1; 0% instances), parataxis (1; 0% instances)

Parents of PART nodes belong to 9 different parts of speech: PRON (186; 33% instances), VERB (176; 32% instances), NOUN (138; 25% instances), PART (24; 4% instances), ADV (12; 2% instances), X (11; 2% instances), NUM (5; 1% instances), no tag, i.e. the artificial root node (3; 1% instances), AUX (2; 0% instances)

443 (80%) PART nodes are leaves.

87 (16%) PART nodes have one child.

13 (2%) PART nodes have two children.

14 (3%) PART nodes have three or more children.

The highest child degree of a PART node is 5.
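The child-degree figures above partition the 557 PART tokens into four buckets (443 + 87 + 13 + 14). A minimal sketch of that bucketing follows, assuming each sentence is given as a list of `(id, head, upos)` tuples; the function name `child_degree_buckets` and this input format are hypothetical, chosen only for illustration.

```python
from collections import Counter

def child_degree_buckets(sentences, upos="PART"):
    """Count nodes tagged `upos` by number of children (0, 1, 2, 3+).

    `sentences` is a hypothetical input: a list of sentences, each a
    list of (id, head, upos) tuples with integer ids and heads.
    Returns (buckets, max_degree); bucket key 3 means "three or more".
    """
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per node id within this sentence.
        n_children = Counter(head for _, head, _ in sent)
        for node_id, _, tag in sent:
            if tag != upos:
                continue
            degree = n_children[node_id]
            buckets[min(degree, 3)] += 1
            max_degree = max(max_degree, degree)
    return buckets, max_degree
```

The percentages on this page follow from dividing each bucket by the PART token count, for example `round(443 / 557 * 100)` for the share of leaves.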

Children of PART nodes are attached using 17 different relations: punct (71; 44% instances), dep (22; 14% instances), discourse (14; 9% instances), reparandum (12; 7% instances), compound (8; 5% instances), det (8; 5% instances), advmod (7; 4% instances), mark (7; 4% instances), cop (2; 1% instances), nsubj (2; 1% instances), obl:lmod (2; 1% instances), orphan (2; 1% instances), case (1; 1% instances), ccomp (1; 1% instances), obl (1; 1% instances), obl:mod (1; 1% instances), xcomp (1; 1% instances)

Children of PART nodes belong to 13 different parts of speech: PUNCT (71; 44% instances), PRON (25; 15% instances), PART (24; 15% instances), NOUN (13; 8% instances), ADV (7; 4% instances), X (6; 4% instances), ADP (5; 3% instances), AUX (2; 1% instances), DET (2; 1% instances), INTJ (2; 1% instances), NUM (2; 1% instances), VERB (2; 1% instances), CCONJ (1; 1% instances)