home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: PART

There are 5 PART lemmas (0%), 11 PART types (0%) and 1902 PART tokens (2%). Out of 17 observed tags, the rank of PART is: 17 in number of lemmas, 17 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, ‘s, n’t, not, -

The 10 most frequent PART types: to, not, ‘s, n’t, n’t, ’s, ‘, ’, n`t, -

The 10 most frequent ambiguous lemmas: to (PART 1119, ADP 635, X 1), ’s (PART 286, AUX 1), - (PUNCT 81, ADP 17, CCONJ 3, PART 1, SYM 1)

The 10 most frequent ambiguous types: to (PART 1119, ADP 635, X 1), ’s (PART 230, AUX 118, VERB 26, PRON 2), ’s (AUX 51, PART 36, VERB 20), (PUNCT 139, PART 11), (PART 9, PUNCT 3), - (PUNCT 81, ADP 17, CCONJ 3, PART 1, SYM 1)

Morphology

The form / lemma ratio of PART is 2.200000 (the average of all parts of speech is 1.227660).

The 1st highest number of forms (4) was observed with the lemma “’s”: ’, ‘s, ’, ’s.

The 2nd highest number of forms (4) was observed with the lemma “n’t”: n’t, n`t, n’t, ’t.

The 3rd highest number of forms (1) was observed with the lemma “-”: -.

PART occurs with 1 features: Polarity (449; 24% instances)

PART occurs with 1 feature-value pairs: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _ (1453 tokens). Examples: to, ‘s, ’s, n’t, not, ‘, ’, n`t, -

Relations

PART nodes are attached to their parents using 14 different relations: mark (1087; 57% instances), advmod (469; 25% instances), case (286; 15% instances), fixed (21; 1% instances), obl (13; 1% instances), conj (6; 0% instances), dep (4; 0% instances), nmod (4; 0% instances), advcl (2; 0% instances), cc (2; 0% instances), goeswith (2; 0% instances), parataxis (2; 0% instances), root (2; 0% instances), xcomp (2; 0% instances)

Parents of PART nodes belong to 14 different parts of speech: VERB (1411; 74% instances), NOUN (174; 9% instances), PROPN (162; 9% instances), ADJ (103; 5% instances), ADV (18; 1% instances), NUM (8; 0% instances), ADP (7; 0% instances), PRON (7; 0% instances), AUX (3; 0% instances), SCONJ (3; 0% instances), (2; 0% instances), X (2; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances)

1881 (99%) PART nodes are leaves.

12 (1%) PART nodes have one child.

3 (0%) PART nodes have two children.

6 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 13 different relations: punct (13; 31% instances), cc (8; 19% instances), advmod (7; 17% instances), dep (3; 7% instances), cop (2; 5% instances), nsubj (2; 5% instances), compound (1; 2% instances), conj (1; 2% instances), mark (1; 2% instances), nmod (1; 2% instances), nmod:poss (1; 2% instances), nummod (1; 2% instances), obj (1; 2% instances)

Children of PART nodes belong to 13 different parts of speech: PUNCT (13; 31% instances), CCONJ (8; 19% instances), ADV (4; 10% instances), NOUN (3; 7% instances), PRON (3; 7% instances), ADP (2; 5% instances), AUX (2; 5% instances), SCONJ (2; 5% instances), ADJ (1; 2% instances), NUM (1; 2% instances), PART (1; 2% instances), VERB (1; 2% instances), X (1; 2% instances)