
This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: PART

There are 3 PART lemmas (0%), 15 PART types (0%) and 3851 PART tokens (2%). Out of 17 observed tags, the rank of PART is: 17 in number of lemmas, 17 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, not, ‘s

The 10 most frequent PART types: to, not, n’t, ‘s, ’s, n’t, na, ‘, ’, n`t

The 10 most frequent ambiguous lemmas: to (PART 2198, ADP 1307, SCONJ 26, X 1)

The 10 most frequent ambiguous types: to (PART 2078, ADP 1287, SCONJ 26, DET 1, VERB 1, X 1), ’s (AUX 393, PART 373, VERB 53, PRON 26), ’s (PART 153, AUX 106, PRON 9, VERB 7), (PUNCT 150, PART 32, NOUN 2), (PART 23, PUNCT 13), s (AUX 3, PART 3, NOUN 1, VERB 1, X 1), a (DET 2854, ADP 2, PART 2, SCONJ 2, AUX 1, NOUN 1, PROPN 1), do (AUX 274, VERB 166, NOUN 3, PROPN 3, PART 1), the (DET 7251, PART 1, X 1)

Morphology

The form / lemma ratio of PART is 5.000000 (the average of all parts of speech is 1.226279).
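
The ratio is the number of distinct word forms divided by the number of distinct lemmas for the tag (here 15 types / 3 lemmas = 5). A minimal sketch of how such a figure could be derived from CoNLL-U text follows. It assumes the standard ten tab-separated columns (FORM, LEMMA and UPOS in columns 2 to 4). The function name `form_lemma_ratio` and the sample sentences are illustrative only and are not taken from the UD tooling or from the GUM data. Lowercasing the forms is also an assumption, since this page does not say how the official script treats case.

```python
def form_lemma_ratio(conllu_text, upos="PART"):
    """Return (n_forms, n_lemmas, ratio) for one UPOS tag in CoNLL-U text."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            forms.add(cols[1].lower())  # assumption: forms are case-folded
            lemmas.add(cols[2])
    ratio = len(forms) / len(lemmas) if lemmas else 0.0
    return len(forms), len(lemmas), ratio


# Hypothetical two-sentence sample; fields are written space-separated
# here and converted to tabs for readability.
SAMPLE = "\n".join(row.replace(" ", "\t") for row in [
    "1 I I PRON _ _ 4 nsubj _ _",
    "2 do do AUX _ _ 4 aux _ _",
    "3 n't not PART _ Polarity=Neg 4 advmod _ _",
    "4 want want VERB _ _ 0 root _ _",
    "5 to to PART _ _ 6 mark _ _",
    "6 go go VERB _ _ 4 xcomp _ _",
    "",
    "1 Not not PART _ Polarity=Neg 3 advmod _ _",
    "2 to to PART _ _ 3 mark _ _",
    "3 go go VERB _ _ 0 root _ _",
])

n_forms, n_lemmas, ratio = form_lemma_ratio(SAMPLE)
print(n_forms, n_lemmas, ratio)
```

In the sample, the forms n't, not and to map to the two lemmas not and to, so the ratio is 3 / 2.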

The highest number of forms (6) was observed with the lemma “to”: a, do, na, ta, the, to.

The 2nd highest number of forms (5) was observed with the lemma “’s”: ’, ‘s, s, ’, ’s.

The 3rd highest number of forms (4) was observed with the lemma “not”: n’t, n`t, not, n’t.

PART occurs with 2 features: Polarity (1069; 28% instances), Typo (7; 0% instances)

PART occurs with 2 feature-value pairs: Polarity=Neg, Typo=Yes

PART occurs with 4 feature combinations. The most frequent feature combination is _ (2776 tokens). Examples: to, ‘s, ’s, na, ‘, ’, a, ta

Relations

PART nodes are attached to their parents using 11 different relations: mark (2166; 56% instances), advmod (1050; 27% instances), case (584; 15% instances), xcomp (19; 0% instances), conj (10; 0% instances), acl (8; 0% instances), advcl (4; 0% instances), root (4; 0% instances), orphan (2; 0% instances), parataxis (2; 0% instances), reparandum (2; 0% instances)

Parents of PART nodes belong to 16 different parts of speech: VERB (2844; 74% instances), PROPN (355; 9% instances), NOUN (353; 9% instances), ADJ (188; 5% instances), ADV (43; 1% instances), PRON (28; 1% instances), AUX (10; 0% instances), NUM (10; 0% instances), DET (5; 0% instances), no parent, i.e. attached as root (4; 0% instances), ADP (2; 0% instances), CCONJ (2; 0% instances), INTJ (2; 0% instances), SCONJ (2; 0% instances), X (2; 0% instances), PART (1; 0% instances)

3818 (99%) PART nodes are leaves.

23 (1%) PART nodes have one child.

3 (0%) PART nodes have two children.

7 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 6.
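
The child-degree figures above (leaves, one child, two children, three or more) count how many tokens in the same sentence name a given PART node as their HEAD. A minimal sketch of that count, assuming standard CoNLL-U input with sentences separated by blank lines, is given below. The function name `child_degrees` and the sample sentences are hypothetical and do not come from the UD statistics script or from the GUM data.

```python
from collections import Counter


def child_degrees(conllu_text, upos="PART"):
    """Map child degree -> number of nodes with that degree, for one UPOS tag."""
    hist = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # keep only regular word lines (integer ID, ten columns)
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # HEAD is column 7
        for r in rows:
            if r[3] == upos:
                hist[n_children[r[0]]] += 1  # a missing key counts as 0 children
    return hist


# Hypothetical sample: "Why not ?" (PART as root with two children)
# and "to go" (PART as a leaf). Spaces are converted to tabs.
SAMPLE = "\n".join(row.replace(" ", "\t") for row in [
    "1 Why why ADV _ _ 2 advmod _ _",
    "2 not not PART _ Polarity=Neg 0 root _ _",
    "3 ? ? PUNCT _ _ 2 punct _ _",
    "",
    "1 to to PART _ _ 2 mark _ _",
    "2 go go VERB _ _ 0 root _ _",
])

hist = child_degrees(SAMPLE)
print(dict(hist), "highest degree:", max(hist))
```

Applied to the full treebank, the keys of such a histogram would correspond to the leaf, one-child, two-children and three-or-more counts, and the largest key to the highest child degree.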

Children of PART nodes are attached using 12 different relations: punct (25; 42% instances), cc (10; 17% instances), advmod (7; 12% instances), cop (4; 7% instances), nsubj (4; 7% instances), discourse (3; 5% instances), case (1; 2% instances), conj (1; 2% instances), det (1; 2% instances), mark (1; 2% instances), nmod (1; 2% instances), parataxis (1; 2% instances)

Children of PART nodes belong to 12 different parts of speech: PUNCT (25; 42% instances), CCONJ (10; 17% instances), ADV (6; 10% instances), AUX (4; 7% instances), PRON (4; 7% instances), INTJ (3; 5% instances), DET (2; 3% instances), ADJ (1; 2% instances), ADP (1; 2% instances), PART (1; 2% instances), SCONJ (1; 2% instances), VERB (1; 2% instances)