
This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: PART

There are 3 PART lemmas (0%), 15 PART types (0%) and 5764 PART tokens (2%). Out of 17 observed tags, the rank of PART is: 17 in number of lemmas, 17 in number of types and 12 in number of tokens.
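As a rough illustration of how the three counts above relate to the underlying data, the following sketch tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The sample sentences, the helper names `iter_words` and `pos_counts`, and the choice to compare forms case-sensitively are assumptions made for this example; they are not taken from the script that generated this page.

```python
# Tiny hand-written CoNLL-U fragment (illustrative only, not taken from GUM).
SAMPLE = """\
1\tI\tI\tPRON\t_\t_\t2\tnsubj\t_\t_
2\twant\twant\tVERB\t_\t_\t0\troot\t_\t_
3\tto\tto\tPART\t_\t_\t4\tmark\t_\t_
4\tgo\tgo\tVERB\t_\t_\t2\txcomp\t_\t_

1\tDo\tdo\tAUX\t_\t_\t3\taux\t_\t_
2\tn't\tnot\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_
3\tgo\tgo\tVERB\t_\t_\t0\troot\t_\t_
"""


def iter_words(conllu_text):
    """Yield the 10 columns of every syntactic word in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator line or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for cols in iter_words(conllu_text):
        if cols[3] == upos:      # column 4 is UPOS
            types.add(cols[1])   # column 2 is FORM (assumed case-sensitive)
            lemmas.add(cols[2])  # column 3 is LEMMA
            tokens += 1
    return len(lemmas), len(types), tokens


n_lemmas, n_types, n_tokens = pos_counts(SAMPLE, "PART")
```

Run against the full treebank files instead of `SAMPLE`, a tally of this kind would be expected to yield figures comparable to those reported above, provided the same counting conventions are used.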

The 10 most frequent PART lemmas: to, not, ‘s

The 10 most frequent PART types: to, not, n’t, ‘s, ’s, n’t, na, ‘, ’, ta

The 10 most frequent ambiguous lemmas: to (PART 3288, ADP 1877, SCONJ 45, X 1)

The 10 most frequent ambiguous types: to (PART 3082, ADP 1852, SCONJ 45, DET 1, NUM 1, VERB 1, X 1), ’s (AUX 826, PART 478, VERB 96, PRON 43), ’s (PART 208, AUX 200, PRON 17, VERB 12), na (PART 132, INTJ 4, X 1), (PUNCT 164, PART 50, NOUN 2), (PART 31, PUNCT 24), ta (PART 11, ADP 4), s (PART 4, AUX 1, NOUN 1, VERB 1, X 1), a (DET 4058, ADP 4, PART 2, SCONJ 2, ADV 1, AUX 1, NOUN 1, PROPN 1), do (AUX 442, VERB 284, NOUN 3, PROPN 3, PART 1)

Morphology

The form / lemma ratio of PART is 5.000000 (the average of all parts of speech is 1.243866).

The 1st highest number of forms (6) was observed with the lemma “to”: a, do, na, ta, the, to.

The 2nd highest number of forms (5) was observed with the lemma “’s”: ’, ‘s, s, ’, ’s.

The 3rd highest number of forms (4) was observed with the lemma “not”: n’t, n`t, not, n’t.
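The form / lemma ratio reported above appears to be the number of distinct forms divided by the number of distinct lemmas. That reading is an assumption, but it is consistent with the three per-lemma form counts listed here, as this small check shows.

```python
# Per-lemma form counts as listed on this page.
forms_per_lemma = {"to": 6, "’s": 5, "not": 4}

total_forms = sum(forms_per_lemma.values())  # 15, the number of PART types
total_lemmas = len(forms_per_lemma)          # 3, the number of PART lemmas

# Assumed definition: distinct forms divided by distinct lemmas.
ratio = total_forms / total_lemmas
formatted = f"{ratio:.6f}"
```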

PART occurs with 3 features: Polarity (1703; 30% instances), Style (11; 0% instances), Typo (8; 0% instances)

PART occurs with 3 feature-value pairs: Polarity=Neg, Style=Coll, Typo=Yes

PART occurs with 5 feature combinations. The most frequent feature combination is _ (4043 tokens). Examples: to, ‘s, ’s, na, ‘, ’, a

Relations

PART nodes are attached to their parents using 13 different relations: mark (3240; 56% instances), advmod (1665; 29% instances), case (773; 13% instances), xcomp (30; 1% instances), conj (18; 0% instances), root (11; 0% instances), acl (8; 0% instances), reparandum (7; 0% instances), advcl (4; 0% instances), parataxis (3; 0% instances), ccomp (2; 0% instances), orphan (2; 0% instances), nmod (1; 0% instances)
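The percentages in this list can be reproduced from the raw counts, assuming each one is the count divided by the total number of PART tokens and rounded to the nearest whole percent. The rounding rule is an assumption; the counts are copied from the list above.

```python
# Incoming relation counts for PART nodes, as listed on this page.
relation_counts = {
    "mark": 3240, "advmod": 1665, "case": 773, "xcomp": 30, "conj": 18,
    "root": 11, "acl": 8, "reparandum": 7, "advcl": 4, "parataxis": 3,
    "ccomp": 2, "orphan": 2, "nmod": 1,
}

total = sum(relation_counts.values())  # should equal the PART token count

# Assumed rounding: nearest whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}
```

The counts sum to 5764, the number of PART tokens, so every PART node is accounted for by exactly one incoming relation.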

Parents of PART nodes belong to 15 different parts of speech: VERB (4244; 74% instances), NOUN (558; 10% instances), PROPN (465; 8% instances), ADJ (280; 5% instances), AUX (66; 1% instances), ADV (58; 1% instances), PRON (44; 1% instances), NUM (16; 0% instances), no part of speech, i.e. the artificial root node (11; 0% instances), DET (7; 0% instances), INTJ (5; 0% instances), ADP (3; 0% instances), X (3; 0% instances), PART (2; 0% instances), SCONJ (2; 0% instances)

5701 (99%) PART nodes are leaves.

42 (1%) PART nodes have one child.

8 (0%) PART nodes have two children.

13 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 9.
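The child-degree figures above can be derived from the HEAD column alone. The sketch below counts children per node for one sentence and groups the nodes into the same four buckets; the helper names and the four-word example are hypothetical. It also checks that the four bucket sizes reported on this page add up to the PART token total.

```python
from collections import Counter


def child_degrees(heads):
    """heads[i] is the 1-based head of word i+1; 0 marks the root.

    Returns the number of children of each word, in word order.
    """
    counts = Counter(h for h in heads if h != 0)
    return [counts[i] for i in range(1, len(heads) + 1)]


def bucket(degree):
    """Map a child count to the buckets used on this page."""
    if degree == 0:
        return "leaf"
    if degree == 1:
        return "one child"
    if degree == 2:
        return "two children"
    return "three or more children"


# Hypothetical sentence "I want to go": want is the root, to attaches to go.
heads = [2, 0, 4, 2]
degrees = child_degrees(heads)
summary = Counter(bucket(d) for d in degrees)

# Bucket sizes reported on this page for PART nodes.
page_buckets = {"leaf": 5701, "one child": 42,
                "two children": 8, "three or more children": 13}
page_total = sum(page_buckets.values())
```

In the example, "to" is a leaf, which is the typical configuration for PART nodes according to the figures above.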

Children of PART nodes are attached using 13 different relations: punct (49; 42% instances), cc (19; 16% instances), advmod (12; 10% instances), cop (10; 9% instances), nsubj (9; 8% instances), mark (5; 4% instances), discourse (4; 3% instances), obl (3; 3% instances), case (1; 1% instances), conj (1; 1% instances), det (1; 1% instances), nsubj:outer (1; 1% instances), parataxis (1; 1% instances)

Children of PART nodes belong to 13 different parts of speech: PUNCT (49; 42% instances), CCONJ (19; 16% instances), ADV (10; 9% instances), AUX (10; 9% instances), PRON (9; 8% instances), SCONJ (5; 4% instances), DET (4; 3% instances), INTJ (4; 3% instances), PART (2; 2% instances), ADJ (1; 1% instances), ADP (1; 1% instances), NOUN (1; 1% instances), VERB (1; 1% instances)