home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English: POS Tags: PART

There are 13 PART lemmas (0%), 21 PART types (0%) and 6827 PART tokens (3%). Out of 17 observed tags, the rank of PART is: 17 in number of lemmas, 17 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, not, ‘s, s, ‘, na, ta, too, -s, 2

The 10 most frequent PART types: to, not, n’t, ‘s, s, nt, ‘, ’s, na, n’t

The 10 most frequent ambiguous lemmas: to (PART 3988, ADP 2208, SCONJ 84, ADV 12, NOUN 2, VERB 1), not (PART 1974, ADV 193, CCONJ 16), ’s (PART 701, PRON 14), s (PART 98, X 10, PRON 7, NOUN 3, PROPN 1), (PUNCT 244, PART 39, NOUN 7, AUX 1), ta (PART 7, ADP 4), too (ADV 160, PART 2, ADP 1), 2 (NUM 145, X 30, PROPN 2, ADP 1, PART 1), `s (AUX 9, PART 1), a (DET 5348, NOUN 20, X 14, ADV 6, ADP 4, AUX 1, CCONJ 1, PART 1)

The 10 most frequent ambiguous types: to (PART 3942, ADP 2182, SCONJ 82, ADV 12, NOUN 2, VERB 1), not (PART 971, ADV 162, CCONJ 14), ’s (PART 670, AUX 391, VERB 58, PRON 14), s (AUX 100, PART 97, VERB 11, X 10, PRON 7, NOUN 2, PROPN 1), (PUNCT 240, PART 36, NOUN 7), ’s (PART 29, AUX 11, VERB 3, PRON 1), ta (PART 7, ADP 3), (PART 3, PUNCT 2), n (CCONJ 3, PART 2, NOUN 1), too (ADV 150, PART 2, ADP 1)

Morphology

The form / lemma ratio of PART is 1.615385 (the average of all parts of speech is 1.176027).

The 1st highest number of forms (6) was observed with the lemma “not”: n, n’t, not, nt, n’t, t.

The 2nd highest number of forms (2) was observed with the lemma “’”: ’, ’.

The 3rd highest number of forms (2) was observed with the lemma “’s”: ’s, ’s.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 12 different relations: mark (3984; 58% instances), advmod (1935; 28% instances), case (840; 12% instances), conj (20; 0% instances), xcomp (17; 0% instances), advcl (9; 0% instances), fixed (9; 0% instances), ccomp (4; 0% instances), parataxis (3; 0% instances), root (3; 0% instances), compound (2; 0% instances), reparandum (1; 0% instances)

Parents of PART nodes belong to 15 different parts of speech: VERB (5285; 77% instances), PROPN (573; 8% instances), NOUN (498; 7% instances), ADJ (327; 5% instances), ADV (73; 1% instances), PRON (33; 0% instances), AUX (14; 0% instances), DET (6; 0% instances), NUM (5; 0% instances), PART (4; 0% instances), (3; 0% instances), ADP (2; 0% instances), SCONJ (2; 0% instances), CCONJ (1; 0% instances), SYM (1; 0% instances)

6754 (99%) PART nodes are leaves.

52 (1%) PART nodes have one child.

14 (0%) PART nodes have two children.

7 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 4.

Children of PART nodes are attached using 17 different relations: punct (40; 39% instances), cc (23; 22% instances), mark (12; 12% instances), advmod (11; 11% instances), advcl (3; 3% instances), _ (2; 2% instances), conj (2; 2% instances), acl (1; 1% instances), amod (1; 1% instances), case (1; 1% instances), ccomp (1; 1% instances), csubj (1; 1% instances), discourse (1; 1% instances), flat (1; 1% instances), nsubj (1; 1% instances), orphan (1; 1% instances), parataxis (1; 1% instances)

Children of PART nodes belong to 10 different parts of speech: PUNCT (40; 39% instances), CCONJ (23; 22% instances), ADV (13; 13% instances), SCONJ (12; 12% instances), VERB (9; 9% instances), ADJ (2; 2% instances), ADP (1; 1% instances), INTJ (1; 1% instances), NOUN (1; 1% instances), PROPN (1; 1% instances)