home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-EWT: POS Tags: PART

There are 3 PART lemmas (0%), 20 PART types (0%) and 7041 PART tokens (3%). Out of 17 observed tags, the rank of PART is: 17 in number of lemmas, 17 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, not, ‘s

The 10 most frequent PART types: to, not, n’t, ‘s, s, nt, ‘, ’s, na, n’t

The 10 most frequent ambiguous lemmas: to (PART 4017, ADP 2206, SCONJ 83, NOUN 2, VERB 1)

The 10 most frequent ambiguous types: to (PART 3946, ADP 2175, SCONJ 82, ADV 12, NOUN 2, X 2, DET 1, VERB 1), ’s (PART 670, AUX 395, VERB 53, PRON 14), s (AUX 100, PART 98, VERB 10, ADJ 7, PRON 7, X 4, NOUN 2, PROPN 1), (PUNCT 240, PART 36, NOUN 7), ’s (PART 29, AUX 11, VERB 3, PRON 1), ta (PART 7, ADP 3), (PART 3, PUNCT 2), n (CCONJ 4, PART 2), too (ADV 150, PART 2, ADP 1), 2 (NUM 173, PROPN 2, X 2, ADP 1, PART 1)

Morphology

The form / lemma ratio of PART is 6.666667 (the average of all parts of speech is 1.228673).

The 1st highest number of forms (8) was observed with the lemma “to”: 2, a, na, ot, ta, the, to, too.

The 2nd highest number of forms (7) was observed with the lemma “’s”: ’, ‘s, -s, `s, s, ’, ’s.

The 3rd highest number of forms (5) was observed with the lemma “not”: n, n’t, not, nt, n’t.

PART occurs with 2 features: Typo (106; 2% instances), Abbr (27; 0% instances)

PART occurs with 2 feature-value pairs: Abbr=Yes, Typo=Yes

PART occurs with 3 feature combinations. The most frequent feature combination is _ (6908 tokens). Examples: to, not, n’t, ‘s, nt, ‘, ’s, n’t, ’, n

Relations

PART nodes are attached to their parents using 15 different relations: mark (3987; 57% instances), advmod (2125; 30% instances), case (841; 12% instances), conj (20; 0% instances), fixed (19; 0% instances), xcomp (17; 0% instances), advcl (9; 0% instances), cc (8; 0% instances), ccomp (4; 0% instances), parataxis (3; 0% instances), root (3; 0% instances), compound (2; 0% instances), acl:relcl (1; 0% instances), csubj (1; 0% instances), reparandum (1; 0% instances)

Parents of PART nodes belong to 14 different parts of speech: VERB (5357; 76% instances), PROPN (584; 8% instances), NOUN (548; 8% instances), ADJ (367; 5% instances), ADV (62; 1% instances), PRON (46; 1% instances), AUX (32; 0% instances), DET (12; 0% instances), SCONJ (12; 0% instances), NUM (8; 0% instances), PART (7; 0% instances), (3; 0% instances), ADP (2; 0% instances), SYM (1; 0% instances)

6987 (99%) PART nodes are leaves.

40 (1%) PART nodes have one child.

8 (0%) PART nodes have two children.

6 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 4.

Children of PART nodes are attached using 11 different relations: cc (23; 30% instances), mark (15; 20% instances), fixed (14; 18% instances), punct (11; 14% instances), advmod (6; 8% instances), advcl (2; 3% instances), ccomp (1; 1% instances), conj (1; 1% instances), csubj (1; 1% instances), discourse (1; 1% instances), obj (1; 1% instances)

Children of PART nodes belong to 8 different parts of speech: CCONJ (23; 30% instances), SCONJ (15; 20% instances), PUNCT (11; 14% instances), VERB (10; 13% instances), ADV (8; 11% instances), PART (7; 9% instances), INTJ (1; 1% instances), NOUN (1; 1% instances)