home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-LinES: POS Tags: PART

There are 3 PART lemmas (0%), 6 PART types (0%) and 2448 PART tokens (3%). Out of 17 observed tags, the rank of PART is: 16 in number of lemmas, 16 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, not, ‘s

The 10 most frequent PART types: to, not, ‘s, n’t, ‘, t’

The 10 most frequent ambiguous lemmas: to (PART 1315, ADP 918, ADV 3), not (PART 773, ADV 1)

The 10 most frequent ambiguous types: to (PART 1282, ADP 906, ADV 3), not (PART 479, ADV 1), ’s (PART 332, AUX 155, VERB 36, PRON 1), (PUNCT 274, PART 28)

Morphology

The form / lemma ratio of PART is 2.000000 (the average of all parts of speech is 1.240477).

The 1st highest number of forms (2) was observed with the lemma “’s”: ’, ‘s.

The 2nd highest number of forms (2) was observed with the lemma “not”: n’t, not.

The 3rd highest number of forms (2) was observed with the lemma “to”: t’, to.

PART occurs with 1 features: Polarity (773; 32% instances)

PART occurs with 1 feature-value pairs: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _ (1675 tokens). Examples: to, ‘s, ‘, t’

Relations

PART nodes are attached to their parents using 12 different relations: mark (1309; 53% instances), advmod (761; 31% instances), case (359; 15% instances), conj (7; 0% instances), root (3; 0% instances), amod (2; 0% instances), obj (2; 0% instances), advcl (1; 0% instances), fixed (1; 0% instances), flat (1; 0% instances), orphan (1; 0% instances), xcomp (1; 0% instances)

Parents of PART nodes belong to 11 different parts of speech: VERB (1793; 73% instances), NOUN (262; 11% instances), PROPN (200; 8% instances), ADJ (112; 5% instances), ADV (31; 1% instances), AUX (22; 1% instances), PRON (19; 1% instances), (3; 0% instances), SCONJ (3; 0% instances), DET (2; 0% instances), NUM (1; 0% instances)

2432 (99%) PART nodes are leaves.

11 (0%) PART nodes have one child.

3 (0%) PART nodes have two children.

2 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 6.

Children of PART nodes are attached using 11 different relations: cc (7; 27% instances), punct (6; 23% instances), advmod (3; 12% instances), case (2; 8% instances), parataxis (2; 8% instances), advcl (1; 4% instances), conj (1; 4% instances), discourse (1; 4% instances), mark (1; 4% instances), nmod (1; 4% instances), nsubj (1; 4% instances)

Children of PART nodes belong to 9 different parts of speech: CCONJ (7; 27% instances), PUNCT (6; 23% instances), VERB (4; 15% instances), ADV (3; 12% instances), ADP (2; 8% instances), INTJ (1; 4% instances), NOUN (1; 4% instances), PRON (1; 4% instances), SCONJ (1; 4% instances)