
This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: POS Tags: PART

There are 4 PART lemmas (0%), 6 PART types (0%) and 164 PART tokens (0%). Out of 17 observed tags, the rank of PART is: 16 in number of lemmas, 17 in number of types and 16 in number of tokens.

The 10 most frequent PART lemmas: t, l’, ci, là

The 10 most frequent PART types: -t, l’, -ci, ci, là, t’

The 10 most frequent ambiguous lemmas: t (PART 84, NOUN 1), l’ (PART 71, DET 9, PROPN 1), ci (PART 8, ADV 1), là (ADV 94, PART 1)

The 10 most frequent ambiguous types: l’ (DET 6341, PRON 214, PART 71, PROPN 1), -ci (PART 6, ADV 1), ci (PART 2, ADV 1), là (ADV 67, DET 1, PART 1), t’ (PRON 4, PART 1)

Morphology

The form / lemma ratio of PART is 1.500000 (the average of all parts of speech is 1.302640).
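As a rough illustration of how such a ratio could be reproduced, the sketch below counts distinct lemmas and distinct word forms for one UPOS tag in CoNLL-U text. It assumes the ratio is distinct forms divided by distinct lemmas, which is consistent with the 6 types and 4 lemmas reported above (6 / 4 = 1.5). The function name and the toy rows are hypothetical and are not taken from UD_French-GSD or from the script that generated this page.

```python
def part_of_speech_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, token count) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        if cols[3] == upos:
            tokens += 1
            forms.add(cols[1])
            lemmas.add(cols[2])
    return len(lemmas), len(forms), tokens


# Toy rows invented for illustration only: (ID, FORM, LEMMA, UPOS, HEAD, DEPREL).
ROWS = [
    ("1", "celui", "celui", "PRON", "0", "root"),
    ("2", "-ci", "ci", "PART", "1", "advmod"),
    ("3", "ci", "ci", "PART", "1", "advmod"),
    ("4", "-t", "t", "PART", "1", "expl"),
]
# Fill the remaining CoNLL-U columns (XPOS, FEATS, DEPS, MISC) with "_".
SAMPLE = "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", head, rel, "_", "_"])
    for i, form, lemma, tag, head, rel in ROWS
)

n_lemmas, n_forms, n_tokens = part_of_speech_counts(SAMPLE, "PART")
ratio = n_forms / n_lemmas
print(f"{n_lemmas} lemmas, {n_forms} forms, {n_tokens} tokens, ratio {ratio:.6f}")
```

Running the same function over the treebank's CoNLL-U files with "PART" as the tag should give counts comparable to those on this page, provided the assumption about how forms are counted holds.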

The 1st highest number of forms (2) was observed with the lemma “ci”: -ci, ci.

The 2nd highest number of forms (2) was observed with the lemma “t”: -t, t’.

The 3rd highest number of forms (1) was observed with the lemma “l’”: l’.

PART occurs with 1 feature: Typo (2; 1% instances)

PART occurs with 1 feature-value pair: Typo=Yes

PART occurs with 2 feature combinations. The most frequent feature combination is _ (162 tokens). Examples: -t, l’, -ci, ci, t’

Relations

PART nodes are attached to their parents using 6 different relations: expl (84; 51% instances), nsubj (67; 41% instances), advmod (8; 5% instances), nsubj:caus (3; 2% instances), goeswith (1; 1% instances), nsubj:pass (1; 1% instances)
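The percentages above can be checked against the raw counts. The short sketch below uses only the counts listed on this page and assumes each share is rounded to the nearest whole percent; the variable names are illustrative.

```python
# Relation counts for PART nodes, copied from the list above.
counts = {
    "expl": 84,
    "nsubj": 67,
    "advmod": 8,
    "nsubj:caus": 3,
    "goeswith": 1,
    "nsubj:pass": 1,
}

total = sum(counts.values())  # should equal the number of PART tokens
# Assumption: shares are rounded to the nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in counts.items()}

for rel, n in counts.items():
    print(f"{rel}: {n} of {total} ({shares[rel]}%)")
```

Because each share is rounded separately, the rounded percentages add up to 101 rather than 100.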

Parents of PART nodes belong to 3 different parts of speech: PRON (87; 53% instances), VERB (71; 43% instances), NOUN (6; 4% instances)

93 (57%) PART nodes are leaves.

71 (43%) PART nodes have one child.

The highest child degree of a PART node is 1.

Children of PART nodes are attached using 1 relation: fixed (71; 100% instances)

Children of PART nodes belong to 1 part of speech: PRON (71; 100% instances)