home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-FTB: POS Tags: CCONJ

There are 10 CCONJ lemmas (1%), 11 CCONJ types (1%) and 11973 CCONJ tokens (2%). Out of 16 observed tags, the rank of CCONJ is: 14 in number of lemmas, 15 in number of types and 12 in number of tokens.

The 10 most frequent CCONJ lemmas: _, mais, et, or, car, ou, puis, ni, soit, sinon

The 10 most frequent CCONJ types: _, Mais, Et, Or, Car, Ou, Puis, Ni, Soit, Sinon

The 10 most frequent ambiguous lemmas: _ (NOUN 115984, ADP 89082, DET 79464, PUNCT 73864, VERB 47092, ADJ 36213, ADV 22183, PROPN 21225, PRON 20877, NUM 17577, AUX 12831, CCONJ 11039, SCONJ 4969, X 2163, PART 239, INTJ 33), puis (CCONJ 14, ADV 4), sinon (SCONJ 4, ADV 2, CCONJ 2)

The 10 most frequent ambiguous types: _ (NOUN 115984, ADP 89082, DET 79464, PUNCT 73864, VERB 47092, ADJ 36213, ADV 22183, PROPN 21225, PRON 20877, NUM 17577, AUX 12831, CCONJ 11039, SCONJ 4969, X 2163, PART 239, INTJ 33), Puis (CCONJ 14, ADV 4), Sinon (SCONJ 4, ADV 2, CCONJ 2)

Morphology

The form / lemma ratio of CCONJ is 1.100000 (the average of all parts of speech is 1.170225).

The 1st highest number of forms (2) was observed with the lemma “mais”: MAIS, Mais.

The 2nd highest number of forms (1) was observed with the lemma “_”: _.

The 3rd highest number of forms (1) was observed with the lemma “car”: Car.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 17 different relations: cc (11604; 97% instances), fixed (308; 3% instances), root (13; 0% instances), conj (9; 0% instances), obj (8; 0% instances), acl (7; 0% instances), advmod (7; 0% instances), xcomp (4; 0% instances), obl (3; 0% instances), parataxis (3; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), dep (1; 0% instances), mark (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances), orphan (1; 0% instances)

Parents of CCONJ nodes belong to 15 different parts of speech: NOUN (5357; 45% instances), VERB (3220; 27% instances), ADJ (1191; 10% instances), PROPN (1121; 9% instances), PRON (254; 2% instances), AUX (244; 2% instances), NUM (196; 2% instances), ADV (175; 1% instances), ADP (95; 1% instances), X (53; 0% instances), DET (26; 0% instances), CCONJ (23; 0% instances), (13; 0% instances), INTJ (3; 0% instances), SCONJ (2; 0% instances)

11087 (93%) CCONJ nodes are leaves.

620 (5%) CCONJ nodes have one child.

154 (1%) CCONJ nodes have two children.

112 (1%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 8.

Children of CCONJ nodes are attached using 19 different relations: punct (621; 48% instances), advmod (513; 40% instances), fixed (68; 5% instances), dep (32; 2% instances), nmod (30; 2% instances), conj (8; 1% instances), obj (4; 0% instances), advcl (3; 0% instances), nsubj (3; 0% instances), obl (3; 0% instances), acl (2; 0% instances), cop (2; 0% instances), amod (1; 0% instances), cc (1; 0% instances), det (1; 0% instances), mark (1; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Children of CCONJ nodes belong to 14 different parts of speech: PUNCT (622; 48% instances), ADV (468; 36% instances), ADP (70; 5% instances), NOUN (53; 4% instances), CCONJ (23; 2% instances), VERB (22; 2% instances), PROPN (8; 1% instances), SCONJ (7; 1% instances), PRON (6; 0% instances), ADJ (5; 0% instances), X (5; 0% instances), DET (3; 0% instances), AUX (2; 0% instances), NUM (2; 0% instances)