
This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-PUD: POS Tags: CCONJ

CCONJ has 1 lemma (4%), 8 types (0%) and 578 tokens (2%). Out of 14 observed tags, CCONJ ranks 7th in number of lemmas, 12th in number of types and 10th in number of tokens.
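As an illustration of how counts like these can be derived, here is a minimal sketch that tallies lemmas, types and tokens for one tag in CoNLL-U text. The `SAMPLE` fragment and the `count_upos` helper are hypothetical and not taken from UD_Portuguese-PUD; the sketch also assumes that types are case-sensitive word forms, which is not confirmed by this page.

```python
# Hypothetical toy CoNLL-U fragment (NOT taken from UD_Portuguese-PUD).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = """\
1\tgatos\t_\tNOUN\t_\t_\t0\troot\t_\t_
2\te\t_\tCCONJ\t_\t_\t3\tcc\t_\t_
3\tcães\t_\tNOUN\t_\t_\t1\tconj\t_\t_

1\tMas\t_\tCCONJ\t_\t_\t2\tcc\t_\t_
2\tchove\t_\tVERB\t_\t_\t0\troot\t_\t_
"""


def count_upos(conllu_text, upos):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # assumption: types are case-sensitive forms
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens


n_lemmas, n_types, n_tokens = count_upos(SAMPLE, "CCONJ")
print(n_lemmas, n_types, n_tokens)
```

In the toy fragment every lemma is the placeholder `_`, so the two CCONJ tokens share a single lemma while contributing two distinct types.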

The only CCONJ lemma: _ (the CoNLL-U placeholder for an unspecified value)

The 10 most frequent CCONJ types: e, mas, ou, porém, Entretanto, como, contudo, portanto

The 10 most frequent ambiguous lemmas: _ (NOUN 4636, ADP 2571, PUNCT 2547, VERB 2512, DET 2070, ADJ 1554, PROPN 1352, PRON 910, ADV 841, CCONJ 578, NUM 471, AUX 328, SYM 34, X 9)

The 10 most frequent ambiguous types: como (ADP 119, ADV 5, CCONJ 4), portanto (ADP 1, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 8.000000 (the average of all parts of speech is 228.814815).

The highest number of forms (8) was observed with the lemma “_”: Entretanto, como, contudo, e, mas, ou, portanto, porém.
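The ratio for CCONJ can be reproduced with a short sketch. It assumes the form / lemma ratio is the number of distinct forms divided by the number of distinct lemmas, which matches the figure of 8.000000 but is not stated on this page; `form_lemma_ratio` is a hypothetical helper name.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas, given (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# The eight CCONJ forms listed above, all sharing the lemma "_".
CCONJ_FORMS = ["Entretanto", "como", "contudo", "e",
               "mas", "ou", "portanto", "porém"]

ratio = form_lemma_ratio((form, "_") for form in CCONJ_FORMS)
print(f"{ratio:.6f}")  # 8.000000
```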

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 4 different relations: cc (531; 92% of instances), discourse (43; 7% of instances), cc:preconj (2; 0% of instances), nummod (2; 0% of instances).
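The percentages follow from the raw counts. The sketch below recomputes them, assuming each share is rounded to the nearest whole percent; `shares` is a hypothetical helper name.

```python
# Relation counts for CCONJ nodes, copied from the list above.
DEPREL_COUNTS = {"cc": 531, "discourse": 43, "cc:preconj": 2, "nummod": 2}


def shares(counts):
    """Map each key to its share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}


total = sum(DEPREL_COUNTS.values())
print(total)  # 578, the number of CCONJ tokens
for deprel, pct in shares(DEPREL_COUNTS).items():
    print(f"{deprel}: {pct}%")
```

The two relations with 2 instances each round down to 0%, which is why they appear with a zero share despite being attested.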

Parents of CCONJ nodes belong to 10 different parts of speech: NOUN (233; 40% of instances), VERB (217; 38% of instances), PROPN (50; 9% of instances), ADJ (40; 7% of instances), NUM (14; 2% of instances), ADV (11; 2% of instances), ADP (7; 1% of instances), PRON (3; 1% of instances), SYM (2; 0% of instances), DET (1; 0% of instances).

511 (88%) CCONJ nodes are leaves.

64 (11%) CCONJ nodes have one child.

3 (1%) CCONJ nodes have two children.

The highest child degree of a CCONJ node is 2.
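Child degree here is the number of nodes whose head is the given node. A minimal sketch of that count follows, using a hypothetical toy tree (not a sentence from the treebank) and a hypothetical helper `child_degrees`.

```python
from collections import Counter

# Hypothetical toy tree as (id, upos, head) tuples; head 0 marks the root.
TOY_TREE = [
    (1, "CCONJ", 3),   # has one child: the PUNCT node below
    (2, "PUNCT", 1),
    (3, "VERB", 0),
    (4, "CCONJ", 3),   # a leaf
]


def child_degrees(sentence, upos):
    """Return the number of children of each node tagged `upos`."""
    # Count how many times each node id is used as a head.
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[node_id] for node_id, tag, _ in sentence if tag == upos]


degrees = child_degrees(TOY_TREE, "CCONJ")
print(degrees)                  # [1, 0]
print(Counter(degrees))         # one node with one child, one leaf
print(max(degrees))             # highest child degree in the toy tree
```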

Children of CCONJ nodes are attached using 3 different relations: punct (67; 96% of instances), nummod (2; 3% of instances), fixed (1; 1% of instances).

Children of CCONJ nodes belong to 3 different parts of speech: PUNCT (67; 96% of instances), NUM (2; 3% of instances), ADV (1; 1% of instances).