
This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-BR: POS Tags: CCONJ

There is 1 CCONJ lemma (5%), 47 CCONJ types (0%) and 10984 CCONJ tokens (3%). Out of 14 observed tags, the rank of CCONJ is: 6 in number of lemmas, 14 in number of types and 8 in number of tokens.
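As a rough illustration of how counts like these can be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U text. The function name `tag_counts` and the sample sentence are hypothetical and not taken from UD_Portuguese-BR; it also assumes forms are compared exactly as written, which may differ from how this page normalizes case.

```python
# Tiny made-up CoNLL-U fragment (NOT from the treebank). Columns:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = """\
1\tJoão\t_\tPROPN\t_\t_\t0\troot\t_\t_
2\te\t_\tCCONJ\t_\t_\t3\tcc\t_\t_
3\tMaria\t_\tPROPN\t_\t_\t1\tconj\t_\t_
4\tou\t_\tCCONJ\t_\t_\t5\tcc\t_\t_
5\tPedro\t_\tPROPN\t_\t_\t1\tconj\t_\t_
6\te\t_\tCCONJ\t_\t_\t7\tcc\t_\t_
7\tAna\t_\tPROPN\t_\t_\t1\tconj\t_\t_
"""


def tag_counts(conllu_text, tag):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separators and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == tag:
            lemmas.add(cols[2])
            forms.add(cols[1])  # assumption: case-sensitive comparison
            tokens += 1
    return len(lemmas), len(forms), tokens


print(tag_counts(SAMPLE, "CCONJ"))
```

On the sample, the three CCONJ tokens share the single placeholder lemma `_` and two distinct forms, which mirrors on a small scale the 1 lemma / 47 types / 10984 tokens pattern reported here.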

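The 10 most frequent CCONJ lemmas: _ (the underscore is the CoNLL-U placeholder for an unspecified value, so it is the only "lemma" observed)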

The 10 most frequent CCONJ types: e, que, mas, ou, se, eo, ea, quando, como, porque

The 10 most frequent ambiguous lemmas: _ (NOUN 57316, PUNCT 42033, PROPN 32948, ADP 30871, VERB 29700, DET 26122, ADJ 15107, CCONJ 10984, ADV 9773, NUM 8491, PRON 7392, AUX 5242, PART 748, X 539)

The 10 most frequent ambiguous types: e (CCONJ 5901, ADJ 14, X 9, ADP 2, AUX 2, DET 1), que (PRON 2970, CCONJ 2237, ADP 113, DET 7, NOUN 3, X 1), mas (CCONJ 500, ADV 1), se (PRON 755, PART 392, CCONJ 186, ADP 3, PROPN 1), eo (CCONJ 239, ADP 1), ea (CCONJ 232, VERB 1), quando (CCONJ 158, ADV 104, ADP 3), como (ADP 733, CCONJ 99, ADV 68), porque (CCONJ 108, ADV 6), enquanto (CCONJ 63, ADV 7, ADP 4)

Morphology

The form / lemma ratio of CCONJ is 47.000000 (the average of all parts of speech is 1851.578947).
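The ratio above is consistent with dividing the number of distinct CCONJ forms (47) by the number of distinct CCONJ lemmas (1). A minimal sketch under that assumption follows; the helper name `form_lemma_ratio` is hypothetical, and the exact formula used to generate this page is not stated in the text.

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Distinct forms per distinct lemma (assumed definition)."""
    return n_forms / n_lemmas


# Figures reported on this page for CCONJ: 47 types, 1 lemma.
print(f"{form_lemma_ratio(47, 1):.6f}")
```

Because the only lemma is the placeholder `_`, the ratio simply equals the number of distinct forms and says little about actual inflection.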

The highest number of forms (47) was observed with the lemma “_”: &, a, and, animado.Quando, assim, até, because, but, caso, como, conforme, de, do, e, ea, either, embora, enquanto, então, eo, et, he, i.e., logo, mais, mas, mesmo, n, nem, or, ou, pois, porque, porém, q, qua, quando, quanto, que, se, segundo, seja, sem, tampouco, tanto, têm, y.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 12 different relations: cc (7717; 70% instances), mark (2841; 26% instances), fixed (333; 3% instances), dep (29; 0% instances), nummod (22; 0% instances), conj (12; 0% instances), advmod (10; 0% instances), nsubj (6; 0% instances), ccomp (5; 0% instances), obj (5; 0% instances), case (3; 0% instances), nsubj:pass (1; 0% instances)
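The percentages in this list can be reproduced from the raw counts. The sketch below uses the counts exactly as reported above and assumes each percentage is the count divided by the total number of CCONJ tokens, rounded to the nearest integer; the variable names are illustrative only.

```python
# Relation counts for CCONJ nodes, copied from the list above.
CCONJ_RELATIONS = {
    "cc": 7717, "mark": 2841, "fixed": 333, "dep": 29, "nummod": 22,
    "conj": 12, "advmod": 10, "nsubj": 6, "ccomp": 5, "obj": 5,
    "case": 3, "nsubj:pass": 1,
}

total = sum(CCONJ_RELATIONS.values())  # should match the CCONJ token count

# Assumed rounding: nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in CCONJ_RELATIONS.items()}

for rel, n in CCONJ_RELATIONS.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```

The counts sum to 10984, the CCONJ token total, and relations with fewer than about 55 instances round down to 0%, which is why most of the list shows "0%".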

Parents of CCONJ nodes belong to 13 different parts of speech: VERB (5580; 51% instances), NOUN (2949; 27% instances), PROPN (1347; 12% instances), ADJ (359; 3% instances), NUM (210; 2% instances), ADV (195; 2% instances), ADP (149; 1% instances), PRON (100; 1% instances), DET (32; 0% instances), PART (29; 0% instances), CCONJ (20; 0% instances), AUX (7; 0% instances), X (7; 0% instances)

10881 (99%) CCONJ nodes are leaves.

81 (1%) CCONJ nodes have one child.

19 (0%) CCONJ nodes have two children.

3 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 4.
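The leaf and child-degree figures above come from counting how many tokens name a given CCONJ node as their head. A minimal sketch of that count is shown here for a single made-up sentence given as (ID, UPOS, HEAD) triples; the data and the function name `child_degrees` are hypothetical, and a real treebank would need the count done per sentence, since token IDs restart in each one.

```python
from collections import Counter

# One made-up sentence as (ID, UPOS, HEAD) triples; HEAD 0 is the root.
TOKENS = [
    (1, "VERB", 0),
    (2, "CCONJ", 3),   # leaf: nothing points to token 2
    (3, "VERB", 1),
    (4, "CCONJ", 6),   # has one child (token 5)
    (5, "CCONJ", 4),   # leaf
    (6, "VERB", 1),
]


def child_degrees(tokens, tag):
    """Number of children of every node carrying the given UPOS tag."""
    n_children = Counter(head for _, _, head in tokens)
    return [n_children[tid] for tid, upos, _ in tokens if upos == tag]


degrees = child_degrees(TOKENS, "CCONJ")
print("leaves:", degrees.count(0))
print("highest child degree:", max(degrees))
```

Applied to the figures on this page, the four degree classes (10881 leaves, 81 with one child, 19 with two, 3 with three or more) add up to the 10984 CCONJ tokens.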

Children of CCONJ nodes are attached using 13 different relations: punct (49; 38% instances), fixed (40; 31% instances), nummod (22; 17% instances), conj (6; 5% instances), advmod (2; 2% instances), dep (2; 2% instances), det (2; 2% instances), amod (1; 1% instances), case (1; 1% instances), cc (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances)

Children of CCONJ nodes belong to 11 different parts of speech: PUNCT (49; 38% instances), VERB (24; 19% instances), NUM (23; 18% instances), CCONJ (20; 16% instances), ADP (3; 2% instances), ADV (2; 2% instances), DET (2; 2% instances), NOUN (2; 2% instances), PROPN (2; 2% instances), ADJ (1; 1% instances), X (1; 1% instances)