home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: CCONJ

There are 6 CCONJ lemmas (0%), 47 CCONJ types (0%) and 10953 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 9 in number of lemmas, 14 in number of types and 8 in number of tokens.

The 10 most frequent CCONJ lemmas: e, que, _, mas, ou, logo

The 10 most frequent CCONJ types: e, que, mas, ou, se, quando, como, porque, enquanto, pois

The 10 most frequent ambiguous lemmas: e (CCONJ 6323, VERB 1), que (CCONJ 1889, ADP 2, SCONJ 1), _ (PROPN 32806, ADP 9506, NUM 8462, PRON 7364, DET 4461, NOUN 3563, AUX 2298, CCONJ 1840, PUNCT 1596, VERB 1247, SYM 1008, PART 746, ADJ 703, X 526, ADV 231, SCONJ 1), mas (CCONJ 497, ADV 1), logo (ADV 46, NOUN 2, CCONJ 1)

The 10 most frequent ambiguous types: e (CCONJ 6349, ADJ 14, X 9, ADP 2, AUX 2, DET 1, VERB 1), que (PRON 2962, CCONJ 2230, ADP 115, DET 7, NOUN 3, SCONJ 2, X 1), mas (CCONJ 500, ADV 1), se (PRON 748, PART 390, CCONJ 187, ADP 3), quando (CCONJ 158, ADV 104, ADP 3), como (ADP 731, CCONJ 99, ADV 68), porque (CCONJ 108, ADV 6), enquanto (CCONJ 63, ADV 7, ADP 4), pois (CCONJ 68, PART 1), embora (CCONJ 33, ADV 2)

Morphology

The form / lemma ratio of CCONJ is 7.833333 (the average of all parts of speech is 3.372737).

The 1st highest number of forms (46) was observed with the lemma “_”: &, EO, a, and, animado.Quando, assim, até, because, but, caso, como, conforme, de, do, e, either, embora, enquanto, então, et, he, i.e., logo, mais, mas, mesmo, n, nem, or, ou, pois, porque, porém, q, qua, quando, quanto, que, se, segundo, seja, sem, tampouco, tanto, têm, y.

The 2nd highest number of forms (2) was observed with the lemma “e”: &, e.

The 3rd highest number of forms (1) was observed with the lemma “logo”: Logo.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 12 different relations: cc (7694; 70% instances), mark (2837; 26% instances), fixed (331; 3% instances), dep (28; 0% instances), nummod (22; 0% instances), conj (11; 0% instances), advmod (10; 0% instances), nsubj (6; 0% instances), ccomp (5; 0% instances), obj (5; 0% instances), case (3; 0% instances), nsubj:pass (1; 0% instances)

Parents of CCONJ nodes belong to 14 different parts of speech: VERB (5566; 51% instances), NOUN (2914; 27% instances), PROPN (1341; 12% instances), ADJ (361; 3% instances), NUM (208; 2% instances), ADV (195; 2% instances), ADP (149; 1% instances), PRON (98; 1% instances), DET (32; 0% instances), PART (28; 0% instances), SYM (28; 0% instances), CCONJ (20; 0% instances), X (7; 0% instances), AUX (6; 0% instances)

10852 (99%) CCONJ nodes are leaves.

81 (1%) CCONJ nodes have one child.

17 (0%) CCONJ nodes have two children.

3 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 4.

Children of CCONJ nodes are attached using 13 different relations: punct (45; 36% instances), fixed (40; 32% instances), nummod (22; 18% instances), conj (6; 5% instances), advmod (2; 2% instances), dep (2; 2% instances), det (2; 2% instances), amod (1; 1% instances), case (1; 1% instances), cc (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances)

Children of CCONJ nodes belong to 11 different parts of speech: PUNCT (45; 36% instances), VERB (24; 19% instances), NUM (23; 18% instances), CCONJ (20; 16% instances), ADP (3; 2% instances), ADV (2; 2% instances), DET (2; 2% instances), NOUN (2; 2% instances), PROPN (2; 2% instances), ADJ (1; 1% instances), X (1; 1% instances)