Treebank Statistics: UD_Pomak-Philotis: POS Tags: CCONJ
There are 15 CCONJ lemmas (0%), 17 CCONJ types (0%) and 4491 CCONJ tokens (5%).
Out of 16 observed tags, the rank of CCONJ is: 14 in number of lemmas, 16 in number of types and 7 in number of tokens.
The 10 most frequent CCONJ lemmas: i, alá, am, íli, níta, amá, áma, ni, hem, lǽjkim
The 10 most frequent CCONJ types: i, alá, am, íli, níta, amá, ála, áma, ni, hem
The 10 most frequent ambiguous lemmas: áma (CCONJ 29, INTJ 1), ni (CCONJ 27, PART 5), dalí (ADV 8, PART 8, CCONJ 2), ja (PRON 5141, PART 24, CCONJ 2), a (INTJ 18, CCONJ 1)
The 10 most frequent ambiguous types: ni (CCONJ 27, PART 3, PRON 1), dalí (ADV 8, PART 5, CCONJ 2), ja (PRON 107, AUX 4, CCONJ 2), A (INTJ 17, CCONJ 1)
Morphology
The form / lemma ratio of CCONJ is 1.133333 (the average of all parts of speech is 2.731846).
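The ratio above appears to be the number of CCONJ types divided by the number of CCONJ lemmas. The following minimal sketch reproduces the figure from the counts reported on this page; the variable names are illustrative and not part of any UD tool.

```python
# Counts taken from the statistics on this page.
cconj_lemmas = 15
cconj_types = 17

# Assumption: the "form / lemma ratio" is types divided by lemmas.
ratio = cconj_types / cconj_lemmas
formatted = f"{ratio:.6f}"
print(formatted)  # 1.133333
```

A value this close to 1 means that almost every CCONJ lemma has a single surface form, which is consistent with CCONJ carrying no morphological features.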
The 1st highest number of forms (2) was observed with the lemma “alá”: alá, ála.
The 2nd highest number of forms (2) was observed with the lemma “níta”: níta, níto.
The 3rd highest number of forms (1) was observed with the lemma “a”: A.
CCONJ does not occur with any features.
Relations
CCONJ nodes are attached to their parents using 6 different relations: cc (4466; 99% instances), discourse (18; 0% instances), obj (4; 0% instances), advmod (1; 0% instances), mark (1; 0% instances), root (1; 0% instances)
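The percentages in the relation list can be re-derived from the raw counts. The sketch below assumes that each share is the count divided by the total number of CCONJ tokens, rounded to the nearest whole percent; that rounding rule is an assumption that happens to agree with the figures shown here.

```python
# Relation counts copied from the list above.
relation_counts = {
    "cc": 4466,
    "discourse": 18,
    "obj": 4,
    "advmod": 1,
    "mark": 1,
    "root": 1,
}

# The counts should add up to the number of CCONJ tokens (4491).
total = sum(relation_counts.values())

# Assumption: percentages are rounded to the nearest integer.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```

Every relation other than cc accounts for less than half a percent of the instances, which is why each of them is displayed as 0%.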
Parents of CCONJ nodes belong to 10 different parts of speech: VERB (3394; 76% instances), NOUN (720; 16% instances), ADJ (118; 3% instances), PRON (67; 1% instances), PROPN (54; 1% instances), DET (44; 1% instances), NUM (36; 1% instances), ADV (32; 1% instances), PART (25; 1% instances), [no part of speech; the artificial root] (1; 0% instances)
4488 (100%) CCONJ nodes are leaves.
2 (0%) CCONJ nodes have one child.
1 (0%) CCONJ nodes have two children.
The highest child degree of a CCONJ node is 2.
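Leaf and child-degree counts like the ones above can be computed directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `cconj_child_degrees` and the three-token English sample are hypothetical and serve only to illustrate the counting.

```python
from collections import Counter

# Hypothetical toy sentence in CoNLL-U format (not taken from the treebank).
SAMPLE = """\
1\tcats\tcat\tNOUN\t_\t_\t0\troot\t_\t_
2\tand\tand\tCCONJ\t_\t_\t3\tcc\t_\t_
3\tdogs\tdog\tNOUN\t_\t_\t1\tconj\t_\t_
"""

def cconj_child_degrees(conllu_text):
    """Map each child count to the number of CCONJ nodes having that many children."""
    degrees = Counter()
    sentence = []
    # The trailing empty string flushes the last sentence.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue  # sentence-level comment
        if line.strip():
            cols = line.split("\t")
            if cols[0].isdigit():  # skip multiword ranges and empty nodes
                sentence.append(cols)
            continue
        # Blank line: the sentence is complete, so count children per head ID.
        children = Counter(cols[6] for cols in sentence)
        for cols in sentence:
            if cols[3] == "CCONJ":
                degrees[children[cols[0]]] += 1
        sentence = []
    return degrees

print(cconj_child_degrees(SAMPLE))
```

In the sample, the only CCONJ node has no dependents, so it is counted as a leaf (degree 0). Run over the whole treebank, the same counting would be expected to yield the distribution listed above, assuming the file follows the standard ten-column CoNLL-U layout.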
Children of CCONJ nodes are attached using 3 different relations: punct (2; 50% instances), advmod (1; 25% instances), nsubj (1; 25% instances)
Children of CCONJ nodes belong to 3 different parts of speech: PUNCT (2; 50% instances), ADV (1; 25% instances), NOUN (1; 25% instances)