home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: SCONJ

There are 14 SCONJ lemmas (0%), 1 SCONJ types (6%) and 16614 SCONJ tokens (2%). Out of 16 observed tags, the rank of SCONJ is: 11 in number of lemmas, 14 in number of types and 10 in number of tokens.

The 10 most frequent SCONJ lemmas: _، mA، w، l، f، ,، b، k، :، TBupdate

The 10 most frequent SCONJ types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 221327, PUNCT 71973, ADJ 68841, ADP 62617, VERB 55127, PROPN 48391, ADV 23955, SCONJ 15652, NUM 15105, PRON 12926, AUX 6881, DET 6354, CCONJ 3889, PART 1501, X 380, INTJ 56), mA (SCONJ 667, PRON 320, ADV 5, VERB 1), w (CCONJ 43819, SCONJ 235, ADP 42, NOUN 41, VERB 40, ADJ 14, PRON 12, PROPN 9, DET 4, PART 3, NUM 2, PUNCT 2, X 2), l (ADP 15628, PART 165, NOUN 29, SCONJ 28, ADV 2, VERB 2, ADJ 1, DET 1, NUM 1, PROPN 1, PUNCT 1, X 1), f (CCONJ 1360, PART 815, SCONJ 18, PRON 4, VERB 3, NOUN 2, ADP 1, ADV 1, PUNCT 1), , (PUNCT 254, CCONJ 68, NOUN 9, ADJ 8, ADP 7, NUM 5, SCONJ 4, VERB 4, PRON 3, PROPN 3, ADV 2, DET 1, PART 1), b (ADP 12334, NOUN 21, DET 2, PRON 2, SCONJ 2, X 2, ADJ 1, VERB 1), k (ADP 1066, PRON 213, SCONJ 2, NOUN 1, PROPN 1, PUNCT 1, VERB 1), : (PUNCT 2347, SCONJ 1, VERB 1), TBupdate (NOUN 408, ADJ 340, VERB 268, X 190, PROPN 69, PUNCT 15, ADP 1, SCONJ 1)

The 10 most frequent ambiguous types: _ (NOUN 221899, ADP 91743, PUNCT 75266, ADJ 69355, PROPN 57421, VERB 55469, CCONJ 49161, PRON 43495, ADV 24067, SCONJ 16614, NUM 15377, AUX 9155, DET 6363, PART 2521, X 927, INTJ 56)

Morphology

The form / lemma ratio of SCONJ is 0.071429 (the average of all parts of speech is 0.003044).

The 1st highest number of forms (1) was observed with the lemma “,”: _.

The 2nd highest number of forms (1) was observed with the lemma “:”: _.

The 3rd highest number of forms (1) was observed with the lemma “TBupdate”: _.

SCONJ occurs with 9 features: Gender (25; 0% instances), Number (25; 0% instances), AdpType (21; 0% instances), Definite (16; 0% instances), Person (13; 0% instances), Case (11; 0% instances), Mood (8; 0% instances), Voice (8; 0% instances), Polarity (6; 0% instances)

SCONJ occurs with 19 feature-value pairs: AdpType=Prep, Case=Acc, Case=Gen, Case=Nom, Definite=Com, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Mood=Jus, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Voice=Act, Voice=Pass

SCONJ occurs with 21 feature combinations. The most frequent feature combination is _ (16562 tokens). Examples: _

Relations

SCONJ nodes are attached to their parents using 4 different relations: mark (16609; 100% instances), dep (3; 0% instances), obj (1; 0% instances), root (1; 0% instances)

Parents of SCONJ nodes belong to 14 different parts of speech: NOUN (6370; 38% instances), VERB (5841; 35% instances), PRON (2666; 16% instances), PROPN (828; 5% instances), ADJ (431; 3% instances), ADV (315; 2% instances), DET (92; 1% instances), NUM (37; 0% instances), X (13; 0% instances), AUX (9; 0% instances), PART (5; 0% instances), CCONJ (4; 0% instances), SCONJ (2; 0% instances), (1; 0% instances)

16609 (100%) SCONJ nodes are leaves.

3 (0%) SCONJ nodes have one child.

0 (0%) SCONJ nodes have two children.

2 (0%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 9.

Children of SCONJ nodes are attached using 8 different relations: punct (5; 28% instances), ccomp (3; 17% instances), dep (3; 17% instances), mark (2; 11% instances), nmod (2; 11% instances), cc (1; 6% instances), cop (1; 6% instances), obj (1; 6% instances)

Children of SCONJ nodes belong to 8 different parts of speech: PUNCT (5; 28% instances), NOUN (3; 17% instances), VERB (3; 17% instances), PROPN (2; 11% instances), SCONJ (2; 11% instances), ADJ (1; 6% instances), AUX (1; 6% instances), CCONJ (1; 6% instances)