home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sindhi-Isra: POS Tags: SCONJ

There are 19 SCONJ lemmas (0%), 24 SCONJ types (0%) and 2391 SCONJ tokens (3%). Out of 15 observed tags, the rank of SCONJ is: 12 in number of lemmas, 12 in number of types and 10 in number of tokens.

The 10 most frequent SCONJ lemmas: ته, پر, جيڪڏهن, تنهنڪري, جو, ڇاڪاڻ, تہ, بلڪه, جي, سو

The 10 most frequent SCONJ types: ته, پر, جيڪڏهن, تنهنڪري, جو, ڇاڪاڻ, تہ, بلڪه, جي, سو

The 10 most frequent ambiguous lemmas: ته (SCONJ 1692, PART 284), پر (SCONJ 373, CCONJ 10, ADJ 1), جو (ADP 1855, SCONJ 50, DET 12, PRON 1), تہ (SCONJ 22, PART 5), جي (ADP 3320, PRON 73, NOUN 12, SCONJ 11, DET 5, CCONJ 1), سو (PRON 54, SCONJ 11, ADJ 5), _ (NOUN 3847, PROPN 838, VERB 171, ADJ 67, NUM 58, ADV 26, PART 24, ADP 22, PUNCT 22, AUX 16, PRON 13, DET 12, SCONJ 9, INTJ 2), جيتوڻيڪ (SCONJ 9, ADV 4), ڇو (ADV 42, PRON 37, SCONJ 4), جنهنڪري (ADV 7, SCONJ 2)

The 10 most frequent ambiguous types: ته (SCONJ 1692, PART 284), پر (SCONJ 373, CCONJ 10, ADJ 1), جو (ADP 1351, SCONJ 50, DET 12), تہ (SCONJ 22, PART 5), جي (ADP 3320, NOUN 12, SCONJ 11, PRON 2, CCONJ 1, PROPN 1), سو (PRON 53, SCONJ 11), جيتوڻيڪ (SCONJ 9, ADV 4), ڇو (ADV 42, PRON 37, SCONJ 4), پوءِ (ADP 135, ADV 105, SCONJ 3), انڪري (ADV 8, SCONJ 2)

Morphology

The form / lemma ratio of SCONJ is 1.263158 (the average of all parts of speech is 1.870009).

The 1st highest number of forms (6) was observed with the lemma “_”: البت, بشرطيڪه, جيڪڏھن, ليڪن, پوءِ, ڪه.

The 2nd highest number of forms (1) was observed with the lemma “انڪري”: انڪري.

The 3rd highest number of forms (1) was observed with the lemma “بلڪ”: بلڪ.

SCONJ occurs with 1 features: ExtPos (22; 1% instances)

SCONJ occurs with 1 feature-value pairs: ExtPos=SCONJ

SCONJ occurs with 2 feature combinations. The most frequent feature combination is _ (2369 tokens). Examples: ته, پر, جيڪڏهن, تنهنڪري, جو, تہ, بلڪه, جي, سو, جيتوڻيڪ

Relations

SCONJ nodes are attached to their parents using 7 different relations: mark (2309; 97% instances), fixed (56; 2% instances), dep (16; 1% instances), obj (5; 0% instances), cc (2; 0% instances), obl (2; 0% instances), case (1; 0% instances)

Parents of SCONJ nodes belong to 12 different parts of speech: VERB (1833; 77% instances), NOUN (314; 13% instances), ADJ (104; 4% instances), ADV (40; 2% instances), SCONJ (27; 1% instances), AUX (23; 1% instances), DET (21; 1% instances), PRON (14; 1% instances), PROPN (7; 0% instances), PART (4; 0% instances), NUM (3; 0% instances), ADP (1; 0% instances)

2352 (98%) SCONJ nodes are leaves.

38 (2%) SCONJ nodes have one child.

1 (0%) SCONJ nodes have two children.

The highest child degree of a SCONJ node is 2.

Children of SCONJ nodes are attached using 4 different relations: fixed (32; 80% instances), nmod (5; 13% instances), punct (2; 5% instances), advmod:emph (1; 3% instances)

Children of SCONJ nodes belong to 4 different parts of speech: SCONJ (27; 68% instances), PART (6; 15% instances), PRON (5; 13% instances), PUNCT (2; 5% instances)