Treebank Statistics: UD_Bororo-BDT: POS Tags: SCONJ
There are 137 SCONJ lemmas (1%), 139 SCONJ types (1%) and 1263 SCONJ tokens (1%).
Out of 17 observed tags, the rank of SCONJ is: 9 in number of lemmas, 9 in number of types and 11 in number of tokens.
The 10 most frequent SCONJ lemmas: kodi, du, dutabo, dukeje, _, keje, wo, koiare, boe, bu
The 10 most frequent SCONJ types: kodi, dukeje, dutabo, kodire, dukejere, iwo, kejere, koiare, bukeje, Boe
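The lemma, type and token counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the function names `read_tokens` and `pos_summary` are hypothetical, the sample is a toy fragment invented for illustration, and types are assumed to be case-sensitive word forms (the lists above keep "Boe" as a type distinct from the lemma "boe").

```python
from collections import Counter

# Toy CoNLL-U fragment invented for illustration; not a sentence from the treebank.
SAMPLE = "\n".join([
    "# text = (toy example)",
    "1\tkodi\tkodi\tSCONJ\t_\t_\t3\tmark\t_\t_",
    "2\tboe\tboe\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "3\tkodire\tkodi\tSCONJ\t_\tMood=Ind\t0\troot\t_\t_",
    "4\tkodi\tkodi\tSCONJ\t_\t_\t3\tmark\t_\t_",
    "",
])


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each basic token in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Keep only plain integer IDs; this skips multiword ranges (1-2)
        # and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_summary(conllu_text, upos, top=10):
    """Count distinct lemmas, distinct forms (types) and tokens for one tag."""
    forms, lemmas = Counter(), Counter()
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            forms[form] += 1
            lemmas[lemma] += 1
    return {
        "lemmas": len(lemmas),
        "types": len(forms),
        "tokens": sum(forms.values()),
        "top_lemmas": [lemma for lemma, _ in lemmas.most_common(top)],
        "top_types": [form for form, _ in forms.most_common(top)],
    }


summary = pos_summary(SAMPLE, "SCONJ")
print(summary)
```

On the toy fragment this gives one lemma, two types and three tokens; run over the full treebank file, the same counting should correspond to the 137 / 139 / 1263 figures, assuming the page counts basic tokens only.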
The 10 most frequent ambiguous lemmas: kodi (SCONJ 354, ADV 222, VERB 62, PRON 42, NOUN 10, ADP 3), du (ADV 586, NOUN 407, SCONJ 263, VERB 197, PART 45, PROPN 18, X 11, PRON 2), dutabo (SCONJ 122, ADV 17), dukeje (ADV 133, SCONJ 82, X 50, VERB 20, NOUN 4), _ (NOUN 5910, VERB 3398, ADV 1856, PRON 1359, ADP 1308, PROPN 1165, X 926, PUNCT 459, DET 149, INTJ 122, SCONJ 55, CCONJ 30, PART 29), keje (ADP 1500, VERB 85, NOUN 50, SCONJ 45, PRON 1), wo (NOUN 41, SCONJ 38, VERB 3), koiare (ADV 30, SCONJ 22, ADP 1), boe (NOUN 2742, VERB 78, SCONJ 21, X 19, PRON 12, ADV 4), bu (VERB 194, NOUN 155, SCONJ 20)
The 10 most frequent ambiguous types: dukeje (SCONJ 305, ADP 56, X 50), dutabo (SCONJ 122, ADV 81, ADP 30), kodire (ADV 121, SCONJ 73, VERB 26), dukejere (SCONJ 52, ADV 7, VERB 7), iwo (SCONJ 38, PRON 25, VERB 18, NOUN 2), kejere (VERB 73, ADV 40, SCONJ 36, ADP 11), koiare (VERB 44, SCONJ 22, ADP 20, ADV 20, NOUN 3), bukeje (SCONJ 22, ADV 4), Boe (NOUN 724, ADV 52, X 29, SCONJ 14, PRON 4), mato (ADV 267, NOUN 43, VERB 10, SCONJ 9)
- dukeje
- dutabo
- kodire
- dukejere
- iwo
- kejere
- koiare
  - VERB 44: 9Aeku koiare umode aro pegado ma , kode , tawuje , barigudu jaedo apiji .
  - SCONJ 22: 2 Nowu inodu jamedu koiare ure imedu rorogodudo .
  - ADP 20: Mare paga karega , tudu koiare ure boedo boe pawuje iji .
  - ADV 20: Inagore ju boi koiare iro oino awogai .
  - NOUN 3: 2 Adão koiare boe eiamedu boe ewi aregodure .
- bukeje
- Boe
- mato
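The ambiguity lists above pair each lemma or type with the tags it receives. A minimal sketch of that tally follows. The function name `ambiguous_items` is hypothetical, and the ordering rule (descending count of the tag of interest) is an assumption inferred from the type list above, where the SCONJ counts run 305, 122, 73, 52 and so on. The input pairs are rebuilt from two of the reported type counts ("iwo" and "bukeje") plus one made-up unambiguous item.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, upos, limit=10):
    """Return items seen with `upos` and at least one other tag.

    `pairs` is an iterable of (item, tag), where item is a form or a lemma.
    Items are ordered by how often they carry `upos`, most frequent first.
    """
    tags_by_item = defaultdict(Counter)
    for item, tag in pairs:
        tags_by_item[item][tag] += 1
    candidates = [
        (item, tags)
        for item, tags in tags_by_item.items()
        if len(tags) > 1 and tags[upos] > 0
    ]
    candidates.sort(key=lambda entry: -entry[1][upos])
    return [(item, tags.most_common()) for item, tags in candidates[:limit]]


# Pairs rebuilt from the counts reported above for two types.
pairs = (
    [("bukeje", "SCONJ")] * 22 + [("bukeje", "ADV")] * 4
    + [("iwo", "SCONJ")] * 38 + [("iwo", "PRON")] * 25
    + [("iwo", "VERB")] * 18 + [("iwo", "NOUN")] * 2
    + [("only_sconj", "SCONJ")] * 5  # made-up unambiguous item, filtered out
)

result = ambiguous_items(pairs, "SCONJ")
for item, tag_counts in result:
    print(item, ", ".join(f"{tag} {n}" for tag, n in tag_counts))
```

Passing (lemma, tag) pairs instead of (form, tag) pairs yields the lemma-level list with the same function.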
Morphology
The form / lemma ratio of SCONJ is 1.014599 (the average of all parts of speech is 1.360106).
The 1st highest number of forms (6) was observed with the lemma “_”: Sodoma, bakuguma, bore, dutabore, kodi, pobore.
The 2nd highest number of forms (4) was observed with the lemma “kodi”: kodi, kodie, kodire, koduie.
The 3rd highest number of forms (2) was observed with the lemma “boe”: Boe, Boere.
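The reported form / lemma ratio matches the number of distinct SCONJ types divided by the number of distinct SCONJ lemmas given at the top of this page. The short check below works through that arithmetic; treating the ratio as types over lemmas is an assumption based on the numbers agreeing, not on a documented definition. Note that summing the forms listed per lemma would give a larger figure, because one form (for example "kodi") can be filed under more than one lemma.

```python
# Counts taken from the summary at the top of this page.
SCONJ_TYPES = 139
SCONJ_LEMMAS = 137

# Assumed definition: distinct forms divided by distinct lemmas.
form_lemma_ratio = SCONJ_TYPES / SCONJ_LEMMAS
print(f"{form_lemma_ratio:.6f}")

# Extra forms implied by the three lemmas listed above (6, 4 and 2 forms):
# each lemma contributes (forms - 1) beyond its first form.
extra_forms_listed = (6 - 1) + (4 - 1) + (2 - 1)
# Only 2 more types than lemmas exist overall, so some forms must be
# shared between lemmas.
extra_types_overall = SCONJ_TYPES - SCONJ_LEMMAS
print(extra_forms_listed, extra_types_overall)
```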
SCONJ occurs with 4 features: Mood (225; 18% instances), Number (38; 3% instances), Person (38; 3% instances), Speech (27; 2% instances)
SCONJ occurs with 4 feature-value pairs: Mood=Ind, Number=Sing, Person=1, Speech=Ind
SCONJ occurs with 4 feature combinations.
The most frequent feature combination is _ (973 tokens).
Examples: kodi, dukeje, dutabo, koiare, bukeje, Boe, mato, dukejere, dukoiare, kaere
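The feature percentages above are shares of all 1263 SCONJ tokens, rounded to whole percent. The sketch below recomputes them and checks that the counts add up. One reading that makes the totals consistent is that Number=Sing and Person=1 always occur together on the same 38 tokens, which would give exactly four combinations (none, Mood=Ind, Number=Sing|Person=1, Speech=Ind); that reading is an inference from the arithmetic, not something the page states.

```python
TOTAL_SCONJ_TOKENS = 1263
NO_FEATURE_TOKENS = 973  # tokens with the empty combination "_"

feature_counts = {"Mood": 225, "Number": 38, "Person": 38, "Speech": 27}


def share(count, total):
    """Percentage of `total`, rounded to a whole number."""
    return round(100 * count / total)


shares = {name: share(n, TOTAL_SCONJ_TOKENS) for name, n in feature_counts.items()}
print(shares)

# Tokens that carry at least one feature.
tokens_with_features = TOTAL_SCONJ_TOKENS - NO_FEATURE_TOKENS

# Assumption: Number and Person share the same 38 tokens, so count them once.
assumed_combination_total = (
    feature_counts["Mood"] + feature_counts["Number"] + feature_counts["Speech"]
)
print(tokens_with_features, assumed_combination_total)
```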
Relations
SCONJ nodes are attached to their parents using 8 different relations: mark (998; 79% instances), case (162; 13% instances), nsubj (56; 4% instances), obl (20; 2% instances), root (18; 1% instances), nmod (6; 0% instances), dep (2; 0% instances), obj (1; 0% instances)
Parents of SCONJ nodes belong to 14 different parts of speech: VERB (902; 71% instances), NOUN (149; 12% instances), ADV (49; 4% instances), PRON (43; 3% instances), NUM (30; 2% instances), PROPN (26; 2% instances), (18; 1% instances), X (14; 1% instances), ADP (12; 1% instances), AUX (6; 0% instances), DET (5; 0% instances), INTJ (5; 0% instances), CCONJ (2; 0% instances), PART (2; 0% instances)
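Relation and parent statistics like those above come from the HEAD and DEPREL columns of the CoNLL-U file. The following is a minimal sketch with hypothetical function names (`parse_sentences`, `attachment_stats`) and a toy two-sentence sample invented for illustration. It assumes every basic token has an integer HEAD. A token attached to the artificial root (HEAD 0) has no parent word, so its parent tag is recorded as an empty string; this would explain the unnamed entry with 18 instances in the parent list, which equals the number of root attachments.

```python
from collections import Counter

# Toy sample invented for illustration; forms are placeholders.
SAMPLE = "\n".join([
    "1\tw1\tw1\tSCONJ\t_\t_\t2\tmark\t_\t_",
    "2\tw2\tw2\tVERB\t_\t_\t0\troot\t_\t_",
    "",
    "1\tw3\tw3\tSCONJ\t_\t_\t0\troot\t_\t_",
    "2\tw4\tw4\tNOUN\t_\t_\t1\tnsubj\t_\t_",
    "",
])


def parse_sentences(conllu_text):
    """Yield one dict per sentence: token id -> (upos, head, deprel)."""
    sentence = {}
    for line in conllu_text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
                sentence = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Basic tokens only; assumes HEAD (column 7) is an integer.
        if len(cols) == 10 and cols[0].isdigit():
            sentence[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
    if sentence:
        yield sentence


def attachment_stats(conllu_text, upos):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sentence in parse_sentences(conllu_text):
        for tag, head, deprel in sentence.values():
            if tag != upos:
                continue
            relations[deprel] += 1
            parent = sentence.get(head)  # None when head == 0 (root)
            parent_tags[parent[0] if parent else ""] += 1
    return relations, parent_tags


relations, parent_tags = attachment_stats(SAMPLE, "SCONJ")
print(relations, parent_tags)
```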
1208 (96%) SCONJ nodes are leaves.
35 (3%) SCONJ nodes have one child.
12 (1%) SCONJ nodes have two children.
8 (1%) SCONJ nodes have three or more children.
The highest child degree of a SCONJ node is 5.
Children of SCONJ nodes are attached using 11 different relations: punct (20; 22% instances), nsubj (15; 17% instances), nmod (10; 11% instances), obl (10; 11% instances), advmod (9; 10% instances), det (9; 10% instances), dep (8; 9% instances), case (3; 3% instances), obj (2; 2% instances), parataxis (2; 2% instances), cc (1; 1% instances)
Children of SCONJ nodes belong to 12 different parts of speech: PUNCT (20; 22% instances), NOUN (19; 21% instances), PRON (12; 13% instances), ADV (11; 12% instances), DET (9; 10% instances), ADP (5; 6% instances), PROPN (5; 6% instances), NUM (4; 4% instances), AUX (1; 1% instances), CCONJ (1; 1% instances), VERB (1; 1% instances), X (1; 1% instances)
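The child statistics above can be cross-checked against each other. The sketch below uses only numbers reported on this page: the degree buckets should sum to the 1263 SCONJ tokens, the two child lists should describe the same set of children, and the children not accounted for by one-child and two-child nodes must fit within what eight nodes with three to five children can hold. The variable names are illustrative.

```python
TOTAL_SCONJ_TOKENS = 1263
MAX_CHILD_DEGREE = 5

# Number of SCONJ nodes by how many children they have.
nodes_by_degree = {"0": 1208, "1": 35, "2": 12, "3+": 8}

children_by_relation = {
    "punct": 20, "nsubj": 15, "nmod": 10, "obl": 10, "advmod": 9, "det": 9,
    "dep": 8, "case": 3, "obj": 2, "parataxis": 2, "cc": 1,
}
children_by_pos = {
    "PUNCT": 20, "NOUN": 19, "PRON": 12, "ADV": 11, "DET": 9, "ADP": 5,
    "PROPN": 5, "NUM": 4, "AUX": 1, "CCONJ": 1, "VERB": 1, "X": 1,
}

total_nodes = sum(nodes_by_degree.values())
total_children = sum(children_by_relation.values())
total_children_by_pos = sum(children_by_pos.values())

# Children of nodes with exactly one or exactly two children.
known_children = nodes_by_degree["1"] * 1 + nodes_by_degree["2"] * 2
# Whatever is left must belong to the "3+" nodes.
remaining_children = total_children - known_children
lower_bound = nodes_by_degree["3+"] * 3
upper_bound = nodes_by_degree["3+"] * MAX_CHILD_DEGREE
is_consistent = lower_bound <= remaining_children <= upper_bound

print(total_nodes, total_children, total_children_by_pos)
print(remaining_children, (lower_bound, upper_bound), is_consistent)
```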