Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: POS Tags: SCONJ
There are 56 SCONJ lemmas (1%), 61 SCONJ types (1%) and 1064 SCONJ tokens (1%).
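Counts of this kind can be reproduced from a CoNLL-U file by collecting the distinct lemmas and distinct word forms (types) of every token carrying the tag. The following is a minimal sketch, not the script that generated this page: the function name `count_upos` and the three-token sample are hypothetical, and it assumes forms are compared exactly as written (no case folding).

```python
# Hypothetical toy sample in CoNLL-U column order
# (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
# It is illustrative only and is not taken from the treebank.
SAMPLE = (
    "1\tnuair\tnuair\tSCONJ\t_\t_\t3\tmark\t_\t_\n"
    "2\ta\ta\tPART\t_\t_\t3\tmark:prt\t_\t_\n"
    "3\tbha\tbi\tVERB\t_\t_\t0\troot\t_\t_\n"
    "4\t's\tis\tSCONJ\t_\t_\t5\tmark\t_\t_\n"
    "5\tis\tis\tSCONJ\t_\t_\t3\tmark\t_\t_\n"
)


def count_upos(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            types.add(cols[1])
            lemmas.add(cols[2])
            tokens += 1
    return len(lemmas), len(types), tokens


n_lemmas, n_types, n_tokens = count_upos(SAMPLE, "SCONJ")
print(n_lemmas, n_types, n_tokens)
```

On the toy sample the two forms 's and is share the lemma is, so the sketch finds fewer lemmas than types, which is the same relationship as 56 lemmas against 61 types above.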
Out of 17 observed tags, the rank of SCONJ is: 10 in number of lemmas, 10 in number of types and 14 in number of tokens.
The 10 most frequent SCONJ lemmas: mar, nuair, ma, is, ged, far, agus, gus, mur, mus
The 10 most frequent SCONJ types: mar, nuair, ma, ged, far, ‘s, agus, gus, mur, mus
The 10 most frequent ambiguous lemmas: mar (SCONJ 197, ADP 167, NOUN 11, PRON 5, ADV 4, VERB 1, X 1), ma (SCONJ 113, ADP 15, ADV 10, ADJ 2), is (AUX 1149, CCONJ 1031, SCONJ 77, PART 1), far (SCONJ 62, ADP 14, NOUN 1), agus (CCONJ 1532, SCONJ 47), gus (SCONJ 46, ADP 24), uair (NOUN 60, ADV 29, SCONJ 20), an (DET 5347, ADP 2929, ADV 311, PART 219, PRON 94, ADJ 27, SCONJ 18, INTJ 2, NOUN 1, X 1), mun (ADP 20, SCONJ 18), bho (ADP 279, SCONJ 15, NOUN 6)
The 10 most frequent ambiguous types: mar (SCONJ 186, ADP 149, NOUN 11, PRON 5, ADV 4, X 1), ma (SCONJ 98, ADP 15, ADV 10, ADJ 2), far (SCONJ 62, ADP 14, NOUN 1), ’s (CCONJ 655, AUX 235, SCONJ 53, ADP 5, PART 1), agus (CCONJ 1463, SCONJ 47), gus (SCONJ 46, ADP 23), is (CCONJ 255, AUX 236, SCONJ 22), uair (NOUN 50, ADV 29, SCONJ 20), mun (ADP 20, SCONJ 18), an (DET 2298, ADP 1699, ADV 293, PART 212, PRON 94, AUX 37, ADJ 27, SCONJ 15, NOUN 1, X 1)
- mar
  - SCONJ 186: uill ‘s iongantach mar a chuala
  - ADP 149: Chuala e i a’ snotairich mar each aig ceann sgrìob ghoirid .
  - NOUN 11: och uill faodaidh tusa air mar a bhios tu ‘cluinntinn a’s a’ mhedia
  - PRON 5: dè mar a tha iad còmhla ri iadsan anns an taigh ‘s dè dè na
  - ADV 4: B’ iad daoine cho càirdeil ’s cho làn de dheagh bheus ’s a dh’iarradh tu , ’s bha bailtean beaga eile air feadh na Gaidhealtachd ’s nan Eilean aig an àm ud mar an ceudna .
  - X 1: ò ma dh’fhaoidte gum bi i ma dh’fhaoidte nach bi i aig sibh ach ‘son dhà no trì bhliadhnaichean ach mar mar as trice bidh iad aig sibh fichead bliadhna suas gu fichead bliadhna
- ma
  - SCONJ 98: Ach ma dh’fhaoidte thèid d’ fhuadachadh air falbh às a-seo buileach “ .
  - ADP 15: thà dìreach ma chòig seachdainn a [Name]
  - ADV 10: Tha dùil gum mol e gearradh de ma naoi deug not anns an bhliadhna thar nan còig bliadhna ro sinn .
  - ADJ 2: lìnigeadh an taighe loisg àsan loisg iad a h-uile sgath de lìnigeadh an taighe ma dheireadh
- far
- ’s
  - CCONJ 655: ’s dè fhuair thu le e ?
  - AUX 235: ò ’s an e
  - SCONJ 53: tha dà bhàta aig e ag iasgach ’s a h-uile duine eile a’ feitheamh
  - ADP 5: ò uill mar a bha an t-àite ’s a robh e ro e nuair a bha e ann an [Placename]
  - PART 1: Mar a thuirt Samaidh , “ Is sinne ’s fhaide a mhaireas a dh’aindeoin deacaireachd . “
- agus
- gus
- is
- uair
- mun
- an
  - DET 2298: agus ciamar a bha a’ homework an do choimhead an tidsear ri e ?
  - ADP 1699: tha thusa an do ghille mhòr a-neist a [Name]
  - ADV 293: a bheil an cnatan air duine sam bith eile thall an sin a [Name] ?
  - PART 212: turkey burger an robh e math ?
  - PRON 94: chan eil duine ag an draibheadh a-nisd
  - AUX 37: an toil le thu mince pies ?
  - ADJ 27: bha iad dìreach a’ tighinn a-staigh à taobh an iar
  - SCONJ 15: Bha ‘n t-itealan air a slighe eadar Casablanca agus Tunis an uair a chaidh a gabhail thairis .
  - NOUN 1: B’ an e an Cataibh ‘s air taobh an ear shiorrachdan Rois is Inbhir Nis a bu làidire a ghreimich an creideamh soisgeulach anns an ochdamh linn deug .
  - X 1: Ann an sgrìobhaidhean Èireannach tha i air a h-ainmeachadh mar Eachtra an Cheatharnaigh Chaoilriabhaigh no Eachtra Cheatharnaigh Uí Dhomhnaill
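The ambiguous-type listing above pairs each word form with the number of times it carries each tag, and the forms appear in decreasing order of their SCONJ count (mar 186, ma 98, far 62, and so on). A minimal sketch of that tabulation follows; the function name `ambiguous_types` and the toy token list are hypothetical, and ordering by the count for the tag of interest is an assumption read off the listing rather than a documented rule.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged_tokens, upos):
    """List forms seen with `upos` and at least one other tag.

    tagged_tokens: iterable of (form, tag) pairs.
    Returns (form, Counter of tags) pairs, highest `upos` count first.
    """
    by_form = defaultdict(Counter)
    for form, tag in tagged_tokens:
        by_form[form][tag] += 1
    hits = [
        (form, tags)
        for form, tags in by_form.items()
        if len(tags) > 1 and upos in tags
    ]
    return sorted(hits, key=lambda item: -item[1][upos])


# Hypothetical toy input, not real treebank counts.
tokens = [
    ("mar", "SCONJ"), ("mar", "ADP"), ("mar", "SCONJ"),
    ("gus", "SCONJ"), ("gus", "ADP"),
    ("nuair", "SCONJ"),  # only ever SCONJ, so not ambiguous
]

for form, tags in ambiguous_types(tokens, "SCONJ"):
    print(form, dict(tags))
```

The same function would produce the ambiguous-lemma listing if it were fed (lemma, tag) pairs instead of (form, tag) pairs.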
Morphology
The form / lemma ratio of SCONJ is 1.089286 (the average of all parts of speech is 1.311377).
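The ratio is simply the number of distinct SCONJ forms (types) divided by the number of distinct SCONJ lemmas, both reported at the top of this page. As a quick check, with variable names chosen here for illustration:

```python
# Figures taken from the counts reported on this page.
sconj_types = 61   # distinct word forms tagged SCONJ
sconj_lemmas = 56  # distinct lemmas tagged SCONJ

ratio = sconj_types / sconj_lemmas
print(f"{ratio:.6f}")  # prints 1.089286
```

A value this close to 1 means most SCONJ lemmas appear in a single written form, in contrast to the treebank-wide average of 1.311377.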
The 1st highest number of forms (3) was observed with the lemma “is”: ’s, is, ’s.
The 2nd highest number of forms (2) was observed with the lemma “’ar”: ‘ar, ‘ar.
The 3rd highest number of forms (2) was observed with the lemma “an”: ‘n, an.
SCONJ occurs with 2 features: ExtPos (60; 6% instances), Typo (5; 0% instances)
SCONJ occurs with 3 feature-value pairs: ExtPos=ADV, ExtPos=SCONJ, Typo=Yes
SCONJ occurs with 4 feature combinations.
The most frequent feature combination is _ (999 tokens).
Examples: mar, nuair, ma, ged, far, ‘s, agus, gus, mur, mus
Relations
SCONJ nodes are attached to their parents using 6 different relations: mark (1019; 96% instances), fixed (41; 4% instances), advcl (1; 0% instances), advmod (1; 0% instances), case (1; 0% instances), reparandum (1; 0% instances)
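The percentages are each relation's share of all 1064 SCONJ tokens, rounded to a whole percent, which is why the four relations observed once each are shown as 0%. A short sketch of the arithmetic, using the counts listed above (the dictionary name is illustrative):

```python
# Relation counts for SCONJ nodes, copied from the line above.
relation_counts = {
    "mark": 1019,
    "fixed": 41,
    "advcl": 1,
    "advmod": 1,
    "case": 1,
    "reparandum": 1,
}

total = sum(relation_counts.values())  # equals the SCONJ token count
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in shares.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```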
Parents of SCONJ nodes belong to 10 different parts of speech: VERB (811; 76% instances), NOUN (104; 10% instances), PRON (76; 7% instances), SCONJ (42; 4% instances), ADJ (13; 1% instances), PROPN (12; 1% instances), ADV (2; 0% instances), PART (2; 0% instances), NUM (1; 0% instances), X (1; 0% instances)
1001 (94%) SCONJ nodes are leaves.
49 (5%) SCONJ nodes have one child.
14 (1%) SCONJ nodes have two children.
The highest child degree of a SCONJ node is 2.
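The child degree of a node is the number of tokens whose HEAD column points at it, so leaves are the nodes with degree 0. A minimal sketch of how such a histogram could be built for one sentence; the function name `child_degree_histogram` and the four-token input are hypothetical and not drawn from the treebank.

```python
from collections import Counter


def child_degree_histogram(rows, upos):
    """Count how many nodes tagged `upos` have 0, 1, 2, ... children.

    rows: list of (token_id, upos_tag, head_id) tuples for one sentence.
    """
    # Number of dependents per head id; ids that never occur as a head
    # are simply absent, and Counter returns 0 for them on lookup.
    n_children = Counter(head for _, _, head in rows)
    return Counter(
        n_children[token_id] for token_id, tag, _ in rows if tag == upos
    )


# Hypothetical sentence: token 2 attaches to token 1 (e.g. as "fixed").
rows = [
    (1, "SCONJ", 3),
    (2, "SCONJ", 1),
    (3, "VERB", 0),
    (4, "SCONJ", 3),
]

hist = child_degree_histogram(rows, "SCONJ")
print(dict(hist))
```

Summing such histograms over every sentence would give the leaf, one-child and two-children figures above, which together account for all 1064 SCONJ tokens (1001 + 49 + 14).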
Children of SCONJ nodes are attached using 3 different relations: fixed (73; 95% instances), punct (3; 4% instances), reparandum (1; 1% instances)
Children of SCONJ nodes belong to 6 different parts of speech: SCONJ (42; 55% instances), CCONJ (28; 36% instances), PUNCT (3; 4% instances), PRON (2; 3% instances), ADJ (1; 1% instances), AUX (1; 1% instances)