Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: POS Tags: ADJ
There are 667 ADJ
lemmas (11%), 807 ADJ
types (10%) and 3522 ADJ
tokens (4%).
Out of 17 observed tags, the rank of ADJ
is: 3 in number of lemmas, 3 in number of types and 10 in number of tokens.
The 10 most frequent ADJ
lemmas: eile, mòr, math, bi, sam, beag, ùr, mór, seann, a
The 10 most frequent ADJ
types: eile, bith, sam, ùr, math, beag, mhòr, mòr, a, cinnteach
The 10 most frequent ambiguous lemmas: eile (ADJ 242, NOUN 3), mòr (ADJ 152, ADV 9), math (ADJ 146, ADV 135, NOUN 7), bi (VERB 5158, NOUN 285, ADJ 107, ADV 4), beag (ADJ 105, NOUN 4), ùr (ADJ 102, ADV 2), mór (ADJ 66, ADV 12), a (PART 3253, DET 592, PRON 429, ADP 279, ADV 140, ADJ 51, SCONJ 7, X 6, INTJ 4, PROPN 3, CCONJ 2), fada (ADJ 50, ADV 28), cinnteach (ADJ 44, ADV 5)
The 10 most frequent ambiguous types: eile (ADJ 239, NOUN 3), bith (ADJ 106, NOUN 3, PRON 1, SCONJ 1), ùr (ADJ 84, ADV 2), math (ADV 124, ADJ 73, NOUN 5), beag (ADJ 50, NOUN 1), mòr (ADJ 54, ADV 9), a (PART 3247, DET 597, PRON 429, ADP 262, ADV 139, ADJ 51, SCONJ 7, X 6, CCONJ 2, PROPN 2, INTJ 1), cinnteach (ADJ 43, ADV 5), thall (ADV 39, ADJ 34), fhearr (ADJ 32, NOUN 11)
- eile
- bith
- ADJ 106: a bheil an cnatan air duine sam bith eile thall an sin a [Name] ?
- NOUN 3: Thug cuideigin gàire air mi nuair a thuirt iad gun deigheadh a’ Ghàidhlig cha mhór á bith nan tuiteadh inneal-spreadhaidh air an togalach .
- PRON 1: Thighearna , nuair a chuala Fionn seo , cha robh aig a’ chlaidheamh ri fuighleach beum fhàgail gar bith có air e a bhuailt e .
- SCONJ 1: thathas ag ràdh gur e born-again-Christian a tha an e gu bheil e air cùlaibh sin a bhith a’ dìonaich Iùdhaich aig cosgais sam bith agus a bhith a’ coimhead air na h-Arabaich mar gum b’ e iadsan ciontach as bith dè nì iad
- ùr
- math
- beag
- mòr
- a
- PART 3247: agus ciamar a bha a’ homework an do choimhead an tidsear ri e ?
- DET 597: ‘s an robh a h-uile duine eile air na trì duilleagan a dhèanamh ?
- PRON 429: Rinn e - fhèin gàire , an a shuidhe air a’ chloich .
- ADP 262: Chan eil a leithid a fhacal idir aig iad .
- ADV 139: tha e air a bhith ann a shin bho chionn bhliadhnachan
- ADJ 51: Freadaidh Dhòmhnaill Bhàin ann an Uibhist a Tuath
- SCONJ 7: Cha robh cùmhnadh aig iadsan air airgead , is a chionn ‘s nach robh an tìde ach goirid chaidh iad chun a h-uile dibhearsain a bha Lunnainn dheàlrach a’ tathann .
- X 6: a Mhurchaidh is e prògram ùr tha seo a’ tòiseachadh air an telly Give a Pet a Home
- CCONJ 2: ’s e [?] fhios aig thu [?] a’ rathaid mhòir a thoireadh cha robh tearradh an rathad an e agus dh’fhaodadh sibh a bhith a’ bristeadh chlachan mar a bhios iad ann am Porterfields ‘s dòcha fhathast agus dh’fhaodadh tu bhith ag obair le ag obair air stamh
- PROPN 2: Chaidh a’ chuid mhath de an chiad là a chur seachad a’ siubhal air itealain eadar Steòrnabhagh is Glaschu is eadar Glaschu is Baile a Cliath .
- INTJ 1: a ars’ ise ‘s tusa a mharbh mo thrì bhràithrean-sa an-dè ars’ ise ach marbhaidh mis’ thus’ an-diugh agus a-nuas gun do ghabh i a h-uile sùrdag a bheireadh i
- cinnteach
- thall
- fhearr
Morphology
The form / lemma ratio of ADJ
is 1.209895 (the average of all parts of speech is 1.311377).
The 1st highest number of forms (9) was observed with the lemma “mòr”: mhotha, mhuth’, mhò, mhòir, mhòr, mhòr’, mòr, mòr’, mòra.
The 2nd highest number of forms (6) was observed with the lemma “beag”: beag, beaga, bheag, bheaga, bhig, lugha.
The 3rd highest number of forms (6) was observed with the lemma “math”: fhearr, fheàirrde, fheàrr, fhèarr, math, mhath.
ADJ
occurs with 8 features: Number (1463; 42% instances), Case (1445; 41% instances), Gender (1445; 41% instances), Degree (224; 6% instances), ExtPos (220; 6% instances), Foreign (36; 1% instances), CleftType (11; 0% instances), Typo (9; 0% instances)
ADJ
occurs with 14 feature-value pairs: Case=Dat
, Case=Gen
, Case=Nom
, Case=Voc
, CleftType=Adj
, Degree=Cmp,Sup
, ExtPos=ADJ
, Foreign=Yes
, Gender=Fem
, Gender=Masc
, Number=Dual
, Number=Plur
, Number=Sing
, Typo=Yes
ADJ
occurs with 33 feature combinations.
The most frequent feature combination is _
(1578 tokens).
Examples: bith, cinnteach, math, faisg, seann, thall, droch, tuath, coltach, dheireadh
Relations
ADJ
nodes are attached to their parents using 21 different relations: amod (2146; 61% instances), xcomp:pred (833; 24% instances), fixed (222; 6% instances), conj (112; 3% instances), advmod (63; 2% instances), root (45; 1% instances), advcl (25; 1% instances), csubj:cop (14; 0% instances), ccomp (13; 0% instances), xcomp (11; 0% instances), compound (9; 0% instances), csubj:cleft (6; 0% instances), parataxis (6; 0% instances), acl:relcl (5; 0% instances), dislocated (4; 0% instances), acl (2; 0% instances), obj (2; 0% instances), appos (1; 0% instances), discourse (1; 0% instances), flat (1; 0% instances), obl (1; 0% instances)
Parents of ADJ
nodes belong to 14 different parts of speech: NOUN (2147; 61% instances), VERB (785; 22% instances), ADJ (318; 9% instances), PROPN (154; 4% instances), (45; 1% instances), PRON (41; 1% instances), NUM (10; 0% instances), ADV (8; 0% instances), X (5; 0% instances), ADP (4; 0% instances), PART (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)
2257 (64%) ADJ
nodes are leaves.
796 (23%) ADJ
nodes have one child.
292 (8%) ADJ
nodes have two children.
177 (5%) ADJ
nodes have three or more children.
The highest child degree of a ADJ
node is 7.
Children of ADJ
nodes are attached using 31 different relations: advmod (381; 19% instances), obl (349; 17% instances), fixed (219; 11% instances), mark:prt (155; 8% instances), cop (153; 8% instances), conj (143; 7% instances), punct (125; 6% instances), xcomp (81; 4% instances), ccomp (76; 4% instances), cc (73; 4% instances), nsubj (34; 2% instances), advcl:relcl (31; 2% instances), csubj:cop (30; 1% instances), advcl (25; 1% instances), obl:unmarked (20; 1% instances), case (14; 1% instances), mark (14; 1% instances), xcomp:pred (13; 1% instances), discourse (12; 1% instances), parataxis (12; 1% instances), csubj:cleft (11; 1% instances), compound (9; 0% instances), amod (6; 0% instances), flat (6; 0% instances), det (3; 0% instances), obj (3; 0% instances), vocative (3; 0% instances), dep (2; 0% instances), nmod:unmarked (2; 0% instances), nummod (1; 0% instances), reparandum (1; 0% instances)
Children of ADJ
nodes belong to 16 different parts of speech: NOUN (400; 20% instances), ADV (376; 19% instances), ADJ (318; 16% instances), VERB (180; 9% instances), PART (166; 8% instances), AUX (153; 8% instances), PRON (129; 6% instances), PUNCT (125; 6% instances), CCONJ (73; 4% instances), PROPN (37; 2% instances), ADP (15; 1% instances), SCONJ (13; 1% instances), INTJ (11; 1% instances), NUM (4; 0% instances), X (4; 0% instances), DET (3; 0% instances)