Treebank Statistics: UD_Welsh-CCG: POS Tags: AUX
There are 7 AUX lemmas (0%), 62 AUX types (1%) and 3046 AUX tokens (6%).
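The three counts above can be reproduced from any list of annotated tokens. The following is a minimal sketch, not the script that generated this page: the `TOKENS` list is invented toy data, the function name `pos_counts` is hypothetical, and it assumes that types are distinct lowercased word forms.

```python
# Toy (form, lemma, UPOS) triples; invented for illustration, not treebank data.
TOKENS = [
    ("Mae", "bod", "AUX"),
    ("yn", "yn", "AUX"),
    ("'n", "yn", "AUX"),
    ("wedi", "wedi", "AUX"),
    ("mae", "bod", "VERB"),
    ("yn", "yn", "ADP"),
    ("oedd", "bod", "AUX"),
    ("ar", "ar", "ADP"),
]


def pos_counts(tokens, upos):
    """Return (number of lemmas, number of types, number of tokens) for one tag.

    Assumption: a type is a distinct lowercased word form.
    """
    selected = [(form.lower(), lemma) for form, lemma, tag in tokens if tag == upos]
    lemmas = {lemma for _form, lemma in selected}
    types = {form for form, _lemma in selected}
    return len(lemmas), len(types), len(selected)


n_lemmas, n_types, n_tokens = pos_counts(TOKENS, "AUX")
print(n_lemmas, n_types, n_tokens)
```

The percentages in the summary line would then be each count divided by the corresponding total over all tags in the treebank.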
Out of 15 observed tags, the rank of AUX is: 13 in number of lemmas, 9 in number of types and 7 in number of tokens.
The 10 most frequent AUX lemmas: yn, bod, wedi, ar, am, newydd, heb
The 10 most frequent AUX types: yn, ‘n, wedi, mae, yw, oedd, bod, fod, oes, ar
The 10 most frequent ambiguous lemmas: yn (AUX 1367, PART 1106, ADP 1046), bod (VERB 1782, AUX 1087, NOUN 308), wedi (AUX 482, ADP 6, SCONJ 5), ar (ADP 747, AUX 39), am (ADP 331, AUX 27), newydd (ADJ 82, AUX 26), heb (ADP 33, AUX 18)
The 10 most frequent ambiguous types: yn (AUX 882, PART 743, ADP 732), ‘n (AUX 478, PART 322, PRON 31, ADP 11), wedi (AUX 465, SCONJ 5, ADP 2), mae (VERB 257, AUX 91, NOUN 2), yw (AUX 176, VERB 39), oedd (AUX 147, VERB 134), bod (NOUN 195, AUX 91), fod (NOUN 96, AUX 76), oes (AUX 42, VERB 21, NOUN 7), ar (ADP 708, AUX 39)
- yn
- ‘n
- wedi
- mae
  - VERB 257: Efrog Newydd yw ‘r ddinas mae pobl yn meddwl gyntaf am hi .
  - AUX 91: Dyma bump o raeadrau Eryri mae ‘n rhaid i chi eu gweld .
  - NOUN 2: Bu Dafydd ap Gwilym yn byw yn y pentref ac mae yna gerdd am e ‘i hun yn eglwys Llanbadarn , yn canolbwyntio ar y merched yn y gynulleidfa yn hytrach na ‘r gwasanaeth .
- yw
- oedd
- bod
- fod
- oes
- ar
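An ambiguity list like the ones above can be built by grouping tag counts per lemma and keeping the lemmas that occur with AUX and at least one other tag. This is a minimal sketch on invented toy data; `ambiguous_lemmas` and `describe` are hypothetical names. Sorting by the AUX count is an assumption that is consistent with the order of the lemma list above, not a documented rule.

```python
from collections import Counter, defaultdict

# Toy (form, lemma, UPOS) triples; invented for illustration, not treebank data.
TOKENS = [
    ("yn", "yn", "AUX"), ("'n", "yn", "AUX"), ("yn", "yn", "AUX"),
    ("yn", "yn", "PART"), ("yn", "yn", "PART"), ("yn", "yn", "ADP"),
    ("mae", "bod", "VERB"), ("oedd", "bod", "VERB"), ("yw", "bod", "AUX"),
    ("wedi", "wedi", "AUX"),
]


def ambiguous_lemmas(tokens, upos):
    """Lemmas seen with `upos` and at least one other tag, with per-tag counts."""
    by_lemma = defaultdict(Counter)
    for _form, lemma, tag in tokens:
        by_lemma[lemma][tag] += 1
    hits = [(lemma, tags) for lemma, tags in by_lemma.items()
            if upos in tags and len(tags) > 1]
    # Assumed ordering: highest count for the tag of interest first.
    return sorted(hits, key=lambda item: -item[1][upos])


def describe(lemma, tags):
    """Format one entry in the style used on this page."""
    summary = ", ".join(f"{tag} {n}" for tag, n in tags.most_common())
    return f"{lemma} ({summary})"


for lemma, tags in ambiguous_lemmas(TOKENS, "AUX"):
    print(describe(lemma, tags))
```

The same function applies to types if the word form is used as the grouping key instead of the lemma.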
Morphology
The form / lemma ratio of AUX is 8.857143 (the average of all parts of speech is 1.452021).
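The reported ratio matches the 62 AUX types divided by the 7 AUX lemmas given at the top of this page. A short check, with `form_lemma_ratio` as a hypothetical helper name:

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas


# 62 types and 7 lemmas are the AUX counts reported on this page.
aux_ratio = form_lemma_ratio(62, 7)
print(f"{aux_ratio:.6f}")  # 8.857143
```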
The 1st highest number of forms (53) was observed with the lemma “bod”: Buodd, Byddaf, Byddan, Byddech, Byddi, Bûm, Maen, Oeddet, Ydach, baech, baent, baswn, bawn, bod, bu, buoch, bydd, bydda, byddai, byddant, byddwch, bysa, dw, dwi, fo, fod, fu, fydd, fyddai, ma’, mae, mod, oedd, oeddech, oeddem, oeddwn, oes, s’, sy, sydd, wy, wyf, wyt, ydi, ydoedd, ydw, ydy, ydych, ydym, ydyn, ydynt, ydyw, yw.
The 2nd highest number of forms (3) was observed with the lemma “wedi”: ‘di, di, wedi.
The 3rd highest number of forms (2) was observed with the lemma “yn”: ‘n, yn.
AUX occurs with 6 features: Number (1087; 36% instances), VerbForm (1087; 36% instances), Person (917; 30% instances), Tense (916; 30% instances), Mood (909; 30% instances), Mutation (125; 4% instances)
AUX occurs with 18 feature-value pairs: Mood=Cnd, Mood=Ind, Mood=Sub, Mutation=NM, Mutation=SM, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pqp, Tense=Pres, VerbForm=Fin, VerbForm=FinRel, VerbForm=Vnoun
AUX occurs with 36 feature combinations.
The most frequent feature combination is _ (1959 tokens).
Examples: yn, ‘n, wedi, ar, am, newydd, heb, ‘di, di
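The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of the AUX tokens. The sketch below uses invented FEATS strings and hypothetical names (`parse_feats`, `feature_stats`). It relies only on the CoNLL-U convention that pairs are separated by `|` and that `_` marks an empty feature set.

```python
from collections import Counter

# Toy FEATS values for five AUX tokens; invented for illustration.
FEATS = [
    "_",
    "_",
    "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin",
    "Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin",
    "Mutation=SM|VerbForm=Vnoun",
]


def parse_feats(feats):
    """Split a FEATS string into 'Name=Value' pairs; '_' means no features."""
    return [] if feats == "_" else feats.split("|")


def feature_stats(feats_column):
    """Count feature names, feature-value pairs and whole combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        for pair in parse_feats(feats):
            pairs[pair] += 1
            names[pair.split("=", 1)[0]] += 1
    return names, pairs, combos


names, pairs, combos = feature_stats(FEATS)
print(len(names), "features;", len(pairs), "feature-value pairs;",
      len(combos), "combinations")
print("most frequent combination:", combos.most_common(1)[0])
```

Tokens with no features at all fall under the `_` combination, which is why particles such as yn and wedi appear as its examples.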
Relations
AUX nodes are attached to their parents using 10 different relations: aux (1966; 65% instances), cop (996; 33% instances), root (42; 1% instances), conj (18; 1% instances), acl:relcl (10; 0% instances), ccomp (6; 0% instances), advcl (4; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), appos (1; 0% instances)
Parents of AUX nodes belong to 10 different parts of speech: NOUN (2507; 82% instances), ADJ (441; 14% instances), no parent, i.e. the sentence root (42; 1% instances), NUM (21; 1% instances), VERB (17; 1% instances), PROPN (7; 0% instances), PRON (6; 0% instances), ADV (3; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances)
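Both distributions above come from following each AUX node to its head. The sketch below uses invented toy trees and a hypothetical function name, `aux_attachment`. An AUX node whose head is 0 has no parent token, which would account for the 42-instance entry in the parent list matching the 42 `root` relations.

```python
from collections import Counter

# Toy sentences as (id, UPOS, head, deprel) rows; invented for illustration.
SENTENCES = [
    [(1, "AUX", 3, "aux"), (2, "PRON", 3, "nsubj"),
     (3, "NOUN", 0, "root"), (4, "PUNCT", 3, "punct")],
    [(1, "AUX", 2, "cop"), (2, "ADJ", 0, "root"), (3, "NOUN", 2, "nsubj")],
    [(1, "AUX", 0, "root"), (2, "NOUN", 1, "nsubj"), (3, "PUNCT", 1, "punct")],
]


def aux_attachment(sentences, upos="AUX"):
    """Count incoming relations and parent tags for all nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sentence in sentences:
        tag_by_id = {tid: tag for tid, tag, _head, _rel in sentence}
        for _tid, tag, head, rel in sentence:
            if tag != upos:
                continue
            relations[rel] += 1
            # Head 0 is the artificial root, so there is no parent tag.
            parent_pos[tag_by_id.get(head, "(root)")] += 1
    return relations, parent_pos


relations, parent_pos = aux_attachment(SENTENCES)
print(relations.most_common())
print(parent_pos.most_common())
```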
2956 (97%) AUX nodes are leaves.
11 (0%) AUX nodes have one child.
17 (1%) AUX nodes have two children.
62 (2%) AUX nodes have three or more children.
The highest child degree of an AUX node is 8.
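The child-degree figures above amount to counting, for each AUX node, how many tokens name it as their head. A minimal sketch on invented toy trees follows, with hypothetical names `child_degrees` and `bucket`.

```python
from collections import Counter

# Toy sentences as (id, UPOS, head, deprel) rows; invented for illustration.
SENTENCES = [
    [(1, "AUX", 3, "aux"), (2, "PRON", 3, "nsubj"),
     (3, "NOUN", 0, "root"), (4, "PUNCT", 3, "punct")],
    [(1, "AUX", 2, "cop"), (2, "ADJ", 0, "root"), (3, "NOUN", 2, "nsubj")],
    [(1, "AUX", 0, "root"), (2, "NOUN", 1, "nsubj"), (3, "PUNCT", 1, "punct")],
    [(1, "AUX", 0, "root"), (2, "NOUN", 1, "nsubj"),
     (3, "ADV", 1, "advmod"), (4, "PUNCT", 1, "punct")],
]


def child_degrees(sentences, upos="AUX"):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    for sentence in sentences:
        # How many tokens point at each head id within this sentence.
        n_children = Counter(head for _tid, _tag, head, _rel in sentence)
        degrees.extend(n_children[tid]
                       for tid, tag, _head, _rel in sentence if tag == upos)
    return degrees


def bucket(degree):
    """Map a child count to the four classes used on this page."""
    return ("leaf", "one child", "two children")[degree] if degree < 3 \
        else "three or more children"


degrees = child_degrees(SENTENCES)
buckets = Counter(bucket(d) for d in degrees)
print(buckets, "highest child degree:", max(degrees))
```

On the page's own figures the four classes add up to the AUX token count: 2956 + 11 + 17 + 62 = 3046.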
Children of AUX nodes are attached using 14 different relations: punct (75; 25% instances), nsubj (59; 20% instances), advmod (54; 18% instances), xcomp (48; 16% instances), cc (16; 5% instances), obl (16; 5% instances), mark (8; 3% instances), advcl (6; 2% instances), fixed (6; 2% instances), conj (3; 1% instances), appos (1; 0% instances), case (1; 0% instances), csubj (1; 0% instances), det (1; 0% instances)
Children of AUX nodes belong to 14 different parts of speech: NOUN (104; 35% instances), PUNCT (75; 25% instances), PART (32; 11% instances), PRON (22; 7% instances), ADV (19; 6% instances), CCONJ (16; 5% instances), PROPN (9; 3% instances), SCONJ (7; 2% instances), ADJ (3; 1% instances), VERB (3; 1% instances), ADP (2; 1% instances), AUX (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)