
This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: AUX

There is 1 AUX lemma (0%), 108 AUX types (6%) and 2160 AUX tokens (15%). Out of 16 observed tags, AUX ranks 16th in number of lemmas, 4th in number of types and 2nd in number of tokens.

The 10 most frequent AUX lemmas: _

The 10 most frequent AUX types: ya, nèː, yaː, kaː, kà, ta, akà, à, an, sukà

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: ya (AUX 229, PRON 1), nèː (AUX 120, PART 77), (AUX 97, PRON 10), ta (AUX 82, PART 19, PRON 18, ADP 11), à (ADP 140, AUX 72), (AUX 58, ADP 1), ka (AUX 44, PRON 6, INTJ 1), (AUX 38, PRON 14), (AUX 30, PRON 24), (AUX 20, PRON 6)

Morphology

The form / lemma ratio of AUX is 108.000000 (the average of all parts of speech is 1.357040).
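The AUX ratio above is consistent with dividing the number of distinct AUX forms (108) by the number of distinct AUX lemmas (1). The following is a minimal Python sketch of that calculation under this assumed definition. The function name `form_lemma_ratio` and the toy rows are hypothetical illustrations, not the script that generated this page and not actual treebank data.

```python
def form_lemma_ratio(rows, upos):
    """Return distinct forms / distinct lemmas for one POS tag.

    rows: iterable of (form, lemma, upos) tuples.
    Assumption: the ratio is defined over distinct forms and lemmas.
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in rows:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Hypothetical toy rows: three distinct AUX forms sharing the lemma "_".
toy_rows = [
    ("ya", "_", "AUX"),
    ("ta", "_", "AUX"),
    ("sukà", "_", "AUX"),
    ("ya", "_", "AUX"),   # repeated form, counted once
    ("ta", "ta", "PRON"),  # other tags are ignored
]

ratio = form_lemma_ratio(toy_rows, "AUX")
```

With every AUX token sharing the lemma “_”, the ratio simply equals the number of distinct forms, which is why the value reported for this treebank is so far above the average.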

The highest number of forms (108) was observed with the lemma “_”: akà, akàn, akè, akèː, am, an, anàː, baː’à, baː’àː, baːkà, baːkàː, baːmàː, baːnàː, baːsàː, baːsù, baːtà, baːyà, baːyàː, baː~, bài, bàkà, bàkì, bàmù, bàn, bàsù, bàtà, bàʼà, bâi, bân, cèː, inàː, ka, ka:, kakè, kakèː, kanàː, kaː, kikà, kikèː, kin, kinàː, kukà, kukàn, kukè, kukèː, kun, kunàː, kyâː, kà, kâː, kèː, kì, kù, mukà, mukàn, mukè, mukèː, mun, munàː, mwâː, mù, na, nakè, nakèː, naː, neː, nikè, nàː, nèː, shikè, shì, sukà, sukàn, sukè, sukèː, sun, sunàː, sù, ta, takàn, takè, takèː, tanàː, taː, tà, tâː, ya, yakàn, yakè, yakèː, yanàː, yaː, yà, zaː’à, zaːkà, zaːkì, zaːkù, zaːmù, zaːsù, zaːtà, zaː~, zaːʔà, zaːʼà, zâi, zân, à, âː, ìn.

AUX occurs with 6 features: Person (2016; 93% instances), Aspect (1753; 81% instances), Gender (1160; 54% instances), Number (765; 35% instances), Tense (174; 8% instances), Polarity (91; 4% instances)

AUX occurs with 19 feature-value pairs: Aspect=Aor, Aspect=Hab, Aspect=Perf, Aspect=PerfBkg, Aspect=PerfNeg, Aspect=Prog, Aspect=ProgBkg, Aspect=ProgNeg, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Person=4, Polarity=Neg, Tense=Fut, Tense=Pred

AUX occurs with 108 feature combinations. The most frequent feature combination is Aspect=PerfBkg|Gender=Masc|Person=3 (231 tokens). Examples: ya, bài
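A feature combination such as the one above follows the CoNLL-U FEATS convention: `Name=Value` pairs joined by `|`, with `_` standing for an empty feature set. A small Python sketch of splitting such a string into its pairs follows; the helper name `parse_feats` is hypothetical.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dict."""
    if feats == "_":  # "_" marks an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent AUX feature combination reported above.
most_frequent = parse_feats("Aspect=PerfBkg|Gender=Masc|Person=3")
```

Counting how often each distinct FEATS string occurs on AUX tokens would then give the number of feature combinations, while counting the individual pairs would give the feature-value inventory.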

Relations

AUX nodes are attached to their parents using 13 different relations: aux (1763; 82% instances), cop (233; 11% instances), root (49; 2% instances), reparandum (38; 2% instances), advcl:cleft (16; 1% instances), ccomp (13; 1% instances), acl:relcl (11; 1% instances), advcl (11; 1% instances), conj (10; 0% instances), parataxis (10; 0% instances), discourse (3; 0% instances), dep (2; 0% instances), acl (1; 0% instances)
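The percentages in the relation list above can be recomputed from the raw counts. The Python sketch below uses the counts exactly as listed and assumes that each share is rounded to a whole percent; the rounding rule and the helper name `whole_percent` are assumptions, not taken from the script that generated this page.

```python
# Counts copied from the relation list above.
relation_counts = {
    "aux": 1763, "cop": 233, "root": 49, "reparandum": 38,
    "advcl:cleft": 16, "ccomp": 13, "acl:relcl": 11, "advcl": 11,
    "conj": 10, "parataxis": 10, "discourse": 3, "dep": 2, "acl": 1,
}


def whole_percent(count, total):
    # Assumption: shares are rounded half up to a whole percent.
    return int(100 * count / total + 0.5)


total = sum(relation_counts.values())
shares = {rel: whole_percent(n, total) for rel, n in relation_counts.items()}
```

The counts sum to the 2160 AUX tokens, and a relation with 10 or fewer instances falls below half a percent, which is why it is shown as 0%.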

Parents of AUX nodes belong to 12 different parts of speech: VERB (1808; 84% instances), NOUN (140; 6% instances), PRON (91; 4% instances), [no parent; root] (49; 2% instances), ADV (33; 2% instances), AUX (16; 1% instances), PROPN (9; 0% instances), X (7; 0% instances), ADJ (2; 0% instances), NUM (2; 0% instances), PART (2; 0% instances), ADP (1; 0% instances)

2013 (93%) AUX nodes are leaves.

50 (2%) AUX nodes have one child.

27 (1%) AUX nodes have two children.

70 (3%) AUX nodes have three or more children.

The highest child degree of an AUX node is 8.
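The leaf and child-degree figures above can be derived by counting, for each AUX node, how many tokens name it as their head. The Python sketch below does this for a single sentence. The function name `child_degree_buckets`, the `(id, head, upos)` tuple layout and both toy sentences are hypothetical and are not taken from the treebank.

```python
from collections import Counter


def child_degree_buckets(tokens, upos):
    """Bucket nodes of one POS by number of children.

    tokens: list of (id, head, upos) for one sentence; head 0 is the root.
    Returns (Counter of bucket names, highest child degree seen).
    """
    n_children = Counter(head for _, head, _ in tokens)
    buckets = Counter()
    max_degree = 0
    for tok_id, _, tag in tokens:
        if tag != upos:
            continue
        degree = n_children[tok_id]  # 0 if no token points at this node
        max_degree = max(max_degree, degree)
        if degree == 0:
            buckets["leaf"] += 1
        elif degree == 1:
            buckets["one"] += 1
        elif degree == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets, max_degree


# Hypothetical sentence 1: the AUX (id 2) depends on a verb and has no children.
aux_as_leaf = [(1, 3, "PRON"), (2, 3, "AUX"), (3, 0, "VERB"), (4, 3, "NOUN")]

# Hypothetical sentence 2: the AUX (id 2) is the root and has three children.
aux_as_root = [(1, 2, "NOUN"), (2, 0, "AUX"), (3, 2, "NOUN"), (4, 2, "PUNCT")]

leaf_buckets, leaf_max = child_degree_buckets(aux_as_leaf, "AUX")
root_buckets, root_max = child_degree_buckets(aux_as_root, "AUX")
```

Summing the buckets over all sentences would give totals comparable to the four counts listed above, which together account for all 2160 AUX tokens (2013 + 50 + 27 + 70).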

Children of AUX nodes are attached using 21 different relations: punct (88; 24% instances), ccomp (49; 13% instances), advmod (36; 10% instances), discourse (33; 9% instances), advcl (31; 8% instances), mark (18; 5% instances), dislocated (17; 5% instances), iobj (15; 4% instances), nsubj (14; 4% instances), obj (14; 4% instances), obl (12; 3% instances), parataxis (12; 3% instances), conj (10; 3% instances), reparandum (10; 3% instances), dep (7; 2% instances), advcl:cleft (2; 1% instances), cc (2; 1% instances), acl:relcl (1; 0% instances), csubj (1; 0% instances), obl:arg (1; 0% instances), vocative (1; 0% instances)

Children of AUX nodes belong to 13 different parts of speech: VERB (103; 28% instances), PUNCT (88; 24% instances), NOUN (38; 10% instances), PRON (28; 7% instances), ADV (25; 7% instances), SCONJ (22; 6% instances), PART (20; 5% instances), AUX (16; 4% instances), INTJ (16; 4% instances), CCONJ (7; 2% instances), X (6; 2% instances), PROPN (4; 1% instances), NUM (1; 0% instances)