Statistics of X in UD_Zaar-Autogramm

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: `X`

There are 207 X lemmas (13%), 217 X types (8%) and 395 X tokens (2%). Out of 16 observed tags, the rank of X is: 3 in number of lemmas, 3 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: ʃèː, XX, x, nan, kura, shi, ba, a, tunda, ʧét

The 10 most frequent X types: ʃèː, XX, nan, shi, ba, kura, a, tunda, ʧét, kafin

The 10 most frequent ambiguous lemmas: ʃèː (X 24, PART 2), XX (X 14, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 6, SCONJ 1), bâː (PART 9, X 3), dâːmáː (X 3, ADV 1), hár (ADP 25, SCONJ 5, ADV 4, X 3), swǎːt (X 3, ADV 1), wéy (PART 29, X 3), yànzú (X 2, ADV 1), Lim (X 2, PROPN 1)

The 10 most frequent ambiguous types: ʃèː (AUX 42, X 23, PART 2), XX (X 15, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 6, SCONJ 1), ʧét (X 6, VERB 1), bâː (PART 9, X 3), dâːmáː (X 3, ADV 1), hár (ADP 24, SCONJ 5, ADV 4, X 3), wéy (PART 29, X 3), yànzú (X 3, ADV 1), Lim (X 2, PROPN 1)

ʃèː
- AUX 42: ʃèː myâːn ?//= yel =əm !//
- X 23: á wû tu [ ʃèː < ká !//= ʃèː kyàːní kə́ kə́ kə́ kə́ fî ŋǎːn ?//] //
- PART 2: ʃèː gín ɗangəní >+ ʃèː fi //
XX
- X 15: but XX &//
- ADV 2: XX &//
- DET 1: XX //
- INTJ 1: XX &//
- PART 1: XX &//
- VERB 1: XX &//
tunda
- X 6: tunda Zəgì àː *kap-íː gə̀t kàm < ma *kap-íː gə̀t //
- SCONJ 1: tunda káː ngúp wúlɣə̂n vìː tə́ yáːníː káwây < to shi ke nan //
ʧét
- X 6: tôː mə́ wûl tu < kusuŋ yáːwón ɲolí ʧét ʧét ʧét ʧét ʧét ʧét //!
- VERB 1: myáː nger mə nda =ni || myáː nger mə nda =ni káwêy < séː ə̀ː ɗan mə ɬə́ tulíː < séː mə ɬə́ tu èː gàri gón |c mə ʧét < [ wannan < wane &//] &//
bâː
- PART 9: tóː < gìːr =wàːsə̀ŋ gəní < á gàː =ʃí bâː ʒà hŋ́ oː //
- X 3: wéy á < éy yâːn tá fî =ni maːndə tə́ kúni =âtn < bâː dàːmuwa //
dâːmáː
- X 3: dâːmáː tə́ wû tu kápkə̂n gə̀t < séː kyáː &//
- ADV 1: âː dâːmáː gèntsə̀ tə̀tàyáː mbûɗíː áy !//
hár
- ADP 24: Tʃôkn yáː yâddéy kàm < hár wò mán ʃiː wò naː ɗàrí nandam //
- SCONJ 5: tôː ká rîːp //= ká *rîːp-íː =tə hár fi ɗan gyópti ʧǐː //
- ADV 4: hár ɗan &//
- X 3: tòː < tá fî =tə naː íri ɮə̀pmgə̀n gíː hár yànzú //
wéy
- PART 29: wéy á < éy yâːn tá fî =ni maːndə tə́ kúni =âtn < bâː dàːmuwa //
- X 3: tə́ ʒìɗ =mə̀ tu wéy gàː || wéy kú~ &//
yànzú
- X 3: tòː < tá fî =tə naː íri ɮə̀pmgə̀n gíː hár yànzú //
- ADV 1: tôː myàːní kúmá < myàːyí nat kâːr =wàːsə̀ŋ háy yànzú //
Lim
- X 2: wéy wannan Lim ne ?//
- PROPN 1: Lim || Lìːmƙása kenan ?//

Morphology

The form / lemma ratio of X is 1.048309 (the average of all parts of speech is 1.611418).

The 1st highest number of forms (8) was observed with the lemma “X”: X, ki, kira, wace, yírtə, ƙasa, ɣá~, ʧi.

The 2nd highest number of forms (2) was observed with the lemma “dàːmuwa”: dàːmuwa, dàːmuwá.

The 3rd highest number of forms (2) was observed with the lemma “kura”: kura, kurâs.

X occurs with 1 features: Foreign (263; 67% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (263 tokens). Examples: nan, shi, ba, kura, a, tunda, kafin, wannan, ɗaya, OK

Relations

X nodes are attached to their parents using 26 different relations: flat:foreign (89; 23% instances), obl (74; 19% instances), root (40; 10% instances), dep (26; 7% instances), discourse (24; 6% instances), obj (21; 5% instances), compound:redup (19; 5% instances), nmod (18; 5% instances), xcomp (12; 3% instances), fixed (10; 3% instances), parataxis (10; 3% instances), reparandum (9; 2% instances), dislocated (7; 2% instances), nsubj (7; 2% instances), advmod (4; 1% instances), conj (4; 1% instances), advcl (3; 1% instances), appos (3; 1% instances), obl:arg (3; 1% instances), vocative (3; 1% instances), cc (2; 1% instances), compound (2; 1% instances), flat (2; 1% instances), cc:preconj (1; 0% instances), flat:name (1; 0% instances), mark (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: VERB (152; 38% instances), X (147; 37% instances), (40; 10% instances), NOUN (17; 4% instances), PART (13; 3% instances), INTJ (7; 2% instances), AUX (5; 1% instances), PROPN (5; 1% instances), ADV (3; 1% instances), NUM (3; 1% instances), PRON (2; 1% instances), SCONJ (1; 0% instances)

214 (54%) X nodes are leaves.

83 (21%) X nodes have one child.

51 (13%) X nodes have two children.

47 (12%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 34 different relations: punct (95; 25% instances), flat:foreign (89; 23% instances), discourse (30; 8% instances), compound:redup (19; 5% instances), nmod (18; 5% instances), advmod (13; 3% instances), case (13; 3% instances), fixed (11; 3% instances), dep (10; 3% instances), ccomp (9; 2% instances), aux (8; 2% instances), acl (7; 2% instances), nmod:poss (6; 2% instances), obj (5; 1% instances), parataxis (5; 1% instances), reparandum (5; 1% instances), appos (4; 1% instances), conj (4; 1% instances), det (4; 1% instances), dislocated (3; 1% instances), obl:arg (3; 1% instances), acl:relcl (2; 1% instances), advcl (2; 1% instances), flat (2; 1% instances), mark (2; 1% instances), nsubj (2; 1% instances), obl (2; 1% instances), vocative (2; 1% instances), xcomp (2; 1% instances), amod (1; 0% instances), cc (1; 0% instances), compound (1; 0% instances), compound:prt (1; 0% instances), flat:name (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (147; 38% instances), PUNCT (95; 25% instances), PART (24; 6% instances), VERB (23; 6% instances), NOUN (22; 6% instances), INTJ (14; 4% instances), PRON (12; 3% instances), SCONJ (9; 2% instances), ADP (8; 2% instances), ADV (8; 2% instances), AUX (8; 2% instances), DET (5; 1% instances), PROPN (5; 1% instances), ADJ (1; 0% instances), NUM (1; 0% instances)

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: X

Morphology

Relations

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: `X`