Statistics of PART in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: `PART`

There are 33 PART lemmas (2%), 93 PART types (3%) and 1935 PART tokens (9%). Out of 16 observed tags, the rank of PART is: 12 in number of lemmas, 8 in number of types and 5 in number of tokens.

The 10 most frequent PART lemmas: tòː, hŋ́, ɗi, kən, aː, bàː, oː, ni, kúmá, mə́n

The 10 most frequent PART types: tôː, hŋ́, ɗi, bàː, aː, tòː, oː, ni, kúmá, máː

The 10 most frequent ambiguous lemmas: tòː (PART 443, INTJ 17, CCONJ 1), aː (PART 156, INTJ 3), ni (PART 94, X 1), kúmá (PART 67, CCONJ 6, X 1), mə́n (PART 64, NOUN 44), máː (PART 62, ADV 28, X 1), wéy (PART 33, SCONJ 2), eː (PART 22, INTJ 5), ɗa (ADP 56, ADV 36, SCONJ 18, PART 14, AUX 7), kóː (CCONJ 92, PART 13, SCONJ 12, DET 5, ADV 3, ADP 1, X 1)

The 10 most frequent ambiguous types: tôː (PART 327, INTJ 12, CCONJ 1), aː (PART 122, INTJ 2), tòː (PART 113, INTJ 5), ni (PART 85, X 1), kúmá (PART 66, CCONJ 6), máː (PART 62, AUX 30, ADV 28, X 1), mə́n (PART 57, NOUN 55), wéy (PART 33, SCONJ 2), ŋaː (ADJ 38, PART 24, NOUN 9, ADV 2), eː (PART 21, INTJ 4)

tôː
- PART 327: mə́ nat ŋamtsə́ ɗi < tôː mə́ máni mə́ mán tsə́tn ni //
- INTJ 12: tôː ɣən //
- CCONJ 1: m̀ː < wò tuːr náɣɗêʃíː //= tôː < ngaː vər mí |c tə́ gyaː bàːbá //= tôː mə́ ʧî //
aː
- PART 122: tá má gàʤí hŋ́ aː ?//
- INTJ 2: tôː aː < má fi tə wúr ɣəní eː ?//
tòː
- PART 113: kóː má ɬə́ hâɗá tə́ Dàːdámmu á Sáːní ɗáni tòː //
- INTJ 5: tòː //
ni
- PART 85: éy < kyáː mân < ká mán fí ni maːndə tə́ kúɲêtn //
- X 1: á wû tu [ ba zan hau ba !//= ba zan hau ba !//= ni za ku gwada min boko ?//= ba zan hau ba !//] //
kúmá
- PART 66: myáːni kúmá < kâːy !//
- CCONJ 6: mə́ wúl tə tu mi tə́ túrâtn kúmá mi tə́ kúɲêtn //
máː
- PART 62: sòːséy máː !//
- AUX 30: máː yí wum éy ɗa áy yǎː wulíː //
- ADV 28: sòːséy máː !//
- X 1: myàː yí fi tə gistə́ |a deːdéː á wátan || dǎː máː tôː wátan tára kóː góːma //
mə́n
- PART 57: má ngêláŋ //= má yê ʃí kàːsuwa Kímsə́y mə́n kóː yi wuriː ?//
- NOUN 55: ngə́tn tə́ mə́n dwaːndə //
wéy
- PART 33: wéy Á~ < éy yâːn tá fî ni maːndə tə́ kúɲêtn < bâː dàːmuwa //
- SCONJ 2: bàː wéy ʧáː lə́ːr ni kúniwòs ɗi tə̀ màn yèl tə̀ hŋ́ |c àmmáː ʧáː mân nə́ níː ?//
ŋaː
- ADJ 38: Féːlêks ( kyâːn máː káː rigá kə yisə́ŋ tə́y //) gíː < ŋaː laː ɓas tə //
- PART 24: ɗan kyáː ɓwaː ŋáː < kə yél ɮǐːwâː máː àː kə́ːʃíː ŋaː ?//
- NOUN 9: yâːn nə ŋaː gə̀t < wò som gə nə́ wút bàɬkə̀nì //
- ADV 2: séː á wû tu [ tôː < fǐn ɗi ŋaː lap-láp !//] //
eː
- PART 21: tôː aː < má fi tə wúr ɣəní eː ?//
- INTJ 4: tôː < eː séː lː~ &//

Morphology

The form / lemma ratio of PART is 2.818182 (the average of all parts of speech is 1.692524).

The 1st highest number of forms (20) was observed with the lemma “kən”: gən, gəndí, gəní, gənín, gə̂n, kən, kəndá, kəndí, kəní, kənín, n, ŋ, əŋ, ɣən, ɣəndá, ɣəndí, ɣəní, ɣənín, ɣəŋ, ɣə̂n.

The 2nd highest number of forms (15) was observed with the lemma “hŋ́”: hŋ́, hŋ́ə́y, hḿ, n, ń, ň, ŋ, ŋə́y, ŋ́, ŋ̌, ǐn, ə́ŋ, ə̌n, ə̌ŋ, ə̌ːníː.

The 3rd highest number of forms (7) was observed with the lemma “aː”: aː, yaː, àː, âː, ŋaː, ŋâː, ŋǎː.

PART occurs with 5 features: PartType (1932; 100% instances), Polarity (460; 24% instances), Evident (33; 2% instances), Deixis (23; 1% instances), Foreign (14; 1% instances)

PART occurs with 13 feature-value pairs: Deixis=Prox, Deixis=Remt, Evident=Nfh, Foreign=Yes, PartType=Adv, PartType=Case, PartType=Disc, PartType=Foc, PartType=Ill, PartType=Neg, PartType=Pred, PartType=Top, Polarity=Neg

PART occurs with 21 feature combinations. The most frequent feature combination is PartType=Disc (513 tokens). Examples: tôː, tòː, kúmá, wéy, kóː, wàːtòː, kàm, fáː, fâː, koː

Relations

PART nodes are attached to their parents using 9 different relations: discourse (848; 44% instances), advmod (703; 36% instances), compound:prt (355; 18% instances), root (14; 1% instances), ccomp (5; 0% instances), fixed (3; 0% instances), parataxis (3; 0% instances), reparandum (3; 0% instances), conj (1; 0% instances)

Parents of PART nodes belong to 16 different parts of speech: VERB (1420; 73% instances), NOUN (219; 11% instances), PRON (107; 6% instances), ADV (54; 3% instances), AUX (28; 1% instances), PROPN (24; 1% instances), INTJ (20; 1% instances), PART (16; 1% instances), (14; 1% instances), X (10; 1% instances), ADP (8; 0% instances), NUM (5; 0% instances), ADJ (3; 0% instances), DET (3; 0% instances), SCONJ (3; 0% instances), CCONJ (1; 0% instances)

1806 (93%) PART nodes are leaves.

109 (6%) PART nodes have one child.

10 (1%) PART nodes have two children.

10 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 6.

Children of PART nodes are attached using 10 different relations: punct (120; 68% instances), discourse (14; 8% instances), nsubj (11; 6% instances), advcl (9; 5% instances), advmod (7; 4% instances), mark (6; 3% instances), reparandum (3; 2% instances), xcomp (3; 2% instances), dislocated (2; 1% instances), obl (1; 1% instances)

Children of PART nodes belong to 10 different parts of speech: PUNCT (120; 68% instances), PART (16; 9% instances), NOUN (11; 6% instances), VERB (8; 5% instances), SCONJ (6; 3% instances), INTJ (5; 3% instances), PRON (4; 2% instances), PROPN (4; 2% instances), ADV (1; 1% instances), CCONJ (1; 1% instances)

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: PART

Morphology

Relations

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: `PART`