home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: PART

There are 44 PART lemmas (5%), 85 PART types (6%) and 836 PART tokens (11%). Out of 16 observed tags, the rank of PART is: 8 in number of lemmas, 8 in number of types and 4 in number of tokens.

The 10 most frequent PART lemmas: tòː, hŋ́, bàː, ɗi, máː, oː, ni, aː, kən, nə

The 10 most frequent PART types: tòː, bàː, máː, tôː, oː, ɗi, hŋ́, nə, ni, n

The 10 most frequent ambiguous lemmas: máː (PART 54, ADV 1), ni (PART 39, X 1), (PART 37, INTJ 1), (PART 35, ADP 1), mə́n (PART 31, NOUN 21), kúmá (PART 29, CCONJ 3), wéy (PART 29, X 3), (PART 15, ADV 9, SCONJ 1), ŋaː (ADJ 15, PART 14, NOUN 7), ɗa (ADP 24, PART 11, SCONJ 4, X 2, ADV 1)

The 10 most frequent ambiguous types: máː (PART 54, AUX 14, ADV 1), (PART 36, ADP 1), ni (PART 35, X 1), n (PART 32, ADP 1), mə́n (PART 29, NOUN 21), wéy (PART 29, X 3), (PART 28, INTJ 1), kúmá (PART 28, CCONJ 3), (AUX 21, PART 15, ADV 9, SCONJ 1, VERB 1), ɗa (ADP 20, PART 9, SCONJ 9, ADV 3, X 2)

Morphology

The form / lemma ratio of PART is 1.931818 (the average of all parts of speech is 1.640000).

The 1st highest number of forms (14) was observed with the lemma “kən”: gən, gəndí, gə̂n, kən, kəndí, n, ngən, ən, ɣən, ɣəndá, ɣəndí, ɣəní, ɣə́n, ɣə̂n.

The 2nd highest number of forms (11) was observed with the lemma “hŋ́”: hŋ́, hḿ, n, ń, ŋ, ŋ́, ǎn, ǐn, ə̌n, ə̌níː, ə̌ŋ.

The 3rd highest number of forms (5) was observed with the lemma “aː”: aː, àː, âː, ǎ, ǎː.

PART occurs with 7 features: Polarity (176; 21% instances), PartType (132; 16% instances), VerbType (93; 11% instances), Aspect (42; 5% instances), Mood (36; 4% instances), Deixis (8; 1% instances), PronType (1; 0% instances)

PART occurs with 10 feature-value pairs: Aspect=Inch, Aspect=Res, Deixis=Prox, Mood=Irr, Mood=Qot, PartType=Emp, PartType=Int, Polarity=Neg, PronType=Int, VerbType=Cop

PART occurs with 15 feature combinations. The most frequent feature combination is _ (363 tokens). Examples: tòː, máː, tôː, ɗi, mə́n, kúmá, tá, ɗì, kàm, fáː

Relations

PART nodes are attached to their parents using 16 different relations: advmod (342; 41% instances), discourse (262; 31% instances), compound:prt (131; 16% instances), root (51; 6% instances), dep (7; 1% instances), case (6; 1% instances), ccomp (6; 1% instances), fixed (6; 1% instances), parataxis (6; 1% instances), advcl (5; 1% instances), reparandum (5; 1% instances), mark (4; 0% instances), acl:relcl (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), flat:foreign (1; 0% instances)

Parents of PART nodes belong to 13 different parts of speech: VERB (550; 66% instances), (51; 6% instances), NOUN (48; 6% instances), PART (47; 6% instances), AUX (39; 5% instances), PRON (33; 4% instances), ADV (21; 3% instances), X (18; 2% instances), INTJ (9; 1% instances), PROPN (8; 1% instances), SCONJ (7; 1% instances), ADJ (4; 0% instances), NUM (1; 0% instances)

706 (84%) PART nodes are leaves.

68 (8%) PART nodes have one child.

14 (2%) PART nodes have two children.

48 (6%) PART nodes have three or more children.

The highest child degree of a PART node is 9.

Children of PART nodes are attached using 15 different relations: punct (121; 40% instances), xcomp (36; 12% instances), advmod (31; 10% instances), nsubj (31; 10% instances), discourse (22; 7% instances), advcl (16; 5% instances), mark (14; 5% instances), dislocated (13; 4% instances), parataxis (7; 2% instances), reparandum (4; 1% instances), dep (2; 1% instances), obl (2; 1% instances), conj (1; 0% instances), csubj (1; 0% instances), vocative (1; 0% instances)

Children of PART nodes belong to 14 different parts of speech: PUNCT (121; 40% instances), PART (47; 16% instances), NOUN (42; 14% instances), VERB (24; 8% instances), PRON (21; 7% instances), SCONJ (13; 4% instances), PROPN (8; 3% instances), X (8; 3% instances), ADV (6; 2% instances), INTJ (5; 2% instances), ADJ (3; 1% instances), CCONJ (2; 1% instances), ADP (1; 0% instances), AUX (1; 0% instances)