Treebank Statistics: UD_Zaar-Autogramm: POS Tags: PART
There are 33 PART lemmas (2%), 93 PART types (3%) and 1935 PART tokens (9%).
Out of 16 observed tags, the rank of PART is: 12 in number of lemmas, 8 in number of types and 5 in number of tokens.
The 10 most frequent PART lemmas: tòː, hŋ́, ɗi, kən, aː, bàː, oː, ni, kúmá, mə́n
The 10 most frequent PART types: tôː, hŋ́, ɗi, bàː, aː, tòː, oː, ni, kúmá, máː
The 10 most frequent ambiguous lemmas: tòː (PART 443, INTJ 17, CCONJ 1), aː (PART 156, INTJ 3), ni (PART 94, X 1), kúmá (PART 67, CCONJ 6, X 1), mə́n (PART 64, NOUN 44), máː (PART 62, ADV 28, X 1), wéy (PART 33, SCONJ 2), eː (PART 22, INTJ 5), ɗa (ADP 56, ADV 36, SCONJ 18, PART 14, AUX 7), kóː (CCONJ 92, PART 13, SCONJ 12, DET 5, ADV 3, ADP 1, X 1)
The 10 most frequent ambiguous types: tôː (PART 327, INTJ 12, CCONJ 1), aː (PART 122, INTJ 2), tòː (PART 113, INTJ 5), ni (PART 85, X 1), kúmá (PART 66, CCONJ 6), máː (PART 62, AUX 30, ADV 28, X 1), mə́n (PART 57, NOUN 55), wéy (PART 33, SCONJ 2), ŋaː (ADJ 38, PART 24, NOUN 9, ADV 2), eː (PART 21, INTJ 4)
- tôː
- aː
- tòː
- ni
- kúmá
- máː
- mə́n
- wéy
- ŋaː
- eː
Morphology
The form / lemma ratio of PART is 2.818182 (the average of all parts of speech is 1.692524).
The 1st highest number of forms (20) was observed with the lemma “kən”: gən, gəndí, gəní, gənín, gə̂n, kən, kəndá, kəndí, kəní, kənín, n, ŋ, əŋ, ɣən, ɣəndá, ɣəndí, ɣəní, ɣənín, ɣəŋ, ɣə̂n.
The 2nd highest number of forms (15) was observed with the lemma “hŋ́”: hŋ́, hŋ́ə́y, hḿ, n, ń, ň, ŋ, ŋə́y, ŋ́, ŋ̌, ǐn, ə́ŋ, ə̌n, ə̌ŋ, ə̌ːníː.
The 3rd highest number of forms (7) was observed with the lemma “aː”: aː, yaː, àː, âː, ŋaː, ŋâː, ŋǎː.
PART occurs with 5 features: PartType (1932; 100% instances), Polarity (460; 24% instances), Evident (33; 2% instances), Deixis (23; 1% instances), Foreign (14; 1% instances)
PART occurs with 13 feature-value pairs: Deixis=Prox, Deixis=Remt, Evident=Nfh, Foreign=Yes, PartType=Adv, PartType=Case, PartType=Disc, PartType=Foc, PartType=Ill, PartType=Neg, PartType=Pred, PartType=Top, Polarity=Neg
PART occurs with 21 feature combinations.
The most frequent feature combination is PartType=Disc (513 tokens).
Examples: tôː, tòː, kúmá, wéy, kóː, wàːtòː, kàm, fáː, fâː, koː
Relations
PART nodes are attached to their parents using 9 different relations: discourse (848; 44% instances), advmod (703; 36% instances), compound:prt (355; 18% instances), root (14; 1% instances), ccomp (5; 0% instances), fixed (3; 0% instances), parataxis (3; 0% instances), reparandum (3; 0% instances), conj (1; 0% instances)
Parents of PART nodes belong to 16 different parts of speech: VERB (1420; 73% instances), NOUN (219; 11% instances), PRON (107; 6% instances), ADV (54; 3% instances), AUX (28; 1% instances), PROPN (24; 1% instances), INTJ (20; 1% instances), PART (16; 1% instances), (14; 1% instances), X (10; 1% instances), ADP (8; 0% instances), NUM (5; 0% instances), ADJ (3; 0% instances), DET (3; 0% instances), SCONJ (3; 0% instances), CCONJ (1; 0% instances)
1806 (93%) PART nodes are leaves.
109 (6%) PART nodes have one child.
10 (1%) PART nodes have two children.
10 (1%) PART nodes have three or more children.
The highest child degree of a PART node is 6.
Children of PART nodes are attached using 10 different relations: punct (120; 68% instances), discourse (14; 8% instances), nsubj (11; 6% instances), advcl (9; 5% instances), advmod (7; 4% instances), mark (6; 3% instances), reparandum (3; 2% instances), xcomp (3; 2% instances), dislocated (2; 1% instances), obl (1; 1% instances)
Children of PART nodes belong to 10 different parts of speech: PUNCT (120; 68% instances), PART (16; 9% instances), NOUN (11; 6% instances), VERB (8; 5% instances), SCONJ (6; 3% instances), INTJ (5; 3% instances), PRON (4; 2% instances), PROPN (4; 2% instances), ADV (1; 1% instances), CCONJ (1; 1% instances)