Treebank Statistics: UD_Zaar-Autogramm: POS Tags: PART
There are 44 PART lemmas (5%), 85 PART types (6%) and 836 PART tokens (11%).
Out of 16 observed tags, the rank of PART is: 8 in number of lemmas, 8 in number of types and 4 in number of tokens.
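The lemma, type and token counts above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout; the function name `upos_counts`, the helper `row` and the toy sample are hypothetical illustrations (the sample rows are not real Zaar sentences), and the sketch assumes types are counted as case-sensitive distinct word forms.

```python
from collections import Counter


def upos_counts(conllu_text, upos="PART"):
    """Count distinct lemmas, distinct word forms (types) and tokens for one UPOS tag."""
    lemmas, forms = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms[cols[1]] += 1
            lemmas[cols[2]] += 1
    return {"lemmas": len(lemmas), "types": len(forms), "tokens": sum(forms.values())}


def row(idx, form, lemma, upos, head, deprel):
    """Build one tab-separated CoNLL-U line (toy helper for the sample below)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Toy sample: forms and lemmas are taken from the lists on this page,
# but the tree structure is invented purely for illustration.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "tòː", "tòː", "PART", 2, "discourse"),
    row(2, "kən", "kən", "PART", 0, "root"),
    row(3, "gən", "kən", "PART", 2, "advmod"),
    row(4, "tòː", "tòː", "PART", 2, "discourse"),
])

counts = upos_counts(SAMPLE)
print(counts)
```

Run over the full treebank with `upos="PART"`, a count of this kind should correspond to the 44 lemmas, 85 types and 836 tokens reported here, provided the page uses the same definitions.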
The 10 most frequent PART lemmas: tòː, hŋ́, bàː, ɗi, máː, oː, ni, aː, kən, nə
The 10 most frequent PART types: tòː, bàː, máː, tôː, oː, ɗi, hŋ́, nə, ni, n
The 10 most frequent ambiguous lemmas: máː (PART 54, ADV 1), ni (PART 39, X 1), aː (PART 37, INTJ 1), nə (PART 35, ADP 1), mə́n (PART 31, NOUN 21), kúmá (PART 29, CCONJ 3), wéy (PART 29, X 3), tá (PART 15, ADV 9, SCONJ 1), ŋaː (ADJ 15, PART 14, NOUN 7), ɗa (ADP 24, PART 11, SCONJ 4, X 2, ADV 1)
The 10 most frequent ambiguous types: máː (PART 54, AUX 14, ADV 1), nə (PART 36, ADP 1), ni (PART 35, X 1), n (PART 32, ADP 1), mə́n (PART 29, NOUN 21), wéy (PART 29, X 3), aː (PART 28, INTJ 1), kúmá (PART 28, CCONJ 3), tá (AUX 21, PART 15, ADV 9, SCONJ 1, VERB 1), ɗa (ADP 20, PART 9, SCONJ 9, ADV 3, X 2)
Morphology
The form / lemma ratio of PART is 1.931818 (the average of all parts of speech is 1.640000).
The 1st highest number of forms (14) was observed with the lemma “kən”: gən, gəndí, gə̂n, kən, kəndí, n, ngən, ən, ɣən, ɣəndá, ɣəndí, ɣəní, ɣə́n, ɣə̂n.
The 2nd highest number of forms (11) was observed with the lemma “hŋ́”: hŋ́, hḿ, n, ń, ŋ, ŋ́, ǎn, ǐn, ə̌n, ə̌níː, ə̌ŋ.
The 3rd highest number of forms (5) was observed with the lemma “aː”: aː, àː, âː, ǎ, ǎː.
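The morphology figures above can be illustrated with a short sketch. The reported ratio of 1.931818 agrees with dividing the 85 PART types by the 44 PART lemmas; the function names `form_lemma_ratio` and `forms_per_lemma` are hypothetical, and the (form, lemma) pairs are the five forms listed above for the lemma “aː”.

```python
from collections import defaultdict


def form_lemma_ratio(n_types, n_lemmas):
    """Ratio of distinct word forms to distinct lemmas."""
    return n_types / n_lemmas


def forms_per_lemma(pairs):
    """Group distinct forms under their lemma; pairs are (form, lemma) tuples."""
    by_lemma = defaultdict(set)
    for form, lemma in pairs:
        by_lemma[lemma].add(form)
    return by_lemma


# 85 PART types and 44 PART lemmas, as reported on this page.
ratio = form_lemma_ratio(85, 44)
print(f"{ratio:.6f}")

# The forms listed for the lemma "aː"; a repeated pair is counted only once.
pairs = [("aː", "aː"), ("àː", "aː"), ("âː", "aː"), ("ǎ", "aː"), ("ǎː", "aː"), ("aː", "aː")]
grouped = forms_per_lemma(pairs)
print(len(grouped["aː"]))
```

Note that a form such as “n” is listed under both “kən” and “hŋ́”, so summing the per-lemma form counts over all lemmas need not equal the number of types.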
PART occurs with 7 features: Polarity (176; 21% instances), PartType (132; 16% instances), VerbType (93; 11% instances), Aspect (42; 5% instances), Mood (36; 4% instances), Deixis (8; 1% instances), PronType (1; 0% instances)
PART occurs with 10 feature-value pairs: Aspect=Inch, Aspect=Res, Deixis=Prox, Mood=Irr, Mood=Qot, PartType=Emp, PartType=Int, Polarity=Neg, PronType=Int, VerbType=Cop
PART occurs with 15 feature combinations.
The most frequent feature combination is _ (363 tokens).
Examples: tòː, máː, tôː, ɗi, mə́n, kúmá, tá, ɗì, kàm, fáː
Relations
PART nodes are attached to their parents using 16 different relations: advmod (342; 41% instances), discourse (262; 31% instances), compound:prt (131; 16% instances), root (51; 6% instances), dep (7; 1% instances), case (6; 1% instances), ccomp (6; 1% instances), fixed (6; 1% instances), parataxis (6; 1% instances), advcl (5; 1% instances), reparandum (5; 1% instances), mark (4; 0% instances), acl:relcl (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), flat:foreign (1; 0% instances)
Parents of PART nodes belong to 13 different parts of speech: VERB (550; 66% instances), [root] (51; 6% instances), NOUN (48; 6% instances), PART (47; 6% instances), AUX (39; 5% instances), PRON (33; 4% instances), ADV (21; 3% instances), X (18; 2% instances), INTJ (9; 1% instances), PROPN (8; 1% instances), SCONJ (7; 1% instances), ADJ (4; 0% instances), NUM (1; 0% instances)
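The parent distribution above can be sketched as a lookup of each PART node's head within its sentence. This is a minimal illustration, not the script that generated this page: the function name `parent_upos_counts`, the tuple layout `(id, upos, head)` and the placeholder label "[root]" are assumptions. In CoNLL-U a head of 0 marks the sentence root, which has no part of speech of its own; this is consistent with the 51 PART nodes attached via the root relation.

```python
from collections import Counter


def parent_upos_counts(sentences, upos="PART"):
    """Tally the UPOS of the parent of every node tagged `upos`.

    sentences: list of sentences, each a list of (id, upos, head) tuples.
    A head of 0 has no entry in the sentence, so it is labelled "[root]".
    """
    counts = Counter()
    for sent in sentences:
        tag_of = {tok_id: tag for tok_id, tag, _ in sent}
        for _, tag, head in sent:
            if tag == upos:
                counts[tag_of.get(head, "[root]")] += 1
    return counts


# Toy sentences invented for illustration (not taken from the treebank).
toy = [
    [(1, "PART", 2), (2, "VERB", 0)],   # PART attached to a VERB
    [(1, "PART", 0), (2, "PUNCT", 1)],  # PART is itself the sentence root
]
parents = parent_upos_counts(toy)
print(parents)
```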
706 (84%) PART nodes are leaves.
68 (8%) PART nodes have one child.
14 (2%) PART nodes have two children.
48 (6%) PART nodes have three or more children.
The highest child degree of a PART node is 9.
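The child-degree breakdown above can be sketched by counting, for each PART node, how many tokens in the same sentence name it as their head. The function name `degree_buckets` and the tuple layout `(id, upos, head)` are hypothetical; the second half of the sketch only checks that the four reported counts add up to the 836 PART tokens and round to the percentages shown.

```python
from collections import Counter


def degree_buckets(sentences, upos="PART"):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2, or 3 (meaning 3+).

    sentences: list of sentences, each a list of (id, upos, head) tuples.
    """
    buckets = Counter()
    for sent in sentences:
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                buckets[min(n_children[tok_id], 3)] += 1
    return buckets


# Toy sentence invented for illustration: token 1 is a leaf, token 3 has one child.
toy = [[(1, "PART", 2), (2, "VERB", 0), (3, "PART", 2), (4, "PUNCT", 3)]]
toy_buckets = degree_buckets(toy)
print(toy_buckets)

# Consistency check on the figures reported on this page.
reported = {0: 706, 1: 68, 2: 14, 3: 48}
total = sum(reported.values())
percentages = [round(100 * n / total) for n in reported.values()]
print(total, percentages)
```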
Children of PART nodes are attached using 15 different relations: punct (121; 40% instances), xcomp (36; 12% instances), advmod (31; 10% instances), nsubj (31; 10% instances), discourse (22; 7% instances), advcl (16; 5% instances), mark (14; 5% instances), dislocated (13; 4% instances), parataxis (7; 2% instances), reparandum (4; 1% instances), dep (2; 1% instances), obl (2; 1% instances), conj (1; 0% instances), csubj (1; 0% instances), vocative (1; 0% instances)
Children of PART nodes belong to 14 different parts of speech: PUNCT (121; 40% instances), PART (47; 16% instances), NOUN (42; 14% instances), VERB (24; 8% instances), PRON (21; 7% instances), SCONJ (13; 4% instances), PROPN (8; 3% instances), X (8; 3% instances), ADV (6; 2% instances), INTJ (5; 2% instances), ADJ (3; 1% instances), CCONJ (2; 1% instances), ADP (1; 0% instances), AUX (1; 0% instances)