Treebank Statistics: UD_Chintang-CTNTB: POS Tags: PART
There are 23 PART lemmas (1%), 68 PART types (2%) and 1589 PART tokens (11%).
Out of 16 observed tags, the rank of PART is: 8 in number of lemmas, 7 in number of types and 4 in number of tokens.
The 10 most frequent PART lemmas: ta, na, yaŋ, lo, ni, aŋ, hola, caĩ, raicha, manchi
The 10 most frequent PART types: ta, na, yaŋ, lo, ni, aŋ, hola, raicha, o, le
The 10 most frequent ambiguous lemmas: ta (PART 387, VERB 55), na (PART 245, NOUN 10, CCONJ 7), yaŋ (PART 244, SCONJ 1), lo (PART 122, INTJ 27), aŋ (PART 62, ADV 13), manchi (PART 44, INTJ 2), o (PART 39, INTJ 1), maha (PART 37, INTJ 3), them (PRON 61, PART 19, DET 2, ADV 1, VERB 1), khoi (INTJ 10, PART 3)
The 10 most frequent ambiguous types: ta (PART 361, VERB 2), na (PART 243, CCONJ 4), yaŋ (PART 243, SCONJ 1), lo (PART 120, INTJ 2), aŋ (PART 61, ADV 10), o (PART 38, INTJ 6, NOUN 1), them (PRON 31, PART 19, ADV 1, DET 1), mahaʔ (PART 16, INTJ 1), manche (PART 8, INTJ 1), khoi (PART 3, INTJ 1)
- ta
- na
- yaŋ
- lo
- aŋ
- o
- them
- mahaʔ
- manche
- khoi
Morphology
The form / lemma ratio of PART is 2.956522 (the average of all parts of speech is 2.521544).
The 1st highest number of forms (10) was observed with the lemma “gonei”: ei, gone, gonei, goneĩ, konei, nai, ne, nei, one, oneĩ.
The 2nd highest number of forms (10) was observed with the lemma “manchi”: Manchiʔ, ma, mahaʔ, mahã, manche, mancheʔ, manchi, manchiʔŋa, nchi, nchiʔ.
The 3rd highest number of forms (6) was observed with the lemma “ai”: ai, aitira, aiŋa, e, i, isaŋa.
PART occurs with 3 features: InfStruct (1070; 67% instances), Polarity (54; 3% instances), Case (5; 0% instances)
PART occurs with 6 feature-value pairs: Case=Erg, Case=Loc, InfStruct=Foc, InfStruct=Top, InfStruct=Uniq, Polarity=Neg
PART occurs with 8 feature combinations.
The most frequent feature combination is InfStruct=Foc (810 tokens).
Examples: ta, yaŋ, lo, le, taʔ, ai, leʔ, i, leʔle, aitira
Relations
PART nodes are attached to their parents using 11 different relations: advmod:emph (1067; 67% instances), discourse (462; 29% instances), advmod:cop (41; 3% instances), conj (5; 0% instances), advmod (4; 0% instances), root (3; 0% instances), acl:nmlz (2; 0% instances), advcl (2; 0% instances), advmod:nmlz (1; 0% instances), cc (1; 0% instances), parataxis (1; 0% instances)
Parents of PART nodes belong to 13 different parts of speech: VERB (686; 43% instances), NOUN (515; 32% instances), ADV (139; 9% instances), PRON (132; 8% instances), PROPN (38; 2% instances), CCONJ (27; 2% instances), NUM (20; 1% instances), ADJ (15; 1% instances), DET (7; 0% instances), INTJ (5; 0% instances), (3; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)
1529 (96%) PART nodes are leaves.
57 (4%) PART nodes have one child.
2 (0%) PART nodes have two children.
1 (0%) PART nodes have three or more children.
The highest child degree of a PART node is 4.
Children of PART nodes are attached using 11 different relations: punct (47; 72% instances), cc (6; 9% instances), advmod (2; 3% instances), mark (2; 3% instances), parataxis (2; 3% instances), advcl (1; 2% instances), advmod:emph (1; 2% instances), det (1; 2% instances), discourse (1; 2% instances), nsubj (1; 2% instances), obj (1; 2% instances)
Children of PART nodes belong to 11 different parts of speech: PUNCT (47; 72% instances), CCONJ (6; 9% instances), ADV (2; 3% instances), SCONJ (2; 3% instances), VERB (2; 3% instances), ADJ (1; 2% instances), DET (1; 2% instances), INTJ (1; 2% instances), NOUN (1; 2% instances), PART (1; 2% instances), PRON (1; 2% instances)