home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSD: POS Tags: AUX

There are 29 AUX lemmas (0%), 67 AUX types (0%) and 3892 AUX tokens (3%). Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 11 in number of types and 9 in number of tokens.

The 10 most frequent AUX lemmas: 是、 了、 為、 被、 會、 可以、 著、 可、 能、 要

The 10 most frequent AUX types: 是、 了、 為、 被、 會、 可以、 著、 可、 也是、 能

The 10 most frequent ambiguous lemmas: 是 (AUX 1062, VERB 384), 了 (AUX 764, PART 44, VERB 2), 為 (VERB 609, AUX 581, ADP 131, PROPN 1), 會 (AUX 224, PART 137, NOUN 3), 著 (AUX 131, VERB 2), 可 (AUX 114, SCONJ 1), 能 (AUX 104, PART 8), 要 (AUX 68, VERB 7), 可能 (AUX 60, NOUN 9), 過 (AUX 60, VERB 7, PROPN 2, ADJ 1)

The 10 most frequent ambiguous types: 是 (AUX 884, VERB 322), 了 (AUX 764, PART 44, VERB 2), 為 (AUX 566, VERB 495, ADP 131, PROPN 1), 會 (AUX 200, PART 137, NOUN 3), 著 (AUX 131, VERB 2), 可 (AUX 106, SCONJ 1), 也是 (AUX 76, VERB 5, SCONJ 3), 能 (AUX 72, PART 8), 要 (AUX 67, VERB 7), 可能 (AUX 60, NOUN 9)

Morphology

The form / lemma ratio of AUX is 2.310345 (the average of all parts of speech is 1.004819).

The 1st highest number of forms (21) was observed with the lemma “是”: 不是, 且是, 也是, 亦是, 仍是, 便是, 則是, 卻是, 又是, 只是, 就是, 或是, 才是, 是, 是否, 是否是, 更是, 正是, 而是, 還是, 都是.

The 2nd highest number of forms (8) was observed with the lemma “為”: 亦為, 以為, 以爲, 則為, 更為, 為, 爲, 認為.

The 3rd highest number of forms (4) was observed with the lemma “能”: 不能, 未能, 沒能, 能.

AUX occurs with 3 features: Aspect (955; 25% instances), Voice (425; 11% instances), Polarity (112; 3% instances)

AUX occurs with 4 feature-value pairs: Aspect=Perf, Aspect=Prog, Polarity=Neg, Voice=Pass

AUX occurs with 5 feature combinations. The most frequent feature combination is _ (2400 tokens). Examples: 是、 為、 會、 可以、 可、 也是、 能、 要、 可能、 就是

Relations

AUX nodes are attached to their parents using 8 different relations: aux (1827; 47% instances), cop (1630; 42% instances), aux:pass (425; 11% instances), conj (3; 0% instances), xcomp (3; 0% instances), ccomp (2; 0% instances), acl:relcl (1; 0% instances), root (1; 0% instances)

Parents of AUX nodes belong to 11 different parts of speech: VERB (2226; 57% instances), NOUN (1164; 30% instances), PART (247; 6% instances), NUM (93; 2% instances), ADJ (87; 2% instances), PROPN (51; 1% instances), X (17; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), PRON (1; 0% instances), (1; 0% instances)

3876 (100%) AUX nodes are leaves.

11 (0%) AUX nodes have one child.

2 (0%) AUX nodes have two children.

3 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 5.

Children of AUX nodes are attached using 11 different relations: punct (8; 30% instances), advmod (3; 11% instances), cc (3; 11% instances), conj (3; 11% instances), mark (3; 11% instances), nsubj (2; 7% instances), acl (1; 4% instances), advcl (1; 4% instances), csubj (1; 4% instances), mark:rel (1; 4% instances), parataxis (1; 4% instances)

Children of AUX nodes belong to 8 different parts of speech: PUNCT (8; 30% instances), SCONJ (4; 15% instances), VERB (4; 15% instances), ADV (3; 11% instances), AUX (3; 11% instances), CCONJ (3; 11% instances), NOUN (1; 4% instances), PRON (1; 4% instances)