
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: AUX

There are 29 AUX lemmas (0%), 65 AUX types (0%) and 3892 AUX tokens (3%). Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 11 in number of types and 9 in number of tokens.

The 10 most frequent AUX lemmas: 是、 了、 为、 被、 会、 可以、 着、 可、 能、 要

The 10 most frequent AUX types: 是、 了、 为、 被、 会、 可以、 着、 可、 也是、 能

The 10 most frequent ambiguous lemmas: 是 (AUX 1062, VERB 384), 了 (AUX 764, PART 44, VERB 2), 为 (VERB 609, AUX 581, ADP 133, PROPN 1), 会 (AUX 224, PART 137, NOUN 3), 着 (AUX 131, VERB 1), 可 (AUX 114, SCONJ 1), 能 (AUX 104, PART 8), 要 (AUX 68, VERB 7), 可能 (AUX 60, NOUN 9), 过 (AUX 60, VERB 7, PROPN 2, ADJ 1)

The 10 most frequent ambiguous types: 是 (AUX 884, VERB 322), 了 (AUX 764, PART 44, VERB 2), 为 (AUX 568, VERB 496, ADP 133, PROPN 1), 会 (AUX 200, PART 137, NOUN 3), 着 (AUX 131, VERB 1), 可 (AUX 106, SCONJ 1), 也是 (AUX 76, VERB 5, SCONJ 3), 能 (AUX 72, PART 8), 要 (AUX 67, VERB 7), 可能 (AUX 60, NOUN 9)

Morphology

The form / lemma ratio of AUX is 2.241379 (the average of all parts of speech is 1.004660).
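The reported ratio is consistent with dividing the number of distinct AUX word forms (65 types) by the number of distinct AUX lemmas (29). The following is a minimal Python sketch of that arithmetic, assuming this is how the figure is derived. The helper name `form_lemma_ratio` and the toy pairs are illustrative and are not part of any treebank tooling.

```python
def form_lemma_ratio(pairs):
    """Return (distinct forms) / (distinct lemmas) for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)

# Hypothetical toy sample: three forms of 是 and one form of 了.
sample = [("是", "是"), ("不是", "是"), ("也是", "是"), ("了", "了"), ("是", "是")]
toy_ratio = form_lemma_ratio(sample)  # 4 distinct forms / 2 distinct lemmas

# Counts reported on this page: 65 AUX types and 29 AUX lemmas.
reported_ratio = round(65 / 29, 6)
```

A ratio well above 1 here reflects that fused forms such as 不是 or 也是 are lemmatized to a single lemma.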

The highest number of forms (21) was observed with the lemma “是”: 不是, 且是, 也是, 亦是, 仍是, 便是, 则是, 却是, 又是, 只是, 就是, 或是, 才是, 是, 是否, 是否是, 更是, 正是, 而是, 还是, 都是.

The 2nd highest number of forms (6) was observed with the lemma “为”: 为, 亦为, 以为, 则为, 更为, 认为.

The 3rd highest number of forms (4) was observed with the lemma “能”: 不能, 未能, 没能, 能.

AUX occurs with 3 features: Aspect (955; 25% instances), Voice (425; 11% instances), Polarity (112; 3% instances)

AUX occurs with 4 feature-value pairs: Aspect=Perf, Aspect=Prog, Polarity=Neg, Voice=Pass

AUX occurs with 5 feature combinations. The most frequent feature combination is _ (2400 tokens). Examples: 是、 为、 会、 可以、 可、 也是、 能、 要、 可能、 就是
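Feature statistics of this kind can be gathered from the FEATS column of a CoNLL-U file, where features are written as `Name=Value` pairs joined by `|` and `_` marks an empty feature set. The sketch below assumes the FEATS strings of the AUX tokens have already been extracted. The function names and the toy input are hypothetical.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; "_" means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_stats(feats_column):
    """Count feature names and whole feature combinations."""
    names, combos = Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        names.update(parse_feats(feats).keys())
    return names, combos

# Hypothetical toy input, not taken from the treebank.
toy = ["_", "Aspect=Perf", "Voice=Pass", "_", "Aspect=Prog", "Polarity=Neg"]
names, combos = feature_stats(toy)

# Sanity check on the reported figures: the empty combination plus the
# three per-feature counts add up to the 3892 AUX tokens.
reported_total = 2400 + 955 + 425 + 112
```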

Relations

AUX nodes are attached to their parents using 8 different relations: aux (1827; 47% instances), cop (1630; 42% instances), aux:pass (425; 11% instances), conj (3; 0% instances), xcomp (3; 0% instances), ccomp (2; 0% instances), acl:relcl (1; 0% instances), root (1; 0% instances)
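A distribution like the one above can be reproduced by counting the DEPREL column (the eighth of the ten tab-separated CoNLL-U columns) for every token whose UPOS is AUX. This is a minimal sketch rather than the script that generated this page. The function names and the two toy sentences are assumptions made for illustration.

```python
from collections import Counter

def aux_relation_counts(conllu_text):
    """Count DEPREL values of tokens whose UPOS column is AUX."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == "AUX":
            counts[cols[7]] += 1
    return counts

def percentages(counts):
    """Express each count as a whole-number percentage of the total."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}

# Two hypothetical sentences in CoNLL-U format.
toy = "\n".join([
    "# text = 他是学生",
    "1\t他\t他\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\t是\t是\tAUX\t_\t_\t3\tcop\t_\t_",
    "3\t学生\t学生\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "# text = 他会来",
    "1\t他\t他\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\t会\t会\tAUX\t_\t_\t3\taux\t_\t_",
    "3\t来\t来\tVERB\t_\t_\t0\troot\t_\t_",
])
toy_counts = aux_relation_counts(toy)

# Counts reported on this page, to check the rounded percentages.
reported = {"aux": 1827, "cop": 1630, "aux:pass": 425, "conj": 3,
            "xcomp": 3, "ccomp": 2, "acl:relcl": 1, "root": 1}
reported_pct = percentages(reported)
```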

Parents of AUX nodes belong to 11 different parts of speech: VERB (2226; 57% instances), NOUN (1175; 30% instances), PART (248; 6% instances), NUM (93; 2% instances), ADJ (87; 2% instances), PROPN (40; 1% instances), X (16; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), PRON (1; 0% instances), no part of speech, i.e. the artificial root (1; 0% instances)

3876 (100%) AUX nodes are leaves.

11 (0%) AUX nodes have one child.

2 (0%) AUX nodes have two children.

3 (0%) AUX nodes have three or more children.

The highest child degree of an AUX node is 5.
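The child-degree figures above amount to counting, for each AUX token, how many tokens in the same sentence name it as their head. The sketch below assumes each sentence is given as a list of `(id, upos, head)` tuples. That input shape and the name `child_degrees` are hypothetical choices for illustration.

```python
from collections import Counter

def child_degrees(sentences, upos="AUX"):
    """Map child degree -> number of nodes with that UPOS and degree."""
    histogram = Counter()
    for sentence in sentences:
        # Number of dependents per head id within this sentence.
        children = Counter(head for _, _, head in sentence)
        for token_id, tag, _ in sentence:
            if tag == upos:
                histogram[children[token_id]] += 1
    return histogram

# Hypothetical toy input: one leaf AUX and one AUX with two children.
toy = [
    [(1, "PRON", 3), (2, "AUX", 3), (3, "NOUN", 0)],
    [(1, "PRON", 2), (2, "AUX", 0), (3, "PUNCT", 2)],
]
toy_histogram = child_degrees(toy)

# The reported buckets (leaves, one, two, three or more children)
# together cover all 3892 AUX tokens.
reported_total = 3876 + 11 + 2 + 3
```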

Children of AUX nodes are attached using 11 different relations: punct (8; 30% instances), advmod (3; 11% instances), cc (3; 11% instances), conj (3; 11% instances), mark (3; 11% instances), nsubj (2; 7% instances), acl (1; 4% instances), advcl (1; 4% instances), csubj (1; 4% instances), mark:rel (1; 4% instances), parataxis (1; 4% instances)

Children of AUX nodes belong to 8 different parts of speech: PUNCT (8; 30% instances), SCONJ (4; 15% instances), VERB (4; 15% instances), ADV (3; 11% instances), AUX (3; 11% instances), CCONJ (3; 11% instances), NOUN (1; 4% instances), PRON (1; 4% instances)