Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: AUX
There are 29 AUX lemmas (0%), 65 AUX types (0%) and 3892 AUX tokens (3%).
Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 11 in number of types and 9 in number of tokens.
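The lemma, type and token counts above can be reproduced from a CoNLL-U file by grouping on the UPOS column. The following is a minimal sketch, not the script that generated this page: the two-sentence `SAMPLE` fragment and the function name `upos_counts` are hypothetical, and it assumes that types are counted as distinct surface forms per tag.

```python
from collections import defaultdict

# Hypothetical CoNLL-U fragment for illustration only (not taken from the treebank).
SAMPLE = (
    "# text = 他是学生\n"
    "1\t他\t他\tPRON\t_\t_\t3\tnsubj\t_\t_\n"
    "2\t是\t是\tAUX\t_\t_\t3\tcop\t_\t_\n"
    "3\t学生\t学生\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "# text = 他也是学生\n"
    "1\t他\t他\tPRON\t_\t_\t3\tnsubj\t_\t_\n"
    "2\t也是\t是\tAUX\t_\t_\t3\tcop\t_\t_\n"
    "3\t学生\t学生\tNOUN\t_\t_\t0\troot\t_\t_\n"
)

def upos_counts(conllu_text):
    """Return {UPOS: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

print(upos_counts(SAMPLE)["AUX"])
```

In the sample, the two AUX tokens share one lemma but have two distinct forms, which is the same pattern that pushes the AUX form / lemma ratio above 1 in the treebank.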
The 10 most frequent AUX lemmas: 是、 了、 为、 被、 会、 可以、 着、 可、 能、 要
The 10 most frequent AUX types: 是、 了、 为、 被、 会、 可以、 着、 可、 也是、 能
The 10 most frequent ambiguous lemmas: 是 (AUX 1062, VERB 384), 了 (AUX 764, PART 44, VERB 2), 为 (VERB 609, AUX 581, ADP 133, PROPN 1), 会 (AUX 224, PART 137, NOUN 3), 着 (AUX 131, VERB 1), 可 (AUX 114, SCONJ 1), 能 (AUX 104, PART 8), 要 (AUX 68, VERB 7), 可能 (AUX 60, NOUN 9), 过 (AUX 60, VERB 7, PROPN 2, ADJ 1)
The 10 most frequent ambiguous types: 是 (AUX 884, VERB 322), 了 (AUX 764, PART 44, VERB 2), 为 (AUX 568, VERB 496, ADP 133, PROPN 1), 会 (AUX 200, PART 137, NOUN 3), 着 (AUX 131, VERB 1), 可 (AUX 106, SCONJ 1), 也是 (AUX 76, VERB 5, SCONJ 3), 能 (AUX 72, PART 8), 要 (AUX 67, VERB 7), 可能 (AUX 60, NOUN 9)
Morphology
The form / lemma ratio of AUX is 2.241379 (the average of all parts of speech is 1.004660).
The 1st highest number of forms (21) was observed with the lemma “是”: 不是, 且是, 也是, 亦是, 仍是, 便是, 则是, 却是, 又是, 只是, 就是, 或是, 才是, 是, 是否, 是否是, 更是, 正是, 而是, 还是, 都是.
The 2nd highest number of forms (6) was observed with the lemma “为”: 为, 亦为, 以为, 则为, 更为, 认为.
The 3rd highest number of forms (4) was observed with the lemma “能”: 不能, 未能, 没能, 能.
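The form / lemma ratio reported above is the number of AUX types divided by the number of AUX lemmas (65 / 29). A short sketch of that arithmetic, together with the per-lemma form counts listed above; the variable names are illustrative, and the form lists are copied from this section:

```python
# Counts taken from this page: 65 AUX types, 29 AUX lemmas.
aux_types = 65
aux_lemmas = 29
ratio = aux_types / aux_lemmas
print(f"{ratio:.6f}")

# Forms observed for the three lemmas with the most forms, as listed above.
forms_by_lemma = {
    "是": ["不是", "且是", "也是", "亦是", "仍是", "便是", "则是", "却是",
           "又是", "只是", "就是", "或是", "才是", "是", "是否", "是否是",
           "更是", "正是", "而是", "还是", "都是"],
    "为": ["为", "亦为", "以为", "则为", "更为", "认为"],
    "能": ["不能", "未能", "没能", "能"],
}

# Rank lemmas by how many distinct forms they have, highest first.
ranked = sorted(forms_by_lemma.items(), key=lambda kv: len(kv[1]), reverse=True)
for lemma, forms in ranked:
    print(lemma, len(forms))
```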
AUX occurs with 3 features: Aspect (955; 25% instances), Voice (425; 11% instances), Polarity (112; 3% instances)
AUX occurs with 4 feature-value pairs: Aspect=Perf, Aspect=Prog, Polarity=Neg, Voice=Pass
AUX occurs with 5 feature combinations.
The most frequent feature combination is _, that is, no features (2400 tokens).
Examples: 是、 为、 会、 可以、 可、 也是、 能、 要、 可能、 就是
Relations
AUX nodes are attached to their parents using 8 different relations: aux (1827; 47% instances), cop (1630; 42% instances), aux:pass (425; 11% instances), conj (3; 0% instances), xcomp (3; 0% instances), ccomp (2; 0% instances), acl:relcl (1; 0% instances), root (1; 0% instances)
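The percentages in the relation list are each relation's share of all 3892 AUX tokens, rounded to a whole percent. A small sketch that recomputes them from the counts given above (the dictionary is transcribed from this page; the rounding rule is an assumption that happens to match the published figures):

```python
# Relation counts for AUX nodes, transcribed from the list above.
relation_counts = {
    "aux": 1827,
    "cop": 1630,
    "aux:pass": 425,
    "conj": 3,
    "xcomp": 3,
    "ccomp": 2,
    "acl:relcl": 1,
    "root": 1,
}

total = sum(relation_counts.values())  # should equal the AUX token count

# Share of each relation as a whole-number percentage.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

The counts sum to 3892, so every AUX token is covered by exactly one incoming relation.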
Parents of AUX nodes belong to 11 different parts of speech: VERB (2226; 57% instances), NOUN (1175; 30% instances), PART (248; 6% instances), NUM (93; 2% instances), ADJ (87; 2% instances), PROPN (40; 1% instances), X (16; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), PRON (1; 0% instances), (1; 0% instances)
3876 (100%) AUX nodes are leaves.
11 (0%) AUX nodes have one child.
2 (0%) AUX nodes have two children.
3 (0%) AUX nodes have three or more children.
The highest child degree of an AUX node is 5.
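The child degree of a node is the number of tokens whose HEAD points to it, and the leaf / one / two / three-or-more breakdown above is a bucketing of that value. A minimal sketch under stated assumptions: the `(id, upos, head)` row format, the two toy sentences and both function names are hypothetical, not part of the treebank tooling.

```python
from collections import Counter

def child_degrees(rows, upos):
    """Child degree of every token tagged `upos` in one sentence.

    `rows` is a list of (id, upos, head) tuples; head 0 marks the root.
    """
    children = Counter(head for _id, _upos, head in rows)
    return [children[tok_id] for tok_id, tok_upos, _head in rows if tok_upos == upos]

def bucket(degrees):
    """Group degrees into the four classes used in the statistics above."""
    labels = Counter()
    for d in degrees:
        if d == 0:
            labels["leaf"] += 1
        elif d == 1:
            labels["one"] += 1
        elif d == 2:
            labels["two"] += 1
        else:
            labels["three+"] += 1
    return labels

# Hypothetical sentences: in the first the AUX is a leaf, in the second it is the root.
SENT_1 = [(1, "PRON", 3), (2, "AUX", 3), (3, "NOUN", 0), (4, "PUNCT", 3)]
SENT_2 = [(1, "PRON", 2), (2, "AUX", 0), (3, "PUNCT", 2)]

degrees = child_degrees(SENT_1, "AUX") + child_degrees(SENT_2, "AUX")
print(bucket(degrees))
```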
Children of AUX nodes are attached using 11 different relations: punct (8; 30% instances), advmod (3; 11% instances), cc (3; 11% instances), conj (3; 11% instances), mark (3; 11% instances), nsubj (2; 7% instances), acl (1; 4% instances), advcl (1; 4% instances), csubj (1; 4% instances), mark:rel (1; 4% instances), parataxis (1; 4% instances)
Children of AUX nodes belong to 8 different parts of speech: PUNCT (8; 30% instances), SCONJ (4; 15% instances), VERB (4; 15% instances), ADV (3; 11% instances), AUX (3; 11% instances), CCONJ (3; 11% instances), NOUN (1; 4% instances), PRON (1; 4% instances)