Treebank Statistics: UD_Chinese-GSD: POS Tags: AUX
There are 29 AUX
lemmas (0%), 67 AUX
types (0%) and 3892 AUX
tokens (3%).
Out of 16 observed tags, the rank of AUX
is: 15 in number of lemmas, 11 in number of types and 9 in number of tokens.
The 10 most frequent AUX
lemmas: 是、 了、 為、 被、 會、 可以、 著、 可、 能、 要
The 10 most frequent AUX
types: 是、 了、 為、 被、 會、 可以、 著、 可、 也是、 能
The 10 most frequent ambiguous lemmas: 是 (AUX 1062, VERB 384), 了 (AUX 764, PART 44, VERB 2), 為 (VERB 609, AUX 581, ADP 131, PROPN 1), 會 (AUX 224, PART 137, NOUN 3), 著 (AUX 131, VERB 2), 可 (AUX 114, SCONJ 1), 能 (AUX 104, PART 8), 要 (AUX 68, VERB 7), 可能 (AUX 60, NOUN 9), 過 (AUX 60, VERB 7, PROPN 2, ADJ 1)
The 10 most frequent ambiguous types: 是 (AUX 884, VERB 322), 了 (AUX 764, PART 44, VERB 2), 為 (AUX 566, VERB 495, ADP 131, PROPN 1), 會 (AUX 200, PART 137, NOUN 3), 著 (AUX 131, VERB 2), 可 (AUX 106, SCONJ 1), 也是 (AUX 76, VERB 5, SCONJ 3), 能 (AUX 72, PART 8), 要 (AUX 67, VERB 7), 可能 (AUX 60, NOUN 9)
- 是
- 了
- 為
- 會
- 著
- 可
- 也是
- 能
- 要
- 可能
Morphology
The form / lemma ratio of AUX
is 2.310345 (the average of all parts of speech is 1.004819).
The 1st highest number of forms (21) was observed with the lemma “是”: 不是, 且是, 也是, 亦是, 仍是, 便是, 則是, 卻是, 又是, 只是, 就是, 或是, 才是, 是, 是否, 是否是, 更是, 正是, 而是, 還是, 都是.
The 2nd highest number of forms (8) was observed with the lemma “為”: 亦為, 以為, 以爲, 則為, 更為, 為, 爲, 認為.
The 3rd highest number of forms (4) was observed with the lemma “能”: 不能, 未能, 沒能, 能.
AUX
occurs with 3 features: Aspect (955; 25% instances), Voice (425; 11% instances), Polarity (112; 3% instances)
AUX
occurs with 4 feature-value pairs: Aspect=Perf
, Aspect=Prog
, Polarity=Neg
, Voice=Pass
AUX
occurs with 5 feature combinations.
The most frequent feature combination is _
(2400 tokens).
Examples: 是、 為、 會、 可以、 可、 也是、 能、 要、 可能、 就是
Relations
AUX
nodes are attached to their parents using 8 different relations: aux (1827; 47% instances), cop (1630; 42% instances), aux:pass (425; 11% instances), conj (3; 0% instances), xcomp (3; 0% instances), ccomp (2; 0% instances), acl:relcl (1; 0% instances), root (1; 0% instances)
Parents of AUX
nodes belong to 11 different parts of speech: VERB (2226; 57% instances), NOUN (1164; 30% instances), PART (247; 6% instances), NUM (93; 2% instances), ADJ (87; 2% instances), PROPN (51; 1% instances), X (17; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), PRON (1; 0% instances), (1; 0% instances)
3876 (100%) AUX
nodes are leaves.
11 (0%) AUX
nodes have one child.
2 (0%) AUX
nodes have two children.
3 (0%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 5.
Children of AUX
nodes are attached using 11 different relations: punct (8; 30% instances), advmod (3; 11% instances), cc (3; 11% instances), conj (3; 11% instances), mark (3; 11% instances), nsubj (2; 7% instances), acl (1; 4% instances), advcl (1; 4% instances), csubj (1; 4% instances), mark:rel (1; 4% instances), parataxis (1; 4% instances)
Children of AUX
nodes belong to 8 different parts of speech: PUNCT (8; 30% instances), SCONJ (4; 15% instances), VERB (4; 15% instances), ADV (3; 11% instances), AUX (3; 11% instances), CCONJ (3; 11% instances), NOUN (1; 4% instances), PRON (1; 4% instances)