Treebank Statistics: UD_Korean-KSL: POS Tags: NOUN
There are 6823 NOUN lemmas (38%), 6773 NOUN types (38%) and 19534 NOUN tokens (29%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: 수, 한국어+를, 것+이, 한국, 것+은, 사람+이, 사람+들+이, 것, 시간+이, 것+을
The 10 most frequent NOUN types: 수, 한국어를, 것이, 한국, 것은, 사람이, 사람들이, 것, 시간이, 것을
The 10 most frequent ambiguous lemmas: 수 (NOUN 829, NUM 1), 다음 (NOUN 43, ADV 12), 한국+의 (NOUN 40, ADJ 1), 반 (NOUN 38, NUM 10, ADV 9, ADP 1), 이번 (NOUN 38, DET 1), 친구 (NOUN 32, ADV 1), 마지막 (NOUN 24, ADV 2), 고향 (NOUN 23, ADV 3), 날 (NOUN 23, ADV 1), 말+이 (NOUN 20, ADV 1)
The 10 most frequent ambiguous types: 수 (NOUN 829, NUM 1, X 1), 한국어 (NOUN 79, VERB 1), 있기 (NOUN 44, AUX 7), 다음 (NOUN 43, ADV 12), 한국의 (NOUN 40, ADJ 1), 반 (NOUN 38, NUM 10, ADV 9, ADP 1), 이번 (NOUN 38, DET 1), 친구 (NOUN 32, ADV 1), 하기 (NOUN 30, AUX 1), 날 (NOUN 26, VERB 4, ADV 2)
Morphology
The form / lemma ratio of NOUN is 0.992672 (the average of all parts of speech is 1.004265).
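The reported ratio can be reproduced from the counts given at the top of this page. The sketch below assumes that "form / lemma ratio" means the number of distinct NOUN types divided by the number of distinct NOUN lemmas; that reading matches the published figure, but the function name `form_lemma_ratio` is only illustrative.

```python
# Counts copied from the NOUN summary at the top of this page.
noun_types = 6773   # distinct word forms tagged NOUN
noun_lemmas = 6823  # distinct lemmas tagged NOUN


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(noun_types, noun_lemmas)
print(f"{ratio:.6f}")  # prints 0.992672
```

A value below 1 is possible here because lemmas in this treebank keep their particles (for example 것+을), so several lemmas can exist for what would otherwise be one dictionary entry.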
The 1st highest number of forms (2) was observed with the lemma “것+을”: 걸, 것을.
The 2nd highest number of forms (2) was observed with the lemma “것+이”: 것이, 게.
The 3rd highest number of forms (2) was observed with the lemma “나+를”: 나를, 날.
NOUN does not occur with any features.
Relations
NOUN nodes are attached to their parents using 22 different relations: nsubj (7158; 37% instances), obj (5386; 28% instances), nmod (2139; 11% instances), obl (1545; 8% instances), nmod:poss (955; 5% instances), conj (584; 3% instances), dislocated (558; 3% instances), flat (400; 2% instances), list (267; 1% instances), root (97; 0% instances), advcl (94; 0% instances), case (88; 0% instances), acl (73; 0% instances), appos (43; 0% instances), vocative (41; 0% instances), compound (36; 0% instances), amod (20; 0% instances), ccomp (19; 0% instances), csubj (11; 0% instances), fixed (11; 0% instances), nummod (6; 0% instances), parataxis (3; 0% instances)
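As a consistency check, the relation counts above can be summed and converted to percentages. This is a minimal sketch that assumes each percentage is the relation count divided by the total number of NOUN tokens, rounded to the nearest integer; the variable names are illustrative.

```python
# Relation counts for NOUN nodes, copied from the list above.
relation_counts = {
    "nsubj": 7158, "obj": 5386, "nmod": 2139, "obl": 1545,
    "nmod:poss": 955, "conj": 584, "dislocated": 558, "flat": 400,
    "list": 267, "root": 97, "advcl": 94, "case": 88, "acl": 73,
    "appos": 43, "vocative": 41, "compound": 36, "amod": 20,
    "ccomp": 19, "csubj": 11, "fixed": 11, "nummod": 6, "parataxis": 3,
}

# Every NOUN token has exactly one incoming relation, so the counts
# should add up to the number of NOUN tokens.
total = sum(relation_counts.values())
print(total)  # prints 19534

# Share of each relation, rounded to whole percent (assumed rounding).
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}
for rel in ("nsubj", "obj", "nmod", "obl"):
    print(rel, shares[rel])
```

Relations listed with "0% instances" are simply those whose share rounds down to zero; root, for example, accounts for 97 of 19534 tokens.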
Parents of NOUN nodes belong to 13 different parts of speech: VERB (12462; 64% instances), NOUN (3986; 20% instances), ADJ (2045; 10% instances), ADV (806; 4% instances), [root, no POS tag] (97; 0% instances), ADP (81; 0% instances), NUM (27; 0% instances), PRON (15; 0% instances), PROPN (9; 0% instances), AUX (2; 0% instances), DET (2; 0% instances), CCONJ (1; 0% instances), X (1; 0% instances)
10262 (53%) NOUN nodes are leaves.
7803 (40%) NOUN nodes have one child.
1045 (5%) NOUN nodes have two children.
424 (2%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 9.
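The child-degree figures above can be derived from the HEAD column of a CoNLL-U file by counting how many tokens point at each node. The following is a minimal sketch on a toy sentence; the helper names `child_degrees` and `bucket` and the toy head list are hypothetical and not taken from the treebank. The last lines check that the four reported buckets cover all 19534 NOUN tokens.

```python
from collections import Counter


def child_degrees(heads):
    """Return the number of children of each token.

    heads[i] is the 1-based ID of the head of token i+1 (0 means root),
    following the CoNLL-U HEAD column convention.
    """
    counts = Counter(h for h in heads if h != 0)
    return [counts.get(i, 0) for i in range(1, len(heads) + 1)]


def bucket(degree):
    """Map a child degree to the four categories used in the statistics."""
    if degree == 0:
        return "leaf"
    if degree == 1:
        return "one child"
    if degree == 2:
        return "two children"
    return "three or more children"


# Toy 5-token tree (hypothetical): token 5 is the root.
heads = [2, 5, 4, 5, 0]
degrees = child_degrees(heads)
print(degrees)                          # prints [0, 1, 0, 1, 2]
print(Counter(bucket(d) for d in degrees))

# The reported NOUN buckets should sum to the number of NOUN tokens.
reported = {"leaf": 10262, "one child": 7803,
            "two children": 1045, "three or more children": 424}
print(sum(reported.values()))           # prints 19534
```

To restrict the count to NOUN nodes in a real file, one would keep only the degrees of tokens whose UPOS column is NOUN before bucketing.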
Children of NOUN nodes are attached using 30 different relations: acl (3103; 27% instances), nmod (1581; 14% instances), amod (1020; 9% instances), nmod:poss (972; 9% instances), case (754; 7% instances), det (681; 6% instances), conj (565; 5% instances), punct (528; 5% instances), flat (481; 4% instances), nsubj (385; 3% instances), obj (236; 2% instances), obl (207; 2% instances), list (193; 2% instances), advcl (156; 1% instances), advmod (156; 1% instances), nummod (142; 1% instances), appos (48; 0% instances), cc (27; 0% instances), aux (23; 0% instances), ccomp (22; 0% instances), dislocated (22; 0% instances), compound (20; 0% instances), goeswith (20; 0% instances), fixed (14; 0% instances), mark (13; 0% instances), vocative (5; 0% instances), cop (3; 0% instances), parataxis (2; 0% instances), csubj (1; 0% instances), dep (1; 0% instances)
Children of NOUN nodes belong to 14 different parts of speech: NOUN (3986; 35% instances), VERB (3460; 30% instances), ADJ (920; 8% instances), DET (685; 6% instances), ADP (614; 5% instances), ADV (614; 5% instances), PUNCT (528; 5% instances), PRON (351; 3% instances), NUM (162; 1% instances), AUX (27; 0% instances), X (20; 0% instances), PROPN (8; 0% instances), CCONJ (5; 0% instances), SYM (1; 0% instances)