Treebank Statistics: UD_Korean-KSL: POS Tags: PRON
There are 254 PRON lemmas (1%), 253 PRON types (1%) and 4522 PRON tokens (3%).
Out of 16 observed tags, PRON ranks 5th in number of lemmas, 6th in number of types and 7th in number of tokens.
The 10 most frequent PRON lemmas: 저+는, 우리, 나+는, 저+의, 내+가, 우리+는, 나+의, 제+가, 우리+가, 나+에게
The 10 most frequent PRON types: 저는, 우리, 나는, 내가, 제, 우리는, 제가, 내, 우리가, 나에게
The 10 most frequent ambiguous lemmas: 저+는 (PRON 748, VERB 2), 우리 (PRON 564, NOUN 11), 나+는 (PRON 533, VERB 11), 내+가 (PRON 302, NOUN 6), 자기 (NOUN 66, PRON 50), 나 (PRON 28, ADP 3, NOUN 3), 나+ㄴ (PRON 23, VERB 7), 누구+나 (PRON 23, ADV 1), 저 (PRON 16, DET 6, ADV 1), 그것+이 (PRON 12, NOUN 1)
The 10 most frequent ambiguous types: 저는 (PRON 747, VERB 2), 우리 (PRON 564, NOUN 11, ADV 1), 나는 (PRON 531, VERB 11), 내가 (PRON 302, NOUN 6), 제 (PRON 301, NOUN 3, NUM 1, PART 1), 내 (PRON 179, NOUN 1), 자기 (NOUN 66, PRON 50, VERB 3), 나 (PRON 28, ADP 3, NOUN 3, X 2), 난 (PRON 25, VERB 7, X 1), 누구나 (PRON 23, ADV 1)
Morphology
The form / lemma ratio of PRON is 0.996063 (the average of all parts of speech is 1.008073).
The highest number of forms (3) was observed with the lemma “저+의”: 재, 저의, 제.
The second highest number of forms (2) was observed with the lemma “그것+이”: 그것이, 그게.
The third highest number of forms (2) was observed with the lemma “나+는”: 나는, 난.
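The form / lemma ratio reported above can be reproduced from the counts at the top of the page. The sketch below assumes the ratio is the number of distinct PRON types divided by the number of distinct PRON lemmas; that reading is an assumption, although it does match the published figure. The variable names are illustrative only.

```python
# Counts copied from the PRON summary line of this page.
pron_types = 253   # distinct PRON word forms (types)
pron_lemmas = 254  # distinct PRON lemmas

# Assumed definition: forms (types) per lemma.
ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # prints 0.996063
```

A ratio below 1 means there are slightly fewer distinct forms than lemmas, which can happen when one surface form is analyzed under more than one lemma.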
PRON occurs with 2 features: PronType (4522; 100% instances), Typo (2; 0% instances)
PRON occurs with 4 feature-value pairs: PronType=Dem, PronType=Int, PronType=Prs, Typo=Yes
PRON occurs with 4 feature combinations.
The most frequent feature combination is PronType=Prs (4443 tokens).
Examples: 저는, 우리, 나는, 내가, 제, 우리는, 제가, 내, 우리가, 나에게
Relations
PRON nodes are attached to their parents using 15 different relations: nsubj (2539; 56% instances), nmod:poss (684; 15% instances), nmod (439; 10% instances), obl (404; 9% instances), dislocated (275; 6% instances), obj (154; 3% instances), advcl (5; 0% instances), flat (4; 0% instances), acl (3; 0% instances), appos (3; 0% instances), conj (3; 0% instances), vocative (3; 0% instances), amod (2; 0% instances), ccomp (2; 0% instances), compound (2; 0% instances)
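The percentages in the relation list above appear to be each count's share of all PRON tokens, rounded to the nearest whole percent. A minimal sketch of that calculation, using the counts from the list (the dictionary name is illustrative and the rounding rule is an assumption):

```python
# Incoming relation counts for PRON nodes, copied from the list above.
relation_counts = {
    "nsubj": 2539, "nmod:poss": 684, "nmod": 439, "obl": 404,
    "dislocated": 275, "obj": 154, "advcl": 5, "flat": 4,
    "acl": 3, "appos": 3, "conj": 3, "vocative": 3,
    "amod": 2, "ccomp": 2, "compound": 2,
}

# Every PRON token has exactly one incoming relation,
# so the counts should add up to the PRON token total.
total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent (assumed rounding).
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)            # 4522
print(shares["nsubj"])  # 56
```

Relations listed with 0% are not absent; their counts are simply too small to reach 1% after rounding.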
Parents of PRON nodes belong to 11 different parts of speech: VERB (2816; 62% instances), NOUN (862; 19% instances), ADJ (489; 11% instances), ADV (323; 7% instances), AUX (23; 1% instances), ADP (2; 0% instances), DET (2; 0% instances), PRON (2; 0% instances), INTJ (1; 0% instances), NUM (1; 0% instances), PART (1; 0% instances)
4407 (97%) PRON nodes are leaves.
106 (2%) PRON nodes have one child.
7 (0%) PRON nodes have two children.
2 (0%) PRON nodes have three or more children.
The highest child degree of a PRON node is 3.
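The child-degree figures above can be cross-checked against the token total and the number of children. Because the highest child degree is 3, the "three or more" bucket is treated here as exactly three children. This is a sketch with illustrative names, not the script that generated the page.

```python
# Number of PRON nodes by child count, copied from the lines above.
# The "three or more" bucket is taken as exactly 3, since the
# highest observed child degree is 3.
degree_counts = {0: 4407, 1: 106, 2: 7, 3: 2}

# All PRON nodes: should equal the PRON token count.
total_nodes = sum(degree_counts.values())

# Total number of children attached to PRON nodes.
total_children = sum(degree * n for degree, n in degree_counts.items())

# Share of leaves, rounded to a whole percent.
leaf_share = round(100 * degree_counts[0] / total_nodes)

print(total_nodes)     # 4522
print(total_children)  # 126
print(leaf_share)      # 97
```

The total of 126 children agrees with the sum of the child relation counts and the child part-of-speech counts listed next.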
Children of PRON nodes are attached using 16 different relations: conj (28; 22% instances), case (25; 20% instances), acl (20; 16% instances), nmod (12; 10% instances), flat (8; 6% instances), punct (7; 6% instances), amod (6; 5% instances), nmod:poss (4; 3% instances), det (3; 2% instances), nsubj (3; 2% instances), advcl (2; 2% instances), advmod (2; 2% instances), appos (2; 2% instances), list (2; 2% instances), goeswith (1; 1% instances), obj (1; 1% instances)
Children of PRON nodes belong to 10 different parts of speech: NOUN (49; 39% instances), ADP (25; 20% instances), VERB (16; 13% instances), ADV (10; 8% instances), ADJ (9; 7% instances), PUNCT (7; 6% instances), DET (5; 4% instances), AUX (2; 2% instances), PRON (2; 2% instances), X (1; 1% instances)