Treebank Statistics: UD_Naga-Suansu: POS Tags: PROPN
There are 64 PROPN lemmas (7%), 81 PROPN types (7%) and 260 PROPN tokens (8%).
Out of 16 observed tags, the rank of PROPN is: 4 in number of lemmas, 3 in number of types and 5 in number of tokens.
The 10 most frequent PROPN lemmas: Peter, Maria, Jim, Donovan, Mary, Bar, Bob, Doug, Lieutenant, Lynn
The 10 most frequent PROPN types: Peter, Mariadi, Peternan, Jim, Maria, Mary, Donovan, Bar, Bob, Doug
The 10 most frequent ambiguous lemmas: Peter (PROPN 119, NOUN 1), Donovan (PROPN 5, X 1), Lieutenant (PROPN 3, NOUN 1), Soviet (PROPN 3, NOUN 2), “,” (PUNCT 145, PROPN 1, X 1), CIA (NOUN 1, PROPN 1), Cowan (PROPN 1, X 1), States (NOUN 1, PROPN 1), United (NOUN 1, PROPN 1), Watters (PROPN 1, X 1)
The 10 most frequent ambiguous types: Donovan (PROPN 4, X 1), Lieutenant (PROPN 3, NOUN 1), Soviet (PROPN 3, NOUN 2), “,” (PUNCT 145, PROPN 1, X 1), CIAnahn (NOUN 1, PROPN 1), Cowan (PROPN 1, X 1), Peterdi (NOUN 1, PROPN 1), States (NOUN 1, PROPN 1), United (NOUN 1, PROPN 1), Watters (PROPN 1, X 1)
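As a rough illustration of how the ambiguity lists above could be derived, the following sketch groups (item, tag) observations and keeps only items seen with more than one tag. The function name `ambiguous_items` is hypothetical, and the toy counts for "Jim" are assumed for illustration; only the "Soviet" counts are taken from the statistics above.

```python
from collections import Counter, defaultdict

def ambiguous_items(observations):
    """Return items (lemmas or types) observed with more than one POS tag.

    `observations` is an iterable of (item, tag) pairs, one per token.
    """
    tag_counts = defaultdict(Counter)
    for item, tag in observations:
        tag_counts[item][tag] += 1
    # An item is ambiguous when at least two different tags were seen with it.
    return {item: dict(c) for item, c in tag_counts.items() if len(c) > 1}

# Toy input: "Soviet" uses the counts reported above (PROPN 3, NOUN 2);
# the two "Jim" tokens are an assumed, unambiguous example.
observations = (
    [("Soviet", "PROPN")] * 3
    + [("Soviet", "NOUN")] * 2
    + [("Jim", "PROPN")] * 2
)
ambiguous = ambiguous_items(observations)
```

Running the same function over lemmas and over word forms would yield the two separate lists, which is why "Peter" can be ambiguous as a lemma while only the form "Peterdi" is ambiguous as a type.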
Morphology
The form / lemma ratio of PROPN is 1.265625 (the average of all parts of speech is 1.348066).
The 1st highest number of forms (5) was observed with the lemma “Maria”: Maria, Mariadi, Mariala, Marianahn, Marianan.
The 2nd highest number of forms (5) was observed with the lemma “Peter”: Peter, Peterdi, Peternan, Peterva, Petervada.
The 3rd highest number of forms (3) was observed with the lemma “Union”: Union, Unionla, Unionnahn.
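The reported form / lemma ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas (81 / 64 = 1.265625). A minimal sketch of that calculation follows, assuming the ratio is defined as distinct forms over distinct lemmas; the helper names are hypothetical, and the toy pairs are the three lemmas listed above.

```python
from collections import defaultdict

def forms_per_lemma(pairs):
    """Map each lemma to the sorted list of distinct forms seen with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return {lemma: sorted(fs) for lemma, fs in forms.items()}

def form_lemma_ratio(pairs):
    """Number of distinct forms divided by number of distinct lemmas."""
    distinct_forms = {form for form, _ in pairs}
    distinct_lemmas = {lemma for _, lemma in pairs}
    return len(distinct_forms) / len(distinct_lemmas)

# (form, lemma) pairs for the three lemmas with the most forms.
pairs = [
    ("Maria", "Maria"), ("Mariadi", "Maria"), ("Mariala", "Maria"),
    ("Marianahn", "Maria"), ("Marianan", "Maria"),
    ("Peter", "Peter"), ("Peterdi", "Peter"), ("Peternan", "Peter"),
    ("Peterva", "Peter"), ("Petervada", "Peter"),
    ("Union", "Union"), ("Unionla", "Union"), ("Unionnahn", "Union"),
]
by_lemma = forms_per_lemma(pairs)
toy_ratio = form_lemma_ratio(pairs)   # 13 forms over 3 lemmas

# Treebank-wide figure from the counts reported above.
propn_ratio = 81 / 64
```

The toy ratio is much higher than the treebank-wide value because it covers only the three most inflected lemmas.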
PROPN occurs with 3 features: Number (256; 98% instances), Case (73; 28% instances), Abbr (1; 0% instances)
PROPN occurs with 11 feature-value pairs: Abbr=Yes, Case=Abl, Case=Ben, Case=Dat, Case=Erg, Case=Gen, Case=GenAbl, Case=Loc, Case=Top, Number=Plur, Number=Sing
PROPN occurs with 13 feature combinations.
The most frequent feature combination is Number=Sing (184 tokens).
Examples: Peter, Mariadi, Jim, Maria, Peternan, Donovan, Mary, Bar, Bob, Doug
Relations
PROPN nodes are attached to their parents using 14 different relations: nsubj (130; 50% instances), obj (39; 15% instances), flat:name (23; 9% instances), vocative (19; 7% instances), obl (11; 4% instances), flat (9; 3% instances), root (9; 3% instances), conj (4; 2% instances), iobj (4; 2% instances), nmod (4; 2% instances), nmod:poss (4; 2% instances), appos (2; 1% instances), compound (1; 0% instances), orphan (1; 0% instances)
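The percentages in the relation list can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of PROPN tokens, rounded to the nearest whole percent; `percent_shares` is a hypothetical helper name.

```python
def percent_shares(counts):
    """Convert raw counts to whole-number percentages of their total."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}

# Counts of incoming relations for PROPN nodes, as listed above.
relation_counts = {
    "nsubj": 130, "obj": 39, "flat:name": 23, "vocative": 19, "obl": 11,
    "flat": 9, "root": 9, "conj": 4, "iobj": 4, "nmod": 4, "nmod:poss": 4,
    "appos": 2, "compound": 1, "orphan": 1,
}
total_tokens = sum(relation_counts.values())
shares = percent_shares(relation_counts)
```

Under this rounding, a relation seen once among 260 tokens comes out as 0%, which matches the "0% instances" entries for compound and orphan.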
Parents of PROPN nodes belong to 8 different parts of speech: VERB (190; 73% instances), PROPN (32; 12% instances), NOUN (18; 7% instances), no part of speech, i.e. the artificial root (9; 3% instances), AUX (4; 2% instances), INTJ (4; 2% instances), ADJ (2; 1% instances), PRON (1; 0% instances)
212 (82%) PROPN nodes are leaves.
30 (12%) PROPN nodes have one child.
12 (5%) PROPN nodes have two children.
6 (2%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 5.
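The child-degree figures above can be obtained by counting, for every PROPN node, how many tokens name it as their head. The following is a sketch under the assumption of CoNLL-U style input, where each token has a 1-based head index and 0 marks the root; the function names and the four-token toy sentence are hypothetical.

```python
def child_counts(heads):
    """heads[i] is the 1-based head of token i+1; 0 marks the root.

    Returns the number of children of each token, in token order.
    """
    counts = [0] * len(heads)
    for head in heads:
        if head > 0:
            counts[head - 1] += 1
    return counts

def degree_buckets(heads, upos, tag="PROPN"):
    """Bucket the child counts of all nodes carrying the given POS tag."""
    buckets = {"leaf": 0, "one": 0, "two": 0, "three_or_more": 0}
    names = ("leaf", "one", "two")
    for n_children, pos in zip(child_counts(heads), upos):
        if pos == tag:
            key = names[n_children] if n_children < 3 else "three_or_more"
            buckets[key] += 1
    return buckets

# Hypothetical toy sentence: token 2 is the root verb.
heads = [2, 0, 2, 3]
upos = ["PROPN", "VERB", "PROPN", "PROPN"]
toy = degree_buckets(heads, upos)

# Figures reported above; they should add up to all 260 PROPN tokens.
reported = {"leaf": 212, "one": 30, "two": 12, "three_or_more": 6}
reported_total = sum(reported.values())
```

Summing the four reported buckets gives 260, the total number of PROPN tokens, so every PROPN node falls into exactly one bucket.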
Children of PROPN nodes are attached using 14 different relations: punct (28; 36% instances), flat:name (23; 29% instances), conj (5; 6% instances), flat (5; 6% instances), advmod (3; 4% instances), cc (3; 4% instances), appos (2; 3% instances), nsubj (2; 3% instances), orphan (2; 3% instances), advmod:emph (1; 1% instances), discourse (1; 1% instances), nmod (1; 1% instances), nummod (1; 1% instances), parataxis (1; 1% instances)
Children of PROPN nodes belong to 8 different parts of speech: PROPN (32; 41% instances), PUNCT (28; 36% instances), NOUN (7; 9% instances), ADV (4; 5% instances), CCONJ (3; 4% instances), VERB (2; 3% instances), INTJ (1; 1% instances), NUM (1; 1% instances)