Treebank Statistics: UD_English-EWT: POS Tags: PROPN
There are 4810 PROPN lemmas (26%), 4986 PROPN types (22%) and 16103 PROPN tokens (6%).
Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 8 in number of tokens.
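The lemma, type and token counts above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the names `read_tokens`, `tag_counts` and `SAMPLE` are hypothetical, the sample sentences are made up for illustration, and it assumes that a "type" is a distinct word form compared case-sensitively.

```python
# Tiny made-up CoNLL-U sample (10 tab-separated columns per token line).
SAMPLE = (
    "# text = Bush visited Iraq .\n"
    "1\tBush\tBush\tPROPN\tNNP\tNumber=Sing\t2\tnsubj\t_\t_\n"
    "2\tvisited\tvisit\tVERB\tVBD\t_\t0\troot\t_\t_\n"
    "3\tIraq\tIraq\tPROPN\tNNP\tNumber=Sing\t2\tobj\t_\t_\n"
    "4\t.\t.\tPUNCT\t.\t_\t2\tpunct\t_\t_\n"
    "\n"
    "# text = bush spoke .\n"
    "1\tbush\tBush\tPROPN\tNNP\tNumber=Sing\t2\tnsubj\t_\t_\n"
    "2\tspoke\tspeak\tVERB\tVBD\t_\t0\troot\t_\t_\n"
    "3\t.\t.\tPUNCT\t.\t_\t2\tpunct\t_\t_\n"
)


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for a tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: types are case-sensitive forms
            tokens += 1
    return len(lemmas), len(types), tokens


print(tag_counts(SAMPLE, "PROPN"))
```

On the sample, the forms "Bush" and "bush" count as two types but share one lemma, which is how a tag can end up with more types than lemmas.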
The 10 most frequent PROPN lemmas: Bush, US, al, Iraq, enron, State, Iran, China, September, Qaeda
The 10 most frequent PROPN types: bush, US, al, Iraq, enron, Iran, China, states, Qaeda, John
The 10 most frequent ambiguous lemmas: al (PROPN 67, X 1), enron (PROPN 7, NOUN 5), president (NOUN 30, PROPN 1), American (ADJ 88, PROPN 54), mark (NOUN 13, VERB 12, PROPN 2, X 1), North (PROPN 34, ADJ 2), god (PROPN 5, NOUN 4), south (NOUN 10, ADV 6, ADJ 1, PROPN 1), air (NOUN 34, PROPN 1), bay (PROPN 2, NOUN 1)
The 10 most frequent ambiguous types: al (PROPN 67, X 1), states (NOUN 10, PROPN 6, VERB 5), John (PROPN 75, X 4), president (NOUN 24, PROPN 1), may (AUX 221, PROPN 1), google (PROPN 3, VERB 2), Vince (PROPN 45, X 1), mark (NOUN 10, VERB 6, PROPN 2), Paul (PROPN 35, X 1), north (NOUN 6, ADV 5, ADJ 2, PROPN 2)
- may
  - AUX 221: Adobe Acrobat Reader 4.0 may be downloaded for FREE from www.adobe.com .
  - PROPN 1: Problem is , for some reason , the visa process took longer than it should , thus I missed school this semester ( visa was issued to me about 25 days after school started so I could n’t attend ) , now I no longer want to go into that school ( because they only would accept me again on September of 2012 ) , I found a school that accepted me for may 2012 , can I use the same visa that was issued to me ?
- north
  - NOUN 6: Bilboa on the north coast , Pamplona and the very famous Guernica .
  - ADV 5: There ‘s a Miramar in Florida , just north of Miami .
  - ADJ 2: There s a reason why Frank mcclelland was named best chef of the north east reigon .
  - PROPN 2: I have never been anywhere out side my home town Charlotte north Carolina please help !!!!
Morphology
The form / lemma ratio of PROPN is 1.036590 (the average of all parts of speech is 1.234270).
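The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page. The short check below assumes that this is how the figure is derived; the variable names are illustrative only.

```python
# Counts taken from the summary at the top of this page.
n_types = 4986   # distinct PROPN word forms
n_lemmas = 4810  # distinct PROPN lemmas

# Assumption: form / lemma ratio = types / lemmas.
ratio = n_types / n_lemmas
print(f"{ratio:.6f}")  # 1.036590
```

A ratio this close to 1 indicates that most proper nouns appear in a single form, in contrast to the average of 1.234270 over all parts of speech.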
The 1st highest number of forms (4) was observed with the lemma “Friday”: Fri, Fri., Fridays, friday.
The 2nd highest number of forms (4) was observed with the lemma “March”: MARCH, Mar, March, Marches.
The 3rd highest number of forms (4) was observed with the lemma “McDonald”: Mc.Donald, McDonal, mc, mcdonald.
PROPN occurs with 4 features: Number (16103; 100% instances), Abbr (121; 1% instances), Typo (27; 0% instances), Style (2; 0% instances)
PROPN occurs with 5 feature-value pairs: Abbr=Yes, Number=Plur, Number=Sing, Style=Expr, Typo=Yes
PROPN occurs with 7 feature combinations.
The most frequent feature combination is Number=Sing (15231 tokens).
Examples: bush, US, al, Iraq, enron, Iran, China, Qaeda, John, india
Relations
PROPN nodes are attached to their parents using 27 different relations: compound (3388; 21% instances), nsubj (2008; 12% instances), nmod (1979; 12% instances), flat (1849; 11% instances), obl (1811; 11% instances), root (1419; 9% instances), conj (1020; 6% instances), appos (640; 4% instances), obj (629; 4% instances), nmod:poss (482; 3% instances), list (200; 1% instances), vocative (131; 1% instances), nsubj:pass (114; 1% instances), iobj (76; 0% instances), xcomp (65; 0% instances), obl:tmod (64; 0% instances), parataxis (63; 0% instances), nmod:tmod (38; 0% instances), ccomp (32; 0% instances), advcl (27; 0% instances), nmod:npmod (25; 0% instances), obl:npmod (23; 0% instances), acl:relcl (8; 0% instances), acl (5; 0% instances), discourse (3; 0% instances), csubj (2; 0% instances), reparandum (2; 0% instances)
Parents of PROPN nodes belong to 14 different parts of speech: PROPN (5835; 36% instances), VERB (4251; 26% instances), NOUN (3948; 25% instances), no tag, i.e. the artificial root (1419; 9% instances), ADJ (368; 2% instances), ADV (106; 1% instances), PRON (79; 0% instances), NUM (45; 0% instances), INTJ (18; 0% instances), SYM (14; 0% instances), AUX (8; 0% instances), DET (7; 0% instances), X (3; 0% instances), ADP (2; 0% instances)
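Both distributions above (incoming relation and part of speech of the parent) can be tallied in one pass over a CoNLL-U file. The sketch below is illustrative rather than the page's actual generator: `sentences`, `attachment_stats` and `SAMPLE` are hypothetical names, the sample trees are made up, and the label "ROOT" for a missing parent is an assumption made here for readability.

```python
from collections import Counter

# Made-up sample: "John Smith visited Iraq ." plus a one-word sentence "Iraq".
SAMPLE = (
    "1\tJohn\tJohn\tPROPN\tNNP\tNumber=Sing\t3\tnsubj\t_\t_\n"
    "2\tSmith\tSmith\tPROPN\tNNP\tNumber=Sing\t1\tflat\t_\t_\n"
    "3\tvisited\tvisit\tVERB\tVBD\t_\t0\troot\t_\t_\n"
    "4\tIraq\tIraq\tPROPN\tNNP\tNumber=Sing\t3\tobj\t_\t_\n"
    "5\t.\t.\tPUNCT\t.\t_\t3\tpunct\t_\t_\n"
    "\n"
    "1\tIraq\tIraq\tPROPN\tNNP\tNumber=Sing\t0\troot\t_\t_\n"
)


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column token rows."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:  # a blank line ends the current sentence
                yield sent
                sent = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular tokens (no multiword ranges, no empty nodes).
        if len(cols) == 10 and cols[0].isdigit():
            sent.append(cols)
    if sent:
        yield sent


def attachment_stats(conllu_text, upos):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sent in sentences(conllu_text):
        tag_by_id = {cols[0]: cols[3] for cols in sent}
        for cols in sent:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            # Head "0" has no token row, so it falls back to "ROOT".
            parent_tags[tag_by_id.get(cols[6], "ROOT")] += 1
    return relations, parent_tags


relations, parent_tags = attachment_stats(SAMPLE, "PROPN")
print(relations.most_common())
print(parent_tags.most_common())
```

Because root-attached nodes have no parent token, their count in the parent distribution always equals the count of the `root` relation, as with the 1419 instances in both lists above.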
6654 (41%) PROPN nodes are leaves.
4561 (28%) PROPN nodes have one child.
2440 (15%) PROPN nodes have two children.
2448 (15%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 18.
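As a quick consistency check, the four child-degree buckets should add up to the total number of PROPN tokens (16103), and the percentages should follow from that total. The snippet below only uses numbers stated on this page; the dictionary keys are illustrative labels.

```python
# Child-degree buckets as reported above.
counts = {
    "leaf": 6654,
    "one child": 4561,
    "two children": 2440,
    "three or more": 2448,
}

total = sum(counts.values())
# Percentage of PROPN nodes in each bucket, rounded to whole percent.
shares = {name: round(100 * n / total) for name, n in counts.items()}

print(total)
print(shares)
```

The rounded shares sum to 99 rather than 100, which is an artifact of rounding each bucket separately.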
Children of PROPN nodes are attached using 36 different relations: case (4545; 23% instances), punct (2657; 14% instances), compound (2217; 11% instances), flat (1902; 10% instances), det (1463; 8% instances), conj (1170; 6% instances), amod (1132; 6% instances), cc (732; 4% instances), appos (630; 3% instances), nummod (597; 3% instances), list (573; 3% instances), nmod (517; 3% instances), cop (190; 1% instances), nsubj (179; 1% instances), advmod (149; 1% instances), acl:relcl (128; 1% instances), parataxis (125; 1% instances), nmod:poss (117; 1% instances), nmod:tmod (55; 0% instances), acl (47; 0% instances), mark (42; 0% instances), discourse (35; 0% instances), aux (30; 0% instances), cc:preconj (30; 0% instances), obl (24; 0% instances), nmod:npmod (20; 0% instances), advcl (11; 0% instances), expl (5; 0% instances), advcl:relcl (4; 0% instances), obl:tmod (4; 0% instances), vocative (4; 0% instances), orphan (3; 0% instances), reparandum (3; 0% instances), goeswith (2; 0% instances), det:predet (1; 0% instances), obl:npmod (1; 0% instances)
Children of PROPN nodes belong to 17 different parts of speech: PROPN (5835; 30% instances), ADP (3940; 20% instances), PUNCT (2657; 14% instances), DET (1469; 8% instances), ADJ (1136; 6% instances), NOUN (1048; 5% instances), NUM (865; 4% instances), CCONJ (724; 4% instances), PART (584; 3% instances), VERB (321; 2% instances), AUX (221; 1% instances), PRON (174; 1% instances), ADV (146; 1% instances), X (103; 1% instances), SYM (61; 0% instances), SCONJ (34; 0% instances), INTJ (26; 0% instances)