Treebank Statistics: UD_Dutch-LassySmall: POS Tags: PROPN
There are 7839 PROPN
lemmas (28%), 7998 PROPN
types (24%) and 30349 PROPN
tokens (10%).
Out of 16 observed tags, the rank of PROPN
is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.
The 10 most frequent PROPN
lemmas: van, de, België, Brussel, Duitsland, Vlaanderen, Wereldoorlog, Antwerpen, juni, Nederland
The 10 most frequent PROPN
types: van, de, België, Brussel, Duitsland, Wereldoorlog, Vlaanderen, Antwerpen, juni, Frankrijk
The 10 most frequent ambiguous lemmas: van (ADP 9299, PROPN 350), de (DET 18976, PROPN 197, X 41), Nederland (PROPN 153, X 30), II (PROPN 131, NUM 2, X 2), Prince (PROPN 120, X 1), Dylan (PROPN 113, X 1), Vlaams (ADJ 244, PROPN 98), november (PROPN 94, X 3), Che (PROPN 79, X 1), Hasselt (PROPN 79, X 1)
The 10 most frequent ambiguous types: van (ADP 9203, PROPN 350), de (DET 16356, PROPN 197, X 41), Wereldoorlog (PROPN 165, NOUN 1), II (PROPN 131, NUM 2, X 2), Verenigde (PROPN 118, VERB 4), Prince (PROPN 116, X 1), staten (NOUN 64, PROPN 1), Vlaams (PROPN 98, ADJ 44), Tweede (PROPN 96, ADJ 12), Dylan (PROPN 89, X 1)
- van
- de
- Wereldoorlog
- II
- Verenigde
- Prince
- staten
- NOUN 64: Hij zou de Potomac overtrekken , de staten Maryland en Pennsylvania in .
- PROPN 1: Ze bespaarde de Verenigde staten zowel de radicale decentralisatie die de anti-federalisten voorstonden , als ook de meer extreme conservatieve visie van mensen zoals Alexander Hamilton , die pleitten voor een sterke uitvoerende macht en een centrale regering .
- Vlaams
- Tweede
- Dylan
Morphology
The form / lemma ratio of PROPN
is 1.020283 (the average of all parts of speech is 1.223065).
The 1st highest number of forms (3) was observed with the lemma “Belga”: Belgae, belga, belga’s.
The 2nd highest number of forms (3) was observed with the lemma “België”: BELGIË, België, Belgiës.
The 3rd highest number of forms (3) was observed with the lemma “Bernini”: Berini, Bernini, Bernini’s.
PROPN
occurs with 3 features: Number (15871; 52% instances), Gender (14492; 48% instances), ExtPos (199; 1% instances)
PROPN
occurs with 7 feature-value pairs: ExtPos=ADP
, ExtPos=PROPN
, Gender=Com
, Gender=Com,Neut
, Gender=Neut
, Number=Plur
, Number=Sing
PROPN
occurs with 12 feature combinations.
The most frequent feature combination is _
(14478 tokens).
Examples: van, de, Wereldoorlog, II, Verenigde, staten, Tweede, Vlaams, Eerste, I
Relations
PROPN
nodes are attached to their parents using 23 different relations: flat (9715; 32% instances), nmod (5379; 18% instances), nsubj (3445; 11% instances), obl (2537; 8% instances), conj (2221; 7% instances), appos (2020; 7% instances), root (1574; 5% instances), parataxis (858; 3% instances), obj (718; 2% instances), obl:arg (460; 2% instances), nsubj:pass (440; 1% instances), nmod:poss (282; 1% instances), obl:agent (253; 1% instances), advcl (114; 0% instances), xcomp (114; 0% instances), iobj (100; 0% instances), acl (67; 0% instances), acl:relcl (20; 0% instances), orphan (17; 0% instances), ccomp (7; 0% instances), amod (5; 0% instances), nsubj:outer (2; 0% instances), csubj (1; 0% instances)
Parents of PROPN
nodes belong to 12 different parts of speech: PROPN (11524; 38% instances), NOUN (7539; 25% instances), VERB (7452; 25% instances), (1574; 5% instances), NUM (1374; 5% instances), ADJ (433; 1% instances), DET (111; 0% instances), PRON (105; 0% instances), X (96; 0% instances), ADV (85; 0% instances), ADP (37; 0% instances), SYM (19; 0% instances)
13570 (45%) PROPN
nodes are leaves.
6980 (23%) PROPN
nodes have one child.
5190 (17%) PROPN
nodes have two children.
4609 (15%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 30.
Children of PROPN
nodes are attached using 24 different relations: flat (8946; 25% instances), case (7739; 22% instances), punct (5163; 15% instances), det (4430; 13% instances), conj (2324; 7% instances), nmod (1621; 5% instances), cc (1283; 4% instances), amod (928; 3% instances), parataxis (533; 2% instances), appos (514; 1% instances), nummod (393; 1% instances), acl:relcl (329; 1% instances), mark (268; 1% instances), acl (240; 1% instances), nsubj (123; 0% instances), cop (122; 0% instances), advmod (44; 0% instances), nmod:poss (34; 0% instances), orphan (32; 0% instances), cc:preconj (26; 0% instances), obl (16; 0% instances), advcl (6; 0% instances), aux (5; 0% instances), ccomp (1; 0% instances)
Children of PROPN
nodes belong to 15 different parts of speech: PROPN (11524; 33% instances), ADP (7801; 22% instances), PUNCT (5163; 15% instances), DET (4500; 13% instances), NOUN (1556; 4% instances), CCONJ (1372; 4% instances), ADJ (711; 2% instances), NUM (705; 2% instances), VERB (555; 2% instances), ADV (337; 1% instances), SYM (314; 1% instances), SCONJ (267; 1% instances), AUX (127; 0% instances), X (108; 0% instances), PRON (80; 0% instances)