
This page pertains to UD version 2.

Treebank Statistics: UD_English-CTeTex: POS Tags: PROPN

There is 1 PROPN lemma (6%), and there are 152 PROPN types (7%) and 293 PROPN tokens (3%). Out of 17 observed tags, the rank of PROPN is: 12 in number of lemmas, 4 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: _ (the only PROPN lemma observed)

The 10 most frequent PROPN types: NPAC, HATS, TCS, ASPERA-3, AC-130U, MEX, NDE, EIRENE, FPMS, MC-130H

The 10 most frequent ambiguous lemmas: _ (NOUN 2649, PUNCT 1455, DET 936, ADP 781, VERB 721, ADJ 647, AUX 492, NUM 317, PROPN 293, CCONJ 267, ADV 185, PART 165, SCONJ 163, SYM 98, PRON 83, X 17, INTJ 4)

The 10 most frequent ambiguous types: VIL (PROPN 3, NOUN 1), SOA (NOUN 4, PROPN 2), FCP (NOUN 14, PROPN 1), OA (NOUN 4, PROPN 1), Shunting (NOUN 2, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 152.000000 (the average of all parts of speech is 125.235294).
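The ratio above matches the number of distinct PROPN forms divided by the number of distinct PROPN lemmas (152 / 1). The following is a minimal sketch of that calculation, not the script that produced this page. It assumes tokens are available as (form, lemma, upos) triples and that forms are compared case-sensitively; the function name `form_lemma_ratio` and the sample data are hypothetical.

```python
def form_lemma_ratio(tokens, upos):
    """Return distinct forms / distinct lemmas for one UPOS tag.

    tokens: iterable of (form, lemma, upos) triples (assumed input shape).
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)    # case-sensitive comparison is an assumption
            lemmas.add(lemma)
    if not lemmas:
        return 0.0             # tag does not occur in the data
    return len(forms) / len(lemmas)


# Hypothetical sample: every lemma is the placeholder "_".
sample = [
    ("NPAC", "_", "PROPN"),
    ("HATS", "_", "PROPN"),
    ("NPAC", "_", "PROPN"),
    ("system", "_", "NOUN"),
]
ratio = form_lemma_ratio(sample, "PROPN")
```

When every token carries the same placeholder lemma, as here, the ratio is simply the number of distinct forms for the tag.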

The highest number of forms (152) was observed with the lemma “_”: 100BaseT, AC-130U, ALQ-172, APAF, ASHRAE, ASPERA-3, C, CIWS, CMIP, CSCIs, DH, DTMF, DigitalHome, DoC, EDR, EIRENE, ESPC, ETCS, EVLA, F, FCP, FEA, FL-Lower, FL-Upper, FPMS, FTP, Fahrenheit, Fourier, GDMO, GPS, GSM-R, Glass, Google, HATS, HH681MPCC1, HVAC, IADE, IBIT, ICAO, IDE, IEEE, IP, ISA, ISO, Java, LTA, Linux, MC-130H, MEX, MIL-STD-464, MMC, MPCC01, MultiMahjongClient, MultiMahjongServer, NAS, NDE, NESDIS, NOAA, NPA, NPA-NXX, NPA-NXX-xxxx, NPAC, NPOESS, OA, OIW, OM, OMB, PCI, PHIN, Poisson, R6-28.1, R6-28.2, R6-29.1, R6-29.2, RR3-137.1, RR3-137.2, RR3-137.3, RR3-137.4, RVS, RVSM, Radstone, SDLC, SLDC, SOA, SQL, SRS015, SRS035, SRS043, SRS044, SRS064, SRS091, SRS093, SRS094, SRS095, SRS096, SRS097, SRS098, SRS101, SRS102, SRS104, SRS117, SRS173, SRS178, SRS181, SRS184, SRS187, SRS208, SRS215, SRS234, SRS245, SRS254, SRS256, SRS257, SRS275, SRS283, SRS284, SSS017, SSS024, SSS025, SSS026, SSS027, SSS028, SSS031, SSS037, SSS075, SSS083, SSS084, SSS085, SSS086, SSS087, SSS090, SSS222, SSS223, SSS224, SSS225, SSS542, SSS554, SV-423.3, SV-515, Shunting, Solaris, Sun, TCS, TGF, THEMAS, UDP, USSD, VIL, VxWorks, Windows, YD681MPCC1, zip.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 12 different relations: nmod (120; 41% instances), appos (61; 21% instances), compound (46; 16% instances), conj (21; 7% instances), nsubj (13; 4% instances), obj (8; 3% instances), obl (8; 3% instances), flat (4; 1% instances), nsubj:pass (4; 1% instances), obl:agent (4; 1% instances), amod (3; 1% instances), iobj (1; 0% instances)
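The percentages in the relation list can be reproduced from the raw counts, which sum to the 293 PROPN tokens. The sketch below does this arithmetic with the counts copied from the list above; rounding to the nearest whole percent is an assumption about how the page was generated.

```python
# Counts of incoming relations for PROPN nodes, copied from the list above.
relation_counts = {
    "nmod": 120, "appos": 61, "compound": 46, "conj": 21,
    "nsubj": 13, "obj": 8, "obl": 8, "flat": 4,
    "nsubj:pass": 4, "obl:agent": 4, "amod": 3, "iobj": 1,
}

total = sum(relation_counts.values())  # should equal the PROPN token count

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

Because each share is rounded independently, the rounded percentages need not add up to exactly 100.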

Parents of PROPN nodes belong to 5 different parts of speech: NOUN (213; 73% instances), VERB (49; 17% instances), PROPN (24; 8% instances), SYM (5; 2% instances), ADJ (2; 1% instances)

152 (52%) PROPN nodes are leaves.

50 (17%) PROPN nodes have one child.

69 (24%) PROPN nodes have two children.

22 (8%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.
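The four child-degree counts above (152 + 50 + 69 + 22) add up to the 293 PROPN tokens. A breakdown of this kind can be computed from the dependency heads of each sentence. The sketch below is one way to do it, assuming each token is given as an (id, head, upos) triple with 1-based ids and head 0 for the root, as in CoNLL-U; the function name `child_degree_histogram` and the sample sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, upos):
    """Count nodes of one UPOS tag by number of children.

    sentence: list of (id, head, upos) triples (assumed input shape).
    Returns a Counter with keys 0, 1, 2 and 3, where 3 means "three or more".
    """
    # Number of children per node: how often each id occurs as a head.
    n_children = Counter(head for _, head, _ in sentence)
    hist = Counter()
    for tok_id, _, tag in sentence:
        if tag == upos:
            hist[min(n_children[tok_id], 3)] += 1
    return hist


# Hypothetical sentence: token 2 is the root and has three children.
sentence = [
    (1, 2, "DET"),
    (2, 0, "PROPN"),
    (3, 2, "PUNCT"),
    (4, 2, "PROPN"),
]
hist = child_degree_histogram(sentence, "PROPN")
```

Summing these histograms over all sentences of a treebank would give per-tag totals comparable to the leaf, one-child, two-children and three-or-more counts listed above.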

Children of PROPN nodes are attached using 12 different relations: punct (132; 50% instances), conj (34; 13% instances), det (29; 11% instances), case (24; 9% instances), cc (17; 6% instances), nmod (8; 3% instances), flat (6; 2% instances), advmod (5; 2% instances), amod (5; 2% instances), appos (4; 2% instances), acl (1; 0% instances), acl:relcl (1; 0% instances)

Children of PROPN nodes belong to 11 different parts of speech: PUNCT (132; 50% instances), DET (29; 11% instances), ADP (24; 9% instances), PROPN (24; 9% instances), NOUN (22; 8% instances), CCONJ (13; 5% instances), NUM (7; 3% instances), ADJ (5; 2% instances), ADV (5; 2% instances), SYM (4; 2% instances), VERB (1; 0% instances)