The data are taken from Phonological Textual Sub-corpus. The domain is the phonological word
· Abbreviations and symbols
· Detailed description of the Phonological Corpus (including transcription)
| Length | Occ PhWto | Rat PhWto | Occ PhWty | Rat PhWty |
| 1 | 135221 | 0.05377 | 20 | 0.00005 |
| 2 | 114848 | 0.04567 | 208 | 0.00054 |
| 3 | 151070 | 0.06007 | 1382 | 0.00358 |
| 4 | 324535 | 0.12905 | 6471 | 0.01678 |
| 5 | 404569 | 0.16087 | 18015 | 0.04672 |
| 6 | 375889 | 0.14947 | 35016 | 0.09082 |
| 7 | 320649 | 0.12750 | 54045 | 0.14017 |
| 8 | 253850 | 0.10094 | 64945 | 0.16844 |
| 9 | 174862 | 0.06953 | 62324 | 0.16164 |
| 10 | 114268 | 0.04544 | 51576 | 0.13377 |
| 11 | 67158 | 0.02671 | 37063 | 0.09612 |
| 12 | 38168 | 0.01518 | 24153 | 0.06264 |
| 13 | 20164 | 0.00802 | 14449 | 0.03747 |
| 14 | 10484 | 0.00417 | 8099 | 0.02101 |
| 15 | 4932 | 0.00196 | 4134 | 0.01072 |
| 16 | 2439 | 0.00097 | 2122 | 0.00550 |
| 17 | 996 | 0.00040 | 893 | 0.00232 |
| 18 | 449 | 0.00018 | 408 | 0.00106 |
| 19 | 160 | 0.00006 | 147 | 0.00038 |
| 20 | 63 | 0.00002 | 58 | 0.00015 |
| 21 | 23 | <0.00001 | 23 | 0.00006 |
| 22 | 10 | <0.00001 | 9 | 0.00002 |
| 23 | 1 | <0.00001 | 1 | <0.00001 |
| 24 | 3 | <0.00001 | 3 | <0.00001 |
| 25 | 2 | <0.00001 | 2 | <0.00001 |
| 26 | 1 | <0.00001 | 1 | <0.00001 |
| 27 | 5 | <0.00001 | 1 | <0.00001 |
| 28 | 0 | 0 | 0 | 0 |
| 29 | 0 | 0 | 0 | 0 |
| 30 | 0 | 0 | 0 | 0 |
| 31 | 0 | 0 | 0 | 0 |
| 32 | 1 | <0.00001 | 1 | <0.00001 |
| 33 | 0 | 0 | 0 | 0 |
| 34 | 0 | 0 | 0 | 0 |
| 35 | 0 | 0 | 0 | 0 |
| 36 | 0 | 0 | 0 | 0 |
| 37 | 0 | 0 | 0 | 0 |
| 38 | 0 | 0 | 0 | 0 |
| 39 | 0 | 0 | 0 | 0 |
| 40 | 0 | 0 | 0 | 0 |
| 41 | 0 | 0 | 0 | 0 |
| 42 | 0 | 0 | 0 | 0 |
| 43 | 0 | 0 | 0 | 0 |
| 44 | 0 | 0 | 0 | 0 |
| 45 | 0 | 0 | 0 | 0 |
| 46 | 0 | 0 | 0 | 0 |
| 47 | 0 | 0 | 0 | 0 |
| 48 | 0 | 0 | 0 | 0 |
| 49 | 1 | <0.00001 | 1 | <0.00001 |
| Total | 2514821 | 1 | 385570 | 1 |