Pinyin Letter Frequency 拼音字母頻率

By Xah Lee. Date: . Last updated: .

Pinyin Letter Frequency

chinese pinyin letter frequency 20285
Pinyin letter frequency chart. 〔see Character Frequency Plot
i14.29
n11.24
a10.78
e8.20
u7.19
h6.73
g6.00
o5.40
d3.60
s3.35
z3.31
y3.15
j1.95
l1.78
w1.71
x1.66
t1.66
b1.64
m1.52
c1.08
k1.01
r0.92
q0.89
f0.44
p0.29
v0.08

Efficient Pinyin Keyboard Layout

Here we try to find out which keyboard layout is best for input Chinese with pinyin input method.

pinyin on qwerty heatmap 2024-04-28
pinyin on qwerty heatmap 2024-04-28
pinyin on dvorak heatmap 2024-04-28
pinyin on dvorak heatmap 2024-04-28
pinyin on colemak heatmap 2024-04-28
pinyin on colemak heatmap 2024-04-28

Pinyin Letter Frequency Problem, the Removal of V

There is a interesting issue about v and ü in Chinese pinyin. In pinyin, the letter v is not used, but you have ü. However, for pinyin input system, you have a hack of typing v for ü, because otherwise ü is hard to type.

on Microsoft Windows's pinyin input, u also do ü. But not on MacOS.

So, now there is a interesting question when you compile statistics of pinyin letter frequency. Given a piece of Chinese text, you can translate them into pinyin, then compute the letter frequency. In this way, you'll see zero use of v. However, this is not a proper stat for the purpose of keyboard layout, because, people do type v, while your stat no use of the key v.

To fix it, one needs to convert ü to v, then, compute the statistics. But this may not be readily done, because in order to do that, the software that convert chinese into pinyin will need to include tones to create ü.

But, this “error” isn't too bad. Because the character ü in pinyin does not occur frequently. I think mostly it's only used for the chars 女 綠.

Chinese input