PinHua - A Chinese Low-Conflict Romanization System

Goal

One-to-one mapping between a Chinese character and a code

Low conflict rate

Based on Pinyin system

Add two extra letters, derived from the four leading strokes, to distinguish characters with the same pronunciation

Choose one Pinyin for each character, using data sources A and B.

If a character is in data source B, choose the first Pinyin.

If a character is not in data source B and has a Pinyin with tone 5, choose the one with tone 5.

Otherwise, choose the first Pinyin in alphabetical order.
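
The three selection rules above can be sketched in Python. This is only an illustration under stated assumptions: each data source is assumed to be already loaded into a dict from a character to a list of readings, readings are assumed to be written with a trailing tone digit (such as "de5"), and the function name choose_pinyin is hypothetical. When several tone-5 readings exist, the sketch takes the alphabetically first one, which the rules do not specify.

```python
def choose_pinyin(char, source_a, source_b):
    """Pick one Pinyin reading for `char`.

    source_a, source_b: dicts mapping a character to a list of readings.
    Assumption: readings end in a tone digit 1-5, e.g. "hao3" or "de5".
    """
    # Rule 1: data source B wins, and its first listed reading is used.
    if char in source_b:
        return source_b[char][0]

    readings = source_a[char]

    # Rule 2: prefer a tone-5 reading if one exists.
    # Assumption: with several tone-5 readings, take the alphabetically first.
    tone5 = sorted(r for r in readings if r.endswith("5"))
    if tone5:
        return tone5[0]

    # Rule 3: fall back to the alphabetically first reading.
    return sorted(readings)[0]
```

For example, with made-up inputs, a character listed in source B takes B's first reading even when source A lists its readings in a different order.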

Pick the four leading strokes of each character, using data source C

Pad with zeros ("0") if a character has fewer than four strokes
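
A minimal sketch of the truncate-and-pad step, assuming data source C gives each character's strokes as a string with one code character per stroke. That encoding and the name leading_strokes are assumptions made for illustration.

```python
def leading_strokes(strokes, n=4, pad="0"):
    """Return the first `n` stroke codes, right-padded with `pad`.

    Assumption: `strokes` is a string with one code character per stroke.
    """
    # Keep at most n strokes, then pad on the right up to length n.
    return strokes[:n].ljust(n, pad)
```

With this sketch, a two-stroke input such as "12" becomes "1200", and a longer stroke string is cut to its first four codes.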

Convert the four strokes to two letters according to the table below

Combine the Pinyin and the two-letter suffix to generate the PinHua code
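
The final combination step might look like the sketch below. The stroke-to-letter table is not reproduced here, so the conversion is passed in as a caller-supplied function. The names make_pinhua and to_suffix are hypothetical, and the sketch assumes the Pinyin is used as given, with the suffix simply appended.

```python
from typing import Callable


def make_pinhua(pinyin: str, strokes4: str, to_suffix: Callable[[str], str]) -> str:
    """Append the two-letter stroke suffix to the chosen Pinyin.

    to_suffix: caller-supplied function implementing the stroke-to-letter
    table (hypothetical parameter; the real table is not encoded here).
    """
    if len(strokes4) != 4:
        raise ValueError("expected exactly four stroke codes (zero-padded)")

    suffix = to_suffix(strokes4)
    if len(suffix) != 2 or not suffix.isalpha():
        raise ValueError("the suffix must be exactly two letters")

    # Assumption: the Pinyin is kept unchanged and the suffix is appended.
    return pinyin + suffix


# Placeholder mapping for demonstration only; NOT the real PinHua table.
placeholder_table = {"3251": "xy"}
code = make_pinhua("de5", "3251", placeholder_table.__getitem__)
```

Passing the table in as a function keeps the sketch independent of how the real table is keyed, for example per stroke pair or per four-stroke group.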

A. Frequency and Pinyin of Chinese characters from http://lingua.mtsu.edu/chinese-computing/statistics/char/list.php?Which=MO (Modern Chinese Character Frequency List by Jun Da (jda@mtsu.edu))