Decomposition of Hanzi for writing practice

#1
I'm trying to create flascards (or a dictionary) as writing aid.
No Etymology, only a practical decomposition frequently used in schools.

Like source I used https://github.com/nieldlr/Hanzi/blob/master/lib/data/kradfile-u.txt file.
https://github.com/nieldlr/hanzi/blob/master/lib/data/kradfile-u.txt.js file.

For DecompositionOfCharacters.txt file (600 Characters HSK1...HSK3) a single record appear as
  • 这 zhè (辵) 这 : 亠乂辶 lid, bend, walk, [ Def.: this, the, here]
The definition part is: (Radical) Hanzi : decomposition description [ translation]

For decomposition7120.txt (7120 Characters) as:
  • 诟 gou4 讠厂一口 speech, cliff, one, mouth
The definition part is: decomposition description.

Not all is perfect: èr two is sometime missed...
Do you know some better files?
 

Attachments

Last edited:
#7
https://github.com/nieldlr/hanzi/blob/master/lib/data/kradfile-u.txt.js file.

For DecompositionOfCharacters.txt file (600 Characters HSK1...HSK3) a single record appear as
  • 这 zhè (辵) 这 : 亠乂辶 lid, bend, walk, [ Def.: this, the, here]
The definition part is: (Radical) Hanzi : decomposition description [ translation]

For decomposition7120.txt (7120 Characters) as:
  • 诟 gou4 讠厂一口 speech, cliff, one, mouth
Furio,

I've a couple of questions:

1/ How did you go from the kradfile-u.txt.js to your two decomposition.txt files? Did you do that by hand?

2/ For the 7120 character file, is there a reason for that number? The source files has over 13 thousand entries and I was wondering why you didn't include them all.

Thanks.
 
#8
1/ How did you go from the kradfile-u.txt.js to your two decomposition.txt files? Did you do that by hand?
I'm using three tools, using Notepad+ macro I transform the original
京 : 口 小 亠
replacing ":" with TAB
After, using Excel and Pleco I obtain the final result

2/ For the 7120 character file, is there a reason for that number? The source files has over 13 thousand entries and I was wondering why you didn't include them all.
I don't remember, perhaps an original file was shorter.
In effect for HSK we need only 2700 Hanzi : that was my initial goal.
 
Top