本帖最后由 tiger 于 4-15-2025 14:09 编辑
用了字典,速度很快,程序如下:
encoding = utf8是因为我把hlm.txt分成上中下三个文件的时候save成了utf8编码
wordLen = 3的时候都是秒出,Python的字典非常给力。wordLen = 5也只需要几秒钟。
[2902, '道:“']
[2247, ' ']
[1123, '\n ']
[1123, '\n\n ']
[944, '笑道:']
[586, '.\n\n']
[493, '了。”']
[427, ':“你']
[363, '。”宝']
[328, ':“我']
[321, '说道:']
[309, '王夫人']
[296, '”宝玉']
[279, ':“这']
[265, '.宝玉']
[250, '玉道:']
[247, '说:“']
[238, '。”贾']
[234, '。”说']
[224, ',只见']
[220, '林黛玉']
[206, '听了,']
[195, '说着,']
[192, '”\n\n']
[177, '”说着']
[176, '宝玉道']
[175, '老太太']
[171, '"宝玉']
[169, ',一面']
[160, '刘姥姥']
[159, '凤姐儿']
[155, '的。”']
[155, '呢。”']
[154, '去了.']
[152, '玉笑道']
[143, '。”\n']
[142, '人道:']
[137, '。”凤']
[137, '”凤姐']
[127, '宝玉笑']
[123, '罢。”']
[123, '出来,']
[122, '宝玉听']
[122, '听说,']
[119, '来了,']
[118, ',也不']
[113, '?"宝']
[112, ',宝玉']
[109, '起来,']
[106, ':“好']
|