论文部分内容阅读
【目的】研究《红楼梦》前八十回与后四十回的关系,从而判定《红楼梦》是否为一人所写。【方法】定量统计和定性分析相结合,比较前、中、后四十回的独有词;利用虚词、词及词类的N元文法模型、实词以及词长进行聚类;计算三个部分的相似度。【结果】证明前八十回与后四十回有差异。前八十回用词连贯性较高,更重视细节描写,长词较少,可读性更强;后四十回更重视动作和场景化描写,长词较多,可读性稍弱。【局限】仅限于词和N元文法,未能进一步考察语义、语篇等方面的特征。【结论】从词、词类、短语串和词类串等方面分析,前八十回与后四十回很可能并非一人所作。
【Objective】 To study the relationship between the first eighty backs and the last forty backs of “A Dream of Red Mansions” to determine if “A Dream of Red Mansions” is written by one person. 【Method】 Quantitative statistics and qualitative analysis were combined to compare the unique words in the first, middle and last 40 times. Clustering was performed by using N grammatical models, real words and word length of function words, words and parts of speech; Similarity. 【Result】 It is proved that there are differences between the first eighty backs and the last forty backs. The first eighty back words with higher consistency, pay more attention to detail description, less long words, more readable; after more than 40 times more emphasis on action and scene description, more words, readability weaker. [Limitations] is limited to words and N grammar, failed to further examine the characteristics of semantics, texts and other aspects. 【Conclusion】 From the aspects of words, parts of speech, phrase strings and word strings, it is probable that the first eighty and the last forty are probably not made by one person.