The frequency with which the letters of the English alphabet appear in writings has been applied to the field of cryptography, the development of keyboard mechanics, and the study of linguistics. We expanded on the statistical analysis of the English alphabet by examining the average frequency which each letter appears in different categories of writings. We evaluated news articles, novels, plays, scientific publications and calculated the frequency of each letter of the alphabet, the information density of each letter, and the overall letter distribution. Furthermore, we developed a metric known as distance, d that can be used to algorithmically recognize different categories of writings. The results of our study can be applied to information transmission, large data curation, and linguistics.
翻译:英文字母在文本中出现的频率已被应用于密码学、键盘力学开发及语言学研究领域。本研究通过考察不同文体类别中每个字母的平均出现频率,深化了对英文字母的统计分析。我们评估了新闻文章、小说、戏剧及科学出版物,计算了每个字母的出现频率、信息密度以及总体字母分布。此外,我们提出了一种称为"距离d"的度量指标,可基于算法识别不同文体类别。本研究结果可应用于信息传输、大数据整理及语言学领域。
Alphabet is mostly a collection of companies. This newer Google is a bit slimmed down, with the companies that are pretty far afield of our main internet products contained in Alphabet instead.https://abc.xyz/