TextDescriptives is a Python package for calculating a large variety of metrics from text. It is built on top of spaCy and can be easily integrated into existing workflows. The package has already been used for analysing the linguistic stability of clinical texts, creating features for predicting neuropsychiatric conditions, and analysing linguistic goals of primary school students. This paper describes the package and its features.
翻译:TextDescriptives是一个用于从文本中计算多种指标的Python包。它基于spaCy构建,可轻松集成到现有的工作流程中。该包已被用于分析临床文本的语言稳定性、为预测神经精神疾病创建特征,以及分析小学生的语言目标。本文对该包及其功能进行了描述。