BlaBla: Linguistic Feature Extraction for Clinical Analysis in Multiple Languages


At the heart of BlaBla are the DocumentProcessor and Document classes. To process a piece of input text, import the DocumentProcessor class as shown in the code below.

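from bla_bla.document_processor import DocumentProcessor
# Open a processor with the Stanza config file and the language code ("en").
with DocumentProcessor("stanza_config/stanza_config.yaml", "en") as doc_proc:
    content = "The picture shows a boy walking to the kitchen to pick a cookie from the cookie jar."
    # Analyze the raw string; this returns a Document object.
    doc = doc_proc.analyze(content, "string")
    # Compute the requested feature on the analyzed document.
    res_json = doc.compute_features("noun_rate")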

Under the hood, the DocumentProcessor object has an analyze method that returns a Document object, which can then be used to compute features.
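
The processor/document split described above can be illustrated with a small, self-contained sketch. This is not BlaBla's implementation: ToyDocumentProcessor, ToyDocument, and the features num_tokens and mean_token_length are hypothetical names invented for illustration, whitespace splitting stands in for a real NLP pipeline, and the JSON return value is an assumption suggested only by the res_json variable name in the example.

```python
import json


class ToyDocument:
    """Hypothetical stand-in for a Document; not BlaBla's real class."""

    def __init__(self, tokens):
        self.tokens = tokens

    def compute_features(self, feature_names):
        # Map each (hypothetical) feature name to a toy computation.
        registry = {
            "num_tokens": lambda: len(self.tokens),
            "mean_token_length": lambda: (
                sum(len(t) for t in self.tokens) / max(len(self.tokens), 1)
            ),
        }
        # Assumption: results are returned as a JSON string.
        return json.dumps({name: registry[name]() for name in feature_names})


class ToyDocumentProcessor:
    """Hypothetical stand-in for a DocumentProcessor used as a context manager."""

    def __init__(self, lang):
        self.lang = lang
        self.pipeline = None

    def __enter__(self):
        # A real processor would load its NLP pipeline here;
        # this sketch just splits on whitespace.
        self.pipeline = str.split
        return self

    def __exit__(self, exc_type, exc, tb):
        # Release the pipeline when the with-block ends.
        self.pipeline = None
        return False

    def analyze(self, content):
        # Run the pipeline once and wrap the result in a document object.
        return ToyDocument(self.pipeline(content))


with ToyDocumentProcessor("en") as proc:
    doc = proc.analyze("The boy walks to the kitchen")
    result = json.loads(doc.compute_features(["num_tokens"]))
```

The point of the split is that the expensive setup lives in the processor and happens once per with-block, while each analyzed document can be queried for features as many times as needed.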

Features Table