Reference

pybiber utility functions

Read in and prepare data

CorpusProcessor Main class that orchestrates corpus processing pipeline.
corpus_from_folder Import all text files from a directory.
get_text_paths Get a list of full paths for all text files.
readtext Import all text files from a list of paths.
spacy_parse Parse a corpus (legacy public API).
get_noun_phrases Extract expanded noun phrases using the ‘en_core_web_sm’ model.
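
As a rough illustration of the read-in step that get_text_paths and readtext describe, the sketch below uses only the Python standard library to collect .txt paths and load them into records. It is not pybiber's implementation: the helper names collect_txt_paths and read_texts, and the doc_id/text record layout, are assumptions made for this example.

```python
import tempfile
from pathlib import Path


def collect_txt_paths(directory):
    """Return sorted full paths of the .txt files directly inside `directory`."""
    return sorted(str(p.resolve()) for p in Path(directory).glob("*.txt"))


def read_texts(paths):
    """Read each file into a record keyed by file name (hypothetical layout)."""
    return [
        {"doc_id": Path(p).name, "text": Path(p).read_text(encoding="utf-8")}
        for p in paths
    ]


# Build a throwaway corpus so the example runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    Path(tmp, "a.txt").write_text("First document.", encoding="utf-8")
    Path(tmp, "b.txt").write_text("Second document.", encoding="utf-8")
    Path(tmp, "notes.md").write_text("Not a text file.", encoding="utf-8")

    # Only the two .txt files are picked up; notes.md is skipped.
    corpus = read_texts(collect_txt_paths(tmp))

for row in corpus:
    print(row["doc_id"], "->", row["text"])
```

Keeping path discovery separate from reading mirrors the split between the two listed functions: the path list can be filtered or sampled before any file is opened.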

pybiber pipeline

High-level orchestration wrappers

PybiberPipeline End-to-end convenience wrapper for common pybiber workflows.
run_biber_from_folder One-liner: read -> parse -> biber() from a folder of .txt files.
run_biber One-liner: parse -> biber() from an in-memory corpus DataFrame.

pybiber parse

Generate a biber document-feature matrix

biber Extract Biber features from a parsed corpus.

pybiber methods

Analyze a biber document-feature matrix

mda Execute Biber’s multi-dimensional analysis.
mda_biber Project results onto Biber’s dimensions.
pca Execute principal component analysis.
mdaviz_screeplot Generate a scree plot for determining factors.
mdaviz_groupmeans Generate a stick plot of the group means for a factor.
pcaviz_groupmeans Generate a scatter plot of the group means along 2 components.
pcaviz_contrib Generate a bar plot of variable contributions to a component.
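
The group-mean plots listed above summarize each category of documents by its average score on a factor or component. The following is a minimal sketch of that aggregation in plain Python, using made-up scores and a hypothetical helper named group_means. It does not call pybiber and does not draw a plot.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (group, dimension score) pairs, one per document.
scores = [
    ("academic", 4.0), ("academic", 6.0),
    ("fiction", -3.0), ("fiction", -1.0),
    ("news", 0.5), ("news", 1.5),
]


def group_means(pairs):
    """Average the scores within each group label."""
    grouped = defaultdict(list)
    for group, score in pairs:
        grouped[group].append(score)
    return {group: mean(values) for group, values in grouped.items()}


means = group_means(scores)

# List groups from highest to lowest mean, as they would be ordered along the axis.
for group, value in sorted(means.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{group:>10} {value:+.2f}")
```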