Reference
pybiber utility functions
Read in and prepare data
CorpusProcessor | Main class that orchestrates corpus processing pipeline. |
corpus_from_folder | Import all text files from a directory. |
get_text_paths | Get a list of full paths for all text files. |
readtext | Import all text files from a list of paths. |
spacy_parse | Parse a corpus (legacy public API). |
get_noun_phrases | Extract expanded noun phrases using the ‘en_core_web_sm’ model. |
pybiber pipeline
High-level orchestration wrappers
PybiberPipeline | End-to-end convenience wrapper for common pybiber workflows. |
run_biber_from_folder | One-liner: read -> parse -> biber() from a folder of .txt files. |
run_biber | One-liner: parse -> biber() from an in-memory corpus DataFrame. |
pybiber parse
Generate a biber document-feature matrix
biber | Extract Biber features from a parsed corpus. |
pybiber methods
Analyze a biber document-feature matrix
mda | Execute Biber’s multi-dimensional anlaysis. |
mda_biber | Project results onto Biber’s dimensions. |
pca | Execute principal component analysis. |
mdaviz_screeplot | Generate a scree plot for determining factors. |
mdaviz_groupmeans | Generate a stick plot of the group means for a factor. |
pcaviz_groupmeans | Generate a scatter plot of the group means along 2 components. |
pcaviz_contrib | Generate a bar plot of variable contributions to a component. |