A Multidimensional Sentiment Analysis Approach to Understanding and Teaching About LLM-Generated Academic Writing
February 19, 2026
Writing instructors increasingly report an intuitive “feeling” that something is wrong with LLM-generated academic writing:
But this intuition leaves us helpless:
The reality:
Our approach:
Transform impressionistic observations → teachable knowledge
Using multivariate sentiment to make visible what experienced writers intuitively recognize
←
More Information Production
→
More Involved
←
More Information Production
→
More Involved
←
More Information Production
→
More Involved
←
More Information Production
→
More Involved
1. How do LLMs differ from human academic writing in emotional expression?
2. How do different LLM models compare?
Design:
Parallel construction:
Why this matters: More comparable than arbitrary human vs. AI samples
Dictionary-based stance/engagement studies:
Multivariate sentiment analysis:
| Category | Emotions |
|---|---|
| Positive | admiration, amusement, approval, caring, desire, excitement, gratitude, joy, love, optimism, pride, relief |
| Neutral | curiosity, confusion, neutral, realization, surprise |
| Negative | anger, annoyance, disappointment, disapproval, disgust, embarrassment, fear, grief, nervousness, remorse, sadness |
BERT-based model trained on 58,000 human-annotated Reddit comments
Each sentence gets probability scores across all 28 categories (not mutually exclusive)
Limitation: Misses subtle stance markers
Human academic sentence (Hyland 2005):
“In the chaparral at least, low temperature episodes usually result in gradual freeze-thaw event.”
GoEmotions: 97% neutral
✗ Hedges and scope limiters read as neutral
Affordance: Captures overall tonal quality
GPT-4o academic sentence:
“Such an approach exemplifies a necessary shift that can significantly strengthen our collective resolve against the looming impacts…”
GoEmotions: 99% approval
✓ Evaluative language drives emotional valence despite hedge word
GoEmotions complements dictionary approaches: reveals register-level appropriateness beyond specific lexical choices
Human academic writing:
GPT-4o academic writing:
Human academic writer (hedged, scoped):
“International mobility may, in turn, have a positive impact on large distance collaborations as mobile inventors act as bridges across teams…”
GPT-4o (emphatic, broad scope):
“The marriage of these approaches fosters a more nuanced appreciation of the Earth’s complex systems, empowering us to address the pressing environmental and engineering challenges of our time.”
Both tagged as “approval” — but the register appropriateness differs dramatically
GPT-4o doesn’t just use more positive language — it expresses emphatic approval in contexts where human academic writers employ:
This mirrors common novice writing problems:
Rate of emotions relative to human usage (1 = human usage rate)
GPT-4o → GPT-5 Mini:
“The Day ChatGPT Went Cold” (Freedman 2025):
| Emotion | Human | Llama 3 70B | Gemma 2 27B | GPT-4o | GPT-5 Mini |
|---|---|---|---|---|---|
| neutral | 88.6% | 73.0% | 71.2% | 65.5% | 84.1% |
| approval | 8.3% | 20.1% | 21.5% | 31.7% | 16.3% |
| admiration | 1.0% | 2.8% | 3.5% | 5.5% | 0.8% |
| optimism | 0.5% | 2.4% | 3.0% | 4.0% | 1.4% |
| disapproval | 1.1% | 1.0% | 0.8% | 0.4% | 0.6% |
Different models = different emotional signatures = no universal “fix” for AI writing
Instead of: “This doesn’t sound right”
We can say: “GPT-4o uses 4x more approval language than human academic writers and expresses that approval with broader scope and less hedging”
This specificity helps students:
Students learn to recognize:
Not memorizing: “GPT-4o uses too much approval language”
But developing: General awareness of emotional appropriateness in academic writing
Writing analytics assignments:
Revision practices:
Contrastive rhetoric:
Why sentiment analysis particularly helps:
Result: More equitable access to academic discourse conventions
Sentiment analysis provides:
Complements traditional approaches:
What sentiment analysis doesn’t do:
What sentiment analysis does do:
LLMs produce measurably different emotional registers — especially in academic writing
Patterns vary across models and will continue to shift with updates
Need stable frameworks, not model-specific knowledge
Sentiment analysis provides accessible metalanguage for discussing register appropriateness
Equity matters: Explicit instruction benefits those who struggle most with tacit conventions
By quantifying the affective dimensions of academic writing, we transform:
We give students agency:
In ways that enhance — not limit — their academic development
Contact:
Resources: