Comparison first
Baseline Delta Board
Flags "AI pivot" as positive, misses going-concern sarcasm.
Reads the post as bearish and marks risk_flag true.
Best for proving fine-tuning value against a schema-prompted base model.
Hero comparison, metric deltas, side-by-side cases, then limitations.
Two large output panels labeled Base and Fin-Qwen with a hard-eval delta strip.
Clean vs hard eval chart, explanation of why hard cases matter, small caveat block.
Static same-input comparison for AI hype, dilution, leverage, and ticker ambiguity.
Hard F1, Clean F1, Clean/Hard MAE, Qwen3-8B, QLoRA, DeepSeek teacher.
Very clear in 20 seconds; strong recruiter readability.
Less space for pipeline depth unless the page gets longer.