Pilot data · Spring 2026

Platform-in-development. Cross-semester pilot signal.

Bio 227 A&P II at an open-enrollment Hispanic-Serving Institution. Same instructor taught the same course with and without the platform one year apart. 20 matched quizzes. 481 student reflections. Not seeded. Not simulated. Not yet published. This is preliminary evidence that the platform-in-development is producing measurable effects on student outcomes — the kind of signal that justifies committing to the Fall 2026 formal study under an approved IRB protocol.

+4.4
Cross-semester mean gain (pts)
80%
Matched quizzes improved (16/20)
+15.6
Peak chapter gain (Q17 Respiratory)
481
Student reflections coded
Cross-semester pilot signal — same instructor, one year apart

Bio 227 A&P II — 20 matched quizzes

Spring 2025 (baseline, pre-platform) vs Spring 2026 (pilot, platform deployed). Same instructor. Same course. Same quiz instrument.
Baseline (SP25)
84.6%
84.6%
Pilot (SP26)
89.0%
89.0%
+4.4 pts mean delta
16 / 20 quizzes improved
+15.6 pts on Q17 (Respiratory anchor)
The anchor case

Q17 covers respiratory structure and function — ventilation mechanics, spirometry, V/Q relationships. Baseline: 78.7% with 7 of 30 students below 70. Pilot: 94.3% with 1 of 28 below 70. A +15.6 point gain and a near-elimination of failure on the single chapter with the most mature platform deployment (v16.1 Section Synthesis, direction-inversion intervention cluster, and 121 student reflections — the largest reflection corpus of any chapter). The respiratory chapter is the pilot's strongest single signal and the pre-registration anchor for the Fall 2026 formal study.

Bio 227 — clean pilot signal

20 matched quizzes · SP25 → SP26
SP25 baseline
84.6%
SP26 pilot
89.0%
Δ +4.4 pts · 16/20 improved · gains on reasoning content

Bio 226 — the content-type boundary

8 matched quizzes · SP24 → SP26
SP24 baseline
83.9%
SP26 pilot
79.7%
Δ −4.2 pts · losses on pure-ID content (bones, muscle names)
What the two courses together reveal

The Bio 227 pilot signal concentrates on reasoning-structured content — physiology, mechanism, cascade, feedback loops, cause and effect. Every substantial gain is on content with an embedded normal-function → failure-mode → intervention structure. The Bio 226 result concentrates losses on pure anatomical identification — bone markings, muscle names, structural nomenclature. The scaffold is a diagnostic reasoning tool; it exploits content structure it wasn't built to substitute for recognition memory. This is not a failure of the scaffold — it is a boundary condition, and a stronger finding than a uniform result would be because it characterizes exactly where the platform applies and where a complementary tool is needed. Reporting both courses honestly is how pilot data earns a prospective study.

Platform evolution note

The platform's measurement layer evolved mid-semester (March 2026) from per-student in-class checkpoints to post-session transcript audit feeding a Post-Session Announcement workflow. Chapter-level quiz outcomes under the new measurement architecture were equal to or greater than outcomes under the prior architecture — the Spring 2026 digestive chapter (Q19–Q22) averaged +6.0 points over the Spring 2025 baseline on the newer instrument. The pilot is not a fixed-dose intervention; the platform is in active development, and the evidence tracks its iteration. Fall 2026 formal data collection runs under a single locked platform version.

What students say when the scaffold asks
30 : 1
Diagnostic precision ratio
For every student who says "this is hard," thirty name exactly what's hard. The scaffold produces diagnostic specificity, not hand-waving.

481 reflections, coded

Thematic analysis of student free responses across 9 chapters
60%
Specific concept
41%
Metacognitive gap
21%
Strategy selection
12%
Scaffold attribution
8%
Self-correction
6%
Confidence
2%
Vague

Accuracy by reflection type

Students who self-correct their own misconceptions score highest
Self-correction
96.7%
96.7%
Confidence
91.4%
91.4%
Strategy
90.1%
90.1%
Specific concept
89.9%
89.9%
Metacognitive gap
87.8%
87.8%
Insight

Students who wrote "I used to think X but actually Y" — identifying AND correcting their own misconception — scored 96.7% on corresponding quiz items. The act of articulating the error in writing appears to consolidate the correction. This is the teach-back method measured at scale.

From the reflections (de-identified)

Real student responses, organized by what the scaffold produced
Specific concept identification"The hardest part for me is keeping the excitation-contraction steps in the right order, especially how the signal moves from the neuron to the muscle."
Self-correction"I used to think it came from the same place as osteoblasts, but it actually comes from monocytes."
Specific concept identification"The r⁴ relationship is definitely the most challenging. I understand that changes in vessel radius affect resistance dramatically, but applying it to clinical scenarios is where I struggle."
Metacognitive gap"I had to refresh myself on this whole section. I remembered it and understand it but I wasn't as confident. I think after the lesson I'll be more confident."
Strategy selection"The memory anchor that helped me was the can little girls eat spinach mnemonic for the epidermal layers."
Self-correction"Before this practice, I was least confident about distinguishing between pacemaker cells and contractile cells, especially which ions are responsible for each phase."
Specific concept identification"Calculating residual volume from capacity formulas always got me, but I looked back at my old notes and I think I figured it out."
Metacognitive gap"I'm not confident in my responses so even if I have an understanding I have to study more on kind of remembering everything."

Reflection depth

How much students write (voluntarily)
Detailed (150+)
51.6%
Medium (50-150)
30.3%
Short (<50)
18.1%
Median: 153 chars · Mean: 204 chars

QuizPrep items

237 submissions · 60 items · 9 chapters
0
items below 60% accuracy
Overall: 88.6% · Lowest chapter: Ch10 Musculoskeletal (80.2%) · Highest: Ch18 Cardiovascular (92.5%)
The three-layer pipeline

SG → QuizPrep → Quiz: what happens at each layer

Performance aligned by chapter across in-class diagnostics, practice, and summative assessment
Chapter SG In-Class QuizPrep Quiz QP→Quiz Δ Pattern
227 Ch22 Respiratory 75.9% 85.5% 88.7% +3.3 PIPELINE WORKING
227 Ch23 Digestive 92.1% 91.8% -0.3 CALIBRATED
226 Ch05 Integumentary 91.4% 91.2% -0.2 CALIBRATED
226 Ch06 Bones 86.4% 86.4% 0.0 CALIBRATED
227 Ch19 Vessels 91.6% 88.5% -3.1 MONITOR
226 Ch07 Skeleton 89.5% 82.6% -6.9 QP → QUIZ DROP
226 Ch10 Muscular 80.2% 72.1% -8.1 QP → QUIZ DROP
227 Ch18 Cardiovascular 92.5% 83.0% -9.5 TOO SCAFFOLDED
What the pipeline reveals

The QuizPrep-to-quiz deltas cluster by content type, not by chapter. Calibrated or gaining: Ch22 Respiratory, Ch23 Digestive, Ch05 Integumentary, Ch06 Bones functional content — all chapters with embedded reasoning structure. Dropping: Ch10 Muscular (−8.1), Ch18 Cardiovascular structural quiz (−9.5), Ch07 Skeleton (−6.9) — all chapters dominated by pure anatomical identification where the quiz tests recall of nomenclature the practice item couldn't simulate. The pattern confirms what the cross-semester pilot signal already showed: the scaffold produces gains on reasoning-structured content and does not substitute for memorization of nomenclature. The "SG In-Class" column only carries Ch22 data because that chapter was the last one measured under the platform's original per-student checkpoint architecture — the measurement migrated to a post-session transcript-audit workflow in mid-March, and the digestive chapter's gains were captured under the new architecture.

Same scaffold — different domain
See the other domain
The same 15 components power a dental clinic deployment — bilingual patient app, practice analytics dashboard, no-show prediction, misconception detection. Different content. Same scaffold.
🦷 Dental Demo →
Pilot deployment outcomes · Spring 2026 · The instructional software shown here is part of the NinoTech CC Suite — community college institutional infrastructure that also includes Admin Eval, Assessment Hub, and Honors Hub · Architecture validated under IRB-approved protocol (IRB2026001) · Prospective data collection begins Fall 2026

Pilot signal. Platform in development. Two domains.

The dental demo shows what the component library builds. The education page shows what the instructional platform is producing during its pilot phase. Same components. Different content. Measurable pilot-stage outcomes. The formal study begins Fall 2026; the platform build continues now.

Start your conversation →