BU Researchers Will Test If This AI Reading Tutor Actually Works

BU researchers name Amira Learning winner of the EVAL Challenge. The move aims to validate AI literacy tools amid new "Science of Reading" mandates.

Wednesday, March 4, 2026

Schools across the country are racing to adopt artificial intelligence, but a critical question remains unanswered: does any of it actually help children learn? Boston University’s EVAL Industry Collaborative is stepping in to provide an answer.

The collaborative recently named Amira Learning the winner of its inaugural EdTech Evaluation Challenge. This victory guarantees the company a "gold-standard" independent study to verify whether its AI-powered reading tutor delivers real educational results for students.

What Happened

The EVAL Challenge was designed to identify promising AI educational tools and subject them to rigorous scientific testing. While generative AI is transforming classrooms, independent verification is rare. According to Ola Ozernov-Palchik, a research assistant professor at Boston University and director of the EVAL Collaborative, fewer than 10 percent of AI tools currently used in schools have undergone independent validation.

"That leaves educators, families, and policymakers uncertain about which products truly improve learning outcomes," Ozernov-Palchik said.

Amira Learning stood out among 32 applicants for its readiness to handle a large-scale study. The platform uses AI to listen to students read aloud, providing real-time feedback on decoding, fluency, and vocabulary. Rather than relying on internal company data, Amira will now undergo a randomized controlled trial (RCT) designed by a cross-disciplinary team of BU researchers. This team includes experts in computing, economics, and special education, ensuring the findings are statistically valid and unbiased.

The Bigger Picture

This push for evidence comes at a critical moment for literacy instruction. Massachusetts lawmakers are currently considering legislation (H 4683 and S 2940) that would mandate schools adopt curriculum aligned with the "Science of Reading." This approach emphasizes structured instruction in phonics, phonemic awareness, and fluency, moving away from debunked methods that encourage students to guess words based on pictures.

The need for reform is urgent. According to 2025 MCAS results, 68 percent of Massachusetts third graders are not proficient readers. The data is even more concerning for vulnerable populations; 79 percent of fourth-grade English learners did not meet expectations in English language arts.

Other states implementing similar reforms have seen measurable success. New Mexico recently passed the High-Quality Literacy Instruction Act after seeing a 10 percentage-point increase in reading proficiency since 2022. By pairing strict curriculum standards with validated tools, Massachusetts aims to replicate these gains.

The lack of validation in the EdTech market has led to costly failures in other districts. As The Learning Standard previously reported, the Los Angeles Unified School District faced an FBI investigation after a multimillion-dollar launch of an untested AI chatbot collapsed. The BU initiative seeks to prevent similar debacles by creating a trusted pipeline for proving which tools are safe and effective.

What This Means for Families

For parents, the distinction between "research-based" marketing and actual independent evidence is vital. Many apps claim to accelerate learning, but few have the data to back it up.

The BU study represents a shift toward transparency. If tools like Amira Learning can prove their worth in a controlled academic setting, it gives school districts a safer roadmap for investment. It also ensures that screen time in the classroom is spent on verified learning activities rather than flashy but ineffective gadgets.

"Implementation must be guided by rigorous research," Ozernov-Palchik said. "Schools deserve independent, gold-standard evidence about which tools genuinely improve outcomes."

What You Can Do

  • Ask for evidence: When your school introduces a new app, ask the administration if independent studies support its effectiveness.
  • Check the method: Look for tools that align with the Science of Reading, focusing on phonics and decoding rather than guessing strategies.
  • Monitor progress: If your child uses an AI tutor, track their reading fluency at home to see if skills translate offline.
Share: