Skip to content

Thoughts on an English Speaking Product Strategy

MasakiMu319

Spoken English is a critical language skill and essential for practical communication ability. However, most learning apps on the market still underperform in this area and fail to solve the core user pain point: How do we improve spoken English effectively?

Analysis of Existing Approaches

Training Institutions

Traditional spoken-English training usually follows this process:

  1. Heavy listening input: consume listening materials to build language input.
  2. Imitation practice: shadowing/repeating to train pronunciation and intonation.
  3. Free expression: topic-based speaking to improve flexible language usage.

This works, but still has weaknesses:

AI Speaking Apps

Most speaking apps currently provide:

Despite rich features, common problems remain:

Optimized Solution: A Closed-Loop Spoken English Learning Path

Based on methods used by training institutions and observations from domestic and international speaking products, we designed a closed-loop path to improve spoken English systematically.

Learning Flow

  1. Knowledge Instruction: systematically teach key grammar, vocabulary, and sentence patterns at a level slightly above the learner’s current ability.
  2. In-class Practice: convert taught knowledge and examples into Q&A exercises for targeted reinforcement (interleaved with instruction for better efficiency).
  3. Speaking Test: AI generates scenarios (for example, buying coffee) and guides users to actively use learned knowledge to complete specific tasks.
  4. Weakness Coaching: AI analyzes test performance, diagnoses weak areas, and delivers targeted reinforcement modules to close the loop.

image-20240813172858436

Theoretical Foundation

Knowledge Instruction

This part is grounded in Stephen Krashen’s Input hypothesis:

image-20240813175944034

If language models/teachers provide enough comprehensible input, structures that learners are ready to acquire will appear in that input. Krashen argues this can improve grammatical accuracy better than direct grammar instruction alone.

A simple real-world example: when people want to express traffic congestion, many say “There were many cars,” because they were never sufficiently exposed to expressions like “heavy traffic,” “traffic jam,” or “traffic congestion.”

Knowledge instruction is fundamentally an input process. By continuously providing slightly challenging but understandable content, learners stay engaged while accumulating enough input for subsequent output stages (in-class practice and speaking tests).

Speaking Test

This part is grounded in Merrill Swain’s Output hypothesis:

image-20240813181646852

Output hypothesis proposes that language acquisition/learning may occur through language production (spoken or written), because learners are more likely to notice gaps in their knowledge while producing output and learn while trying to fill those gaps.

In real conversations, people often pre-plan a sentence (for example, “Can I have a latte?”) but when speaking, they produce something incomplete like “This… Thank you.” Only at actual output time do they notice the gap.

AI speaking tests simulate realistic situations (shopping, ordering food, etc.), forcing active expression in context. Using taught content as explicit test goals both evaluates mastery and strengthens retention.

Weakness Coaching

This part is grounded in Biggs’ Reflective Learning:

image-20240814115210657

Reflective learning involves reviewing prior experiences and critically analyzing events. By examining both successful and unsuccessful aspects, learners convert surface learning into deeper learning while identifying gaps for improvement.

Existing AI speaking apps can generate reports and point out issues, but most stop at result presentation and do not effectively guide reflection and remediation.

That is why weakness coaching is the key node in our closed loop. AI analyzes conversation logs from speaking tests, diagnoses mastery of current learning targets, then provides focused drills that guide reflection and correction.

Example: while learning “be verbs,” if a learner uses wrong tense in dialogue, the system can immediately recommend a micro-lesson such as “present continuous (be doing)” plus targeted exercises.

Advantages

Compared with traditional training and current apps, this closed-loop approach offers:

  1. Systematic structure: fuller knowledge instruction with level-appropriate difficulty.
  2. Targeted practice: diversified practice/testing to expose and fix errors quickly.
  3. Real-world readiness: AI scenarios with real-time feedback improve communicative competence.
  4. Personalization: adaptive path and recommendations based on user level and goals.
  5. True loop closure: weakness coaching continuously reinforces weak points and drives iterative improvement.
Previous
0x01 - Revisiting OS